De-duplication of the Test Cases

Modified on Tue, 13 Aug, 2024 at 7:47 PM

TABLE OF CONTENTS

1. Overview

The deduplication feature uses NLP (Natural Language Processing) techniques to identify duplicate or similar test cases within a Feature File.

1.1 How It Works

Identification of Similar Test Cases: The feature compares each test case with all other test cases in the same feature file (Pairwise comparison).
Define Level of Similarity: You can specify the desired level of similarity between test cases by providing appropriate range. Only those test cases that satisfy this similarity range are identified.

Note: By default, the feature identifies duplicate scenarios even when a similarity range is specified.

2. Identifying Similar Test Cases

Perform the following:

Before using this feature, make sure you have generated two or more test cases for a particular automation process in the Test Cases folder.

Navigate to the Test Case generated folder in the Project Explorer pane.
Select 'De-duplicate' option to view the 'Deduplicate TestCases' pop-up window.
In this window, select the 'Duplicate Scenario' option (hover on the info icon to view brief description) and click the 'Generate Report' button.
Click the button to view the report in the PDF format.
If you have only one test case generated or none in the Test Case folder, then you will see the following error message:

The following table provides a sample report details:

Page 1: Overview of Test Cases and Duplicates
	In the sample report, you will find 24 test cases analyzed. Out of these, five duplicates have been identified. By addressing these duplicates, the number of unique test cases is reduced to 21, resulting in a decrease of three overall. Detailed Report Contents: Identified Duplicates: The report highlights pairs of test cases that are same. Revised Test Case List: After removing duplicates, a revised list of unique test cases is presented in a tabular format. The pie chart illustrates the comparison between the number of similar test cases and the revised number of test cases.

Page 1: Overview of Test Cases and Duplicates

In the sample report, you will find 24 test cases analyzed. Out of these, five duplicates have been identified. By addressing these duplicates, the number of unique test cases is reduced to 21, resulting in a decrease of three overall.

Detailed Report Contents:

Identified Duplicates: The report highlights pairs of test cases that are same.
Revised Test Case List: After removing duplicates, a revised list of unique test cases is presented in a tabular format.
The pie chart illustrates the comparison between the number of similar test cases and the revised number of test cases.

Page 2: Analysis of Duplicate Test Cases
	In this report, test cases are represented with tags during execution. For example, @test019 is identical to @test020 and @test023, and @test008 is identical to @test011. The revised list includes only @test008 and @test019 after removing all the duplicate entries.

3. Defining Similarity Range

In the 'Deduplicate TestCases' pop-up window, select the 'Enable Similarity Range' checkbox (hover on the info icon to view brief description).
Enter the required similarity range.
Click the 'Generate Report' button to view the report which includes identified duplicates (if any) and test cases in the specified similarity range. A sample report is attached for your reference.
Managing Large Feature Files:
If a Feature File is large, it divides into smaller Feature Files. In such cases, you can select single or multiple files for deduplication.
Perform the following:
1. Select 'Enable Similarity Range checkbox.
2. Select the appropriate Feature File or Files.
3. Enter the required similarity range.
4. Click 'Generate Report' button.

Attachments (1)

pdf

DeduplicateR....pdf
74.3 KB