Delete duplicate rows from your data

The Delete duplicate rows checks the selected columns, finds duplicate data and removes the second and subsequent duplicate data.

Step 1: Select the data from the left file tree or “files” tab which needs to manipulated.

Step 2: Once you select the file, it will be opened in the “Data Wrangler”.

Step 3: Click on the “Delete duplicate rows” and a popup will appear.

Step 4: Click on the columns in which you want to apply delete duplicate rows under “Select data”.

Step 5: After selecting the columns, every row of selected column of the file will be checked and if the combination of entries across all selected columns for that row has already occurred then the row is deleted.

Step 6: The default “Result” is to overwrite the original file with the updated file.

Select what you want to save as your output file from the “output” drop down menu.

If you want to keep the original file along with your updated file then, click on “Keep all columns” or “Only keep selected columns” under “create new file”. Click “run”. The new file will be saved in the same folder as original file.