Remove duplicate rows

The Remove duplicate rows step removes rows that have a duplicate value in the column(s) you choose.

Input/output

Our input data is a list of products.

After applying the Remove duplicate rows step and specifying the "Product" column to identify duplicates, the de-duplicated data is displayed in the table.

Settings

Pick the dataset.

To determine which duplicates (if any) to keep, select one of the following values from "Keep first x row(s)":

  • first: Drop duplicates except for the first occurrence.

  • last: Drop duplicates except for the last occurrence.

  • nothing: Drop all duplicates.
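The three options above can be illustrated with a minimal Python sketch. This is not the step's actual implementation. The function name remove_duplicate_rows, its parameters, and the sample product data are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def remove_duplicate_rows(rows, columns, keep="first"):
    """Hypothetical helper: drop rows whose values in `columns` repeat.

    rows    -- list of dicts, one dict per row
    columns -- column names used to identify duplicates
    keep    -- "first", "last", or "nothing" (mirrors the options above)
    """
    # One comparison key per row, built from the chosen columns only.
    keys = [tuple(row[c] for c in columns) for row in rows]

    if keep == "nothing":
        # Keep only rows whose key occurs exactly once.
        counts = Counter(keys)
        return [row for row, key in zip(rows, keys) if counts[key] == 1]

    if keep not in ("first", "last"):
        raise ValueError("keep must be 'first', 'last', or 'nothing'")

    # For "last", scan from the end so the final occurrence is seen first.
    pairs = list(zip(rows, keys))
    if keep == "last":
        pairs.reverse()

    seen = set()
    kept = []
    for row, key in pairs:
        if key not in seen:
            seen.add(key)
            kept.append(row)

    # Restore the original row order after a reversed scan.
    if keep == "last":
        kept.reverse()
    return kept


# Illustrative data (not taken from the product documentation).
products = [
    {"Product": "Apple", "Price": 1.0},
    {"Product": "Banana", "Price": 0.5},
    {"Product": "Apple", "Price": 1.2},
]

first = remove_duplicate_rows(products, ["Product"], keep="first")
last = remove_duplicate_rows(products, ["Product"], keep="last")
nothing = remove_duplicate_rows(products, ["Product"], keep="nothing")
```

With this sample data, "first" keeps the Apple row priced 1.0, "last" keeps the Apple row priced 1.2, and "nothing" leaves only the Banana row.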

In "Unique value columns", type the name or index of the column(s) used to identify duplicates.
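As a rough sketch of how column names and indexes could both identify duplicates, the Python example below resolves each reference to a column position and then compares rows on those positions only. The helper names resolve_columns and distinct_rows and the sample data are hypothetical. The sketch also assumes zero-based indexes, which this page does not specify.

```python
def resolve_columns(header, refs):
    """Hypothetical helper: turn column names or indexes into positions."""
    positions = []
    for ref in refs:
        if isinstance(ref, int):
            # Assumption: indexes are zero-based.
            if not 0 <= ref < len(header):
                raise IndexError(f"column index {ref} is out of range")
            positions.append(ref)
        else:
            # list.index raises ValueError for an unknown column name.
            positions.append(header.index(ref))
    return positions


def distinct_rows(header, rows, refs):
    """Keep the first row for each combination of values in the chosen columns."""
    positions = resolve_columns(header, refs)
    seen = set()
    kept = []
    for row in rows:
        key = tuple(row[p] for p in positions)
        if key not in seen:
            seen.add(key)
            kept.append(row)
    return kept


# Illustrative data (not taken from the product documentation).
header = ["Product", "Color", "Price"]
rows = [
    ["Shirt", "Red", 10],
    ["Shirt", "Blue", 12],
    ["Shirt", "Red", 11],
]

# One column: every row shares the same Product, so only the first is kept.
by_product = distinct_rows(header, rows, ["Product"])

# Two columns, one given by name and one by index: rows are duplicates
# only when both Product and Color match.
by_product_and_color = distinct_rows(header, rows, ["Product", 1])
```

Choosing more columns makes the match stricter: a row counts as a duplicate only when its values agree in every selected column.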
