Remove Duplicates
Removes duplicated rows
Last updated
Removes duplicated rows
Last updated
The Remove Duplicates Node allows you to identify and remove duplicate rows within your dataset. This node allows you to compare the entire rows or analyze specific columns to find duplicates.
There are two options for identifying duplicates:
By selecting this option, the node will compare the entire row to identify duplicates. When all columns in a row have the same values as another row, it will be considered a duplicate.
This option allows you to specify which columns should be analyzed for duplicates. You can choose one or more columns to compare for duplicates. When you have chosen the columns, the node will only consider rows as duplicates if the selected columns have the same values in both rows.