Remove duplicates
Hello everybody,
I hope somebody can help. I have a data set of about 50,000 rows and 2 columns (att1, att2). I want to remove duplicates in which (and only if) the value of att1(row1) is equal to the value of att2(row2) and the value of att2(row1) is equal to the value of att1(row2).
Example:
      att1  att2
row1  100   200
row2  200   100
So row2 will be eliminated. Does anybody have an idea or a solution?
Thanks so much!
Answers
Hi @ga34hox,
I think that the Lag Series operator from the Value Series extension can help you.
Does this process meet your needs? (I only tested it on a partial dataset.)
Regards,
Lionel
lionelderkrikor
Thanks for your answer. But I have not yet understood how this operator can help me. Can you explain in more detail?
Hi @ga34hox,
Have you tried importing and running the process I shared?
This operator shifts the data of a column.
To understand it better, set breakpoints on the different operators to see the individual transformations and calculations applied to the data.
Regards,
Lionel
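For anyone who wants to check the result outside RapidMiner, here is a minimal Python sketch of the filter described in the question. It is not the process shared above. The function name `remove_mirrored_rows` and the choice to keep the first row of each mirrored pair are assumptions made for illustration. Rows are treated as plain (att1, att2) tuples, and exact duplicates are left alone because the question asks to remove a row only when it mirrors another one.

```python
def remove_mirrored_rows(rows):
    """Drop each row whose (att1, att2) is the reverse of an earlier kept row."""
    kept = []
    seen = set()  # pairs taken from the rows kept so far
    for att1, att2 in rows:
        # Note: a repeated row with att1 == att2 is its own mirror,
        # so its second occurrence is dropped as well.
        if (att2, att1) in seen:
            continue  # mirrored copy of an earlier row: eliminate it
        seen.add((att1, att2))
        kept.append((att1, att2))
    return kept


rows = [(100, 200), (200, 100)]
print(remove_mirrored_rows(rows))  # [(100, 200)]
```

The set lookup keeps this to a single pass over the data, so 50,000 rows should not be a problem.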