Identify Duplicate examples


Hi,
I have a data set in which I want to identify duplicates (unlike Remove Duplicates, I want to keep only the duplicated examples).
For example, I have the data below:
Month Name Amount
Jul-15 John 10$
Aug-15 Alex 15$
Sep-15 John 5$
Jul-15 John 10$
If the above table is my input, then I want only the rows below in my results:
Month Name Amount
Jul-15 John 10$
Jul-15 John 10$
Best Answer
dr-connie-brett RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 4
Contributor I
If you don't actually need the duplicated examples themselves, but rather a count of how many times each one appears, this is how I would handle it:
1 - aggregate the table (Aggregate operator: group by all attributes and count on one of them)
2 - filter examples, keeping only those where count(attribute) > 1
Since there is no unique identifier being ignored here, I'm assuming you don't really need each duplicate repeated as many times as it appears, but it might be useful to know how many times each one appears!
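
For anyone who wants to check the logic outside of RapidMiner, here is a minimal sketch of the same two steps in plain Python. It is not a RapidMiner process, only an illustration: the function names count_duplicates and duplicated_rows are made up for this example, and the rows are the sample data from the question. The second function is an optional extra that returns every occurrence of a duplicated row, which is the output the original question asked for.

```python
from collections import Counter

# Sample rows copied from the question: (Month, Name, Amount)
rows = [
    ("Jul-15", "John", "10$"),
    ("Aug-15", "Alex", "15$"),
    ("Sep-15", "John", "5$"),
    ("Jul-15", "John", "10$"),
]


def count_duplicates(rows):
    """Hypothetical helper: group by all attributes, keep groups with count > 1."""
    counts = Counter(rows)  # step 1: aggregate, counting each distinct row
    # step 2: filter, keeping only rows that appear more than once
    return {row: n for row, n in counts.items() if n > 1}


def duplicated_rows(rows):
    """Hypothetical helper: every occurrence of a duplicated row, in input order."""
    dup = count_duplicates(rows)
    return [row for row in rows if row in dup]


dup_counts = count_duplicates(rows)
dup_rows = duplicated_rows(rows)

for month, name, amount in dup_rows:
    print(month, name, amount)
```

Comparing whole rows as tuples corresponds to grouping by all attributes; if one attribute were a unique ID, it would have to be left out of the tuple first.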
Answers
Hi... that was a good puzzle. I would do it this way:
Scott