The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

"Clustering files"

hunhun Member Posts: 1 Learner III
edited June 2019 in Help
Hello!

I have a question about clutering. I have some GPS data (the most important are longitude and latitude attribute). I would like to cluster the files, but not in rows.

E.g.: I have 10 csv files (contains lon and lat attributes with 1000-2000 rows). 8 routes are similar (home -> work) and 2 routes are different (home -> shop and work -> restaurant). I would like that: cluster0: the 8 similar routes, and cluster1: the others. Or cluster0: 8 similar, cluster1: other one, cluster2: other one.

Any idea, how can I do that?

Thanks in advance.

Adam
Tagged:

Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
    Hi Adam,

    i think you can simply calculate statistics per file using loop files and aggregate. Stuff like
    #Entries
    #Most common entry
    ...
    And then use clustering on those 8 examples. At least that would be my first guess.

    ~Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
Sign In or Register to comment.