The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Background information Sample datasets in RapidMiner Studio
Best Answers
-
sgenzer Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Managerhi @dannyV now that is a good question! I am always amazed on how few people ask about these kind of things.
Some of the sample data sets in RapidMiner come from the UCI (University of California Urvine) Machine Learning Repository. Iris is a good example: https://archive.ics.uci.edu/ml/datasets/Iris. Others, like Titanic, have been used in the field of data science so long that honestly I have no idea where the original source is (just tried googling for 5 min and kept being sent to Kaggle).
As @IngoRM was the one who likely inserted these back in the day, I'm tagging him for some insight here.
Scott
6 -
IngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM FounderYip, Scott covered this already. Most of those should be UCI data sets. If you are not finding the corresponding data set there, please ask for the specific one. I may remember the source :-)
5
Answers
I was looking for the SONAR dataset and I might have found it on UCI data sets..
Thank you!
Regards,
Danny