The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
New RapidMiner user seeking advice
Hello
I am greatly hoping that this isn't a silly question, I am just getting started with RapidMiner.
I have a a spreadsheet which contains 24 rows of information on a student engagement with a virtual learning environment. I want to be able to classify students into certain ability groups based on the data on the spreadsheet, which is complete for each user.
My question is, can i explicitly state i.e. if the student spends more than a certain about of time on the virtual learning environment, accesses a forum, gets more that 60% in a quiz, then classify the student as learner type A.. if so what is the best way of going about this i.e. decision tress / association rules.
Any help is appreciated
I am greatly hoping that this isn't a silly question, I am just getting started with RapidMiner.
I have a a spreadsheet which contains 24 rows of information on a student engagement with a virtual learning environment. I want to be able to classify students into certain ability groups based on the data on the spreadsheet, which is complete for each user.
My question is, can i explicitly state i.e. if the student spends more than a certain about of time on the virtual learning environment, accesses a forum, gets more that 60% in a quiz, then classify the student as learner type A.. if so what is the best way of going about this i.e. decision tress / association rules.
Any help is appreciated
0
Answers
first of all, 24 rows, i.e. 24 examples, is a very small learning base for any kind of machine learning algorithm, especially if you have a lot of columns, i.e. attributes. You probably won't be able to automatically train any (good) decision tree based on a data set that small.
To create a new attribute called Type which is set to certain values based on manual rules, you should have a look at the Generate Attributes operator. There you are able to specify e.g. rules with the syntax which will create a new attribute and set it to A if the condition matches and to B otherwise.
For an introduction to the general concepts of RapidMiner I'd like to direct you to the video tutorials on our website.
Best regards,
Marius
As a correction I am using 24 attributes, currently for testing with aprox 200 rows of data, which will increase to apox 1000 rows..
Would your solution still be advisable with the increase in data?
Best regards,
Marius