dataset and coding problem
bangbadabang
Member Posts: 6 Contributor II
Hi,
I have to cluster texts (course lectures) into different categories using hierarchical clustering.
My input data comes from a MySQL database.
First question: what format does my data have to be in?
I also need to connect RM to the system I work with (at a non-profit org), so that it is triggered to run automatically after a certain period.
But I'm having problems with the coding.
I have looked into the documentation, but it's really confusing.
Could you point me to where in the code, and with which functions, I can do the following:
load the data
pick the learner type
construct the output model
Thanks so much!
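For readers following along, the three steps asked about above (load the data, pick a learner, build the model) can be sketched in plain Python. This is only an illustration of the idea and does not use RapidMiner's API: the `rows` list is a hypothetical stand-in for the result of a MySQL query, and `vectorize`, `cosine_distance` and `agglomerative` are made-up helper names. It also suggests one plausible answer to the format question: one row per document, with an id and the raw text.

```python
import math
import re
from collections import Counter

# Step 1: load the data. Hypothetical rows standing in for a MySQL query
# result: one (id, lecture text) pair per document.
rows = [
    (1, "matrix algebra and linear equations"),
    (2, "linear algebra with matrix operations"),
    (3, "poetry and the modern novel"),
    (4, "the novel and modern poetry in translation"),
]


def vectorize(text):
    """Turn a text into a bag-of-words term-frequency vector."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_distance(a, b):
    """1 - cosine similarity of two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return 1.0 - dot / norm if norm else 1.0


def agglomerative(vectors, n_clusters):
    """Step 2: the 'learner'. Average-linkage agglomerative clustering.

    Starts with one cluster per document and repeatedly merges the two
    closest clusters until n_clusters remain. Returns lists of indices.
    """
    clusters = [[i] for i in range(len(vectors))]
    while len(clusters) > n_clusters:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                total = sum(
                    cosine_distance(vectors[p], vectors[q])
                    for p in clusters[i]
                    for q in clusters[j]
                )
                d = total / (len(clusters[i]) * len(clusters[j]))
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


# Step 3: build the output model, here simply the document ids per cluster.
vectors = [vectorize(text) for _, text in rows]
clusters = agglomerative(vectors, n_clusters=2)
labels = [[rows[i][0] for i in c] for c in clusters]
print(labels)
```

With this toy data the two maths lectures end up in one cluster and the two literature lectures in the other. A real setup would probably add stop-word removal and TF-IDF weighting before clustering.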
Answers
Cheers,
Simon
Sorry Fischer,
thanks a lot. I only just got to look at this post.
Now I'm not sure whether RM or Weka is easier to use. The deadline is closing in, though.
I think RM is quite complex. Just studying how to write the XML format alone takes some time.
I want to take input data (from any source for now), do the clustering (and be able to refer back to the model, so I need to keep the model somewhere, e.g. write it to a DB or a text file), test it, etc.
There might be some threshold issues, etc. From studying the tutorial, I only understand about the first half of the examples provided. How do you learn which type of input will be needed in the next phase?
and
<quote>use cron or whatever scheduler you will be using to trigger a call to RapidMiner <yourprocess.xml>. I</quote>
How?
One more thing: what is a model applier?
And which stop word languages does RM support?
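As a rough illustration of two ideas raised in this post, keeping a cluster model in a text file and later applying it to new data (which, in general terms, is what a "model applier" step does), here is a minimal plain-Python sketch. It is not RapidMiner's implementation: the `clustered` dictionary, the file name `cluster_model.json` and the helper names `vectorize`, `cosine_similarity` and `apply_model` are all assumptions made for the example.

```python
import json
import math
import os
import re
import tempfile
from collections import Counter


def vectorize(text):
    """Turn a text into a bag-of-words term-frequency vector."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity of two term-frequency mappings."""
    dot = sum(a[t] * b.get(t, 0) for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical clustering result: cluster name -> member texts.
clustered = {
    "cluster_0": [
        "matrix algebra and linear equations",
        "linear algebra with matrix operations",
    ],
    "cluster_1": [
        "poetry and the modern novel",
        "the novel and modern poetry in translation",
    ],
}

# The "model" here is one summed term-frequency vector per cluster.
model = {
    name: dict(sum((vectorize(t) for t in texts), Counter()))
    for name, texts in clustered.items()
}

# Keep the model somewhere: write it to a text (JSON) file.
path = os.path.join(tempfile.gettempdir(), "cluster_model.json")
with open(path, "w", encoding="utf-8") as fh:
    json.dump(model, fh)

# Later, possibly in another run: read the model back.
with open(path, encoding="utf-8") as fh:
    loaded = json.load(fh)


def apply_model(model, text):
    """Assign a new text to the cluster whose vector is most similar."""
    vec = vectorize(text)
    return max(model, key=lambda name: cosine_similarity(vec, model[name]))


print(apply_model(loaded, "an introduction to linear algebra"))
```

The same dictionary could just as well be stored in a database table with one row per (cluster, term, count); JSON is used here only because it keeps the sketch short.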