
"Some help when training a regression algorithm"

manwann Member Posts: 7 Contributor II
edited June 2019 in Help
Hi dear rapid-i community,

I am trying out RapidMiner modeling to build a content-based recommender system. To do that, I downloaded the MovieLens 100K dataset, which has information about movies and the ratings users gave them. The ratings range between 0 and 5, and the movies have genre information (action, comedy, etc.). I am training a classifier on the user with the most ratings (uid = 405; number of reviews = 737). To do that, I discretize the rating (good >= 3.5; bad < 3.5), but because this user has many more reviews labeled bad, the classifier (libSVM) predicts every label as bad.
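For readers following along outside RapidMiner, here is a minimal Python sketch of the discretization step and of why an imbalanced label can lead a classifier to predict only "bad". The `ratings` list is made-up illustration data, not user 405's actual MovieLens ratings, and the `discretize` helper is a hypothetical name.

```python
from collections import Counter

def discretize(rating, threshold=3.5):
    """Map a numeric rating to 'good' (>= threshold) or 'bad' (< threshold)."""
    return "good" if rating >= threshold else "bad"

# Hypothetical ratings for a single user (illustration only, not MovieLens data).
ratings = [1, 2, 2, 3, 3, 3, 4, 5, 2, 1]
labels = [discretize(r) for r in ratings]

# Count how many examples fall into each class.
counts = Counter(labels)

# Accuracy of a trivial model that always predicts the most frequent class.
majority_label, majority_count = counts.most_common(1)[0]
baseline_accuracy = majority_count / len(labels)

print(counts)
print(majority_label, baseline_accuracy)
```

If the always-predict-majority baseline already scores high, a classifier trained to maximize accuracy has little incentive to predict the minority class at all, which matches the behavior described above.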

Then I used another strategy: stratified sampling (http://rapid-i.com/rapidforum/index.php/topic,2190.0.html) to even out the class labels. I got the following results:

                            true bad                true good                class precision
pre.bad
pre.good
class recall
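The class-balancing step mentioned above can be sketched in plain Python as downsampling the majority class. This is an assumption about what "making the class labels even" amounts to, not a reproduction of the RapidMiner operator from the linked thread; `balance_by_downsampling` and the `rows` data are hypothetical.

```python
import random
from collections import defaultdict

def balance_by_downsampling(rows, label_of, seed=0):
    """Return a shuffled subset of rows with the same number of rows per class.

    Every class is randomly reduced to the size of the smallest class.
    """
    rng = random.Random(seed)  # fixed seed so the sample is reproducible

    # Group the rows by their class label.
    by_label = defaultdict(list)
    for row in rows:
        by_label[label_of(row)].append(row)

    # The smallest class determines how many rows each class keeps.
    n = min(len(group) for group in by_label.values())

    balanced = []
    for group in by_label.values():
        balanced.extend(rng.sample(group, n))
    rng.shuffle(balanced)
    return balanced

# Hypothetical (movie_id, label) pairs: 8 'bad' and 2 'good'.
rows = [("m%d" % i, "bad") for i in range(8)]
rows += [("m%d" % i, "good") for i in range(8, 10)]

balanced = balance_by_downsampling(rows, label_of=lambda row: row[1])
print(balanced)
```

Downsampling discards majority-class examples, so with a small minority class the training set can become very small. It is usually safer to balance only the training data and to evaluate on data that keeps the original class distribution.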

Answers

  • MariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    Please don't double post. Discussion continues here.