The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Classification of products
Hi there,
I have one problem to solve. I have where each row corresponds to a single product. There are a total of 93 numerical features, which represent counts of different events. There are nine categories for all products. My objective is to classify product into 9 different categories. There are 61878 examples. I tried libSVM, k-NN in rapidminer, but i got very high accuracy,about 99,9 so something isn't done well. Does anybody know how to solve this?
Tagged:
0
Answers
hello @994kaca welcome to the community! Some quick requests so we can help you:
• Post your XML process here in this thread (see this post for instructions on How to Post on the Community)
• Attach your dataset if possible (use a fictionalized version if there are privacy concerns)
• Make sure you have all necessary extensions installed (see https://youtu.be/pjBqG3xtXx4)
Scott
[Edit - I moved your post from Radoop to Getting Started as this is the more appropriate place for your query. SG]
Hi,
I don't have XML process, i can attach my dataset?
Dataset is too large for this message. Here is link where you can find training set for this problem. https://www.kaggle.com/c/otto-group-product-classification-challenge/data
hi @994kaca thank you for your reply. So if you do not have an XML, may I suggest that you start using RapidMiner and let us know whern you get stuck? At that point paste your process in this thread as XML and we can go from there.
Thanks!
Scott
Here you are. I didn't undrstand your message at first,sorry. The problem is that accuracy for this k-nn is 99,9 which is too high,and something is wrong, do you know where the problem is?
Thank you
ah thank you for the XML.
Beats the heck out of me how you get 99% accuracy. I get 71.49%.
Scott