
First steps of cleansing

WenkyWong Member Posts: 1 Learner I
edited November 2019 in Help
Hi,

For a school project I have to build a model that achieves the highest possible accuracy. I have received a train set and a test set from the professor, but I am wondering what the first steps of the data cleansing should be.

I've attached the train_set and test_set as well. The label variable will be the price_category.

Anyone?

Comments

  • lionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
    Hi @WenkyWong,

    As a starting point, I would use Turbo Prep

    Click on CLEANSE --> AUTO CLEANSING

    Turbo Prep will automatically clean your dataset for you.
    Then you can open the process RapidMiner has built for you and see the different operations RapidMiner has performed on your initial dataset.

    Then, I would use Auto-Model to train several models with your cleaned training dataset.

    Hope this helps,

    Regards,

    Lionel 
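
For readers who want to see what an automatic cleansing step of this kind might do under the hood, here is a minimal Python sketch. It is not the process Turbo Prep generates; the operations (dropping mostly-empty and constant columns, filling gaps with the median or the most frequent value) are assumed as typical first steps. Only the label name price_category comes from the question; the column names area, city, source and notes, the missing-value markers, and the 0.5 threshold are hypothetical.

```python
from collections import Counter
from statistics import median

MISSING = (None, "", "?")  # assumed markers for a missing value


def is_missing(value):
    return value in MISSING


def fit_cleansing(rows, label, max_missing_ratio=0.5):
    """Learn from the training rows which columns to keep and how to fill gaps.

    Returns a dict mapping each kept column to its fill value.
    """
    fills = {}
    for col in rows[0]:
        if col == label:
            continue
        present = [r[col] for r in rows if not is_missing(r[col])]
        missing_ratio = 1 - len(present) / len(rows)
        if missing_ratio > max_missing_ratio:
            continue  # drop columns that are mostly empty
        if len(set(present)) <= 1:
            continue  # drop constant columns, they carry no information
        if all(isinstance(v, (int, float)) for v in present):
            fills[col] = median(present)  # numeric: fill with the median
        else:
            fills[col] = Counter(present).most_common(1)[0][0]  # nominal: mode
    return fills


def apply_cleansing(rows, fills, label):
    """Apply the learned cleansing to any row set (train or test)."""
    cleaned = []
    for r in rows:
        if is_missing(r[label]):
            continue  # a row without a label cannot be used for training or scoring
        new_row = {c: (fill if is_missing(r[c]) else r[c]) for c, fill in fills.items()}
        new_row[label] = r[label]
        cleaned.append(new_row)
    return cleaned


# Tiny made-up example; real data would be loaded from train_set and test_set.
train = [
    {"area": 50, "city": "A", "source": "x", "notes": None, "price_category": "low"},
    {"area": None, "city": "B", "source": "x", "notes": None, "price_category": "high"},
    {"area": 70, "city": "A", "source": "x", "notes": "ok", "price_category": "low"},
    {"area": 90, "city": None, "source": "x", "notes": None, "price_category": "high"},
]
test = [
    {"area": None, "city": "B", "source": "x", "notes": None, "price_category": "low"},
]

fills = fit_cleansing(train, label="price_category")
cleaned_train = apply_cleansing(train, fills, label="price_category")
cleaned_test = apply_cleansing(test, fills, label="price_category")
```

The fill values are learned on the train set only and then reused on the test set, so that nothing from the test data leaks into the cleansing decisions.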