The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Automodel learn/test

DocMusherDocMusher Member Posts: 333 Unicorn
edited June 2019 in Help

Is there a good reason to split the data in 60/20/20% where the last 20% is used to "test the testing of the conclusion", as proposed in another platform?

Best Answer

  • varunm1varunm1 Member Posts: 1,207 Unicorn
    edited June 2019 Solution Accepted
    Hello @DocMusher

    Looks similar to RM automodel, where the data is split into 60:40 (train: test) but the 40% test data is again split into 7 hold out sets to test the model.

    We split the 40% again into 7 parts, evaluate the model on each part, get rid of the two extremes/outliers, and build the average of the rest.  This way we keep many of the benefits of a cross-validation without it's biggest drawback: 5x-10x runtime increases. (Explanation from @IngoRM)
    @IngoRM might add more here.

    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

Sign In or Register to comment.