Can I have help deciding what processes to run on this dataset?

WithSilver Member Posts: 1 Learner I
I'm new, so let me know if this is out of line for this forum.

I have a project to work on for an intro-level data analytics class, and I've basically been tasked to create a model myself employing whatever process I want from the following:

Association Rule Mining
k-Means Clustering
Linear Regression
Logistic Regression
Decision Trees
Neural Nets

For the COVID-19 dataset I am working with, approved by the instructor and attached here, I should run two of these on the dataset, but it's hard to decide which ones to go for. The hands-on work we did was so brief and guided that choosing which to run on my own is difficult. Can I have some tips or insight into which would be the best to run on my dataset and why? My instructor suggested ARM and Decision Trees, but I don't see how those would work at the moment.

Cleaning is another issue, but I will try to figure that out myself. Thank you!

Answers

  • Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    What are you trying to do with the data? Some of these techniques are unsupervised and some are supervised, and they are used for solving different types of problems.

    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
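
As a rough illustration of the supervised/unsupervised split mentioned in the answer above, here is a minimal Python sketch that groups the six techniques from the question. The grouping follows how these methods are typically taught in intro courses; the function name `candidate_techniques` and the `"numeric"` / `"categorical"` labels are made up for this example and are not part of RapidMiner or any library.

```python
# Typical grouping of the six techniques listed in the question.
# "unsupervised" = no target column needed; "supervised" = predicts a target.
LEARNING_TYPE = {
    "Association Rule Mining": "unsupervised",
    "k-Means Clustering": "unsupervised",
    "Linear Regression": "supervised",
    "Logistic Regression": "supervised",
    "Decision Trees": "supervised",
    "Neural Nets": "supervised",
}

# Kind of target each supervised technique usually expects.
# "either" means it can handle numeric or categorical targets.
TARGET_KIND = {
    "Linear Regression": "numeric",
    "Logistic Regression": "categorical",
    "Decision Trees": "either",
    "Neural Nets": "either",
}


def candidate_techniques(has_target, target_kind=None):
    """Return the techniques that fit the problem (hypothetical helper).

    has_target:  True if there is a column you want to predict.
    target_kind: "numeric" or "categorical" (only used when has_target is True).
    """
    if not has_target:
        return [t for t, kind in LEARNING_TYPE.items() if kind == "unsupervised"]
    return [
        t
        for t, kind in LEARNING_TYPE.items()
        if kind == "supervised" and TARGET_KIND[t] in (target_kind, "either")
    ]


if __name__ == "__main__":
    # No column to predict: look for patterns or groups instead.
    print(candidate_techniques(has_target=False))
    # Predicting a category (for example, a yes/no outcome column).
    print(candidate_techniques(has_target=True, target_kind="categorical"))
```

Under this sketch, the instructor's suggestion pairs one unsupervised method (ARM) with one supervised method (Decision Trees), so the two runs would answer different kinds of questions about the same dataset.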