Prediction error on a text-based dataset

shroud Member Posts: 2 Learner I
edited May 2020 in Help
Hello everyone, I am new to RapidMiner. I have been stuck on this problem for many hours and need help.
I am using the Movie dataset from UCI: http://archive.ics.uci.edu/ml/datasets/Movie
It contains several datasets covering movies, the actors in those movies, and their directors. The main file is the movies dataset, which has the movie name, genre, shooting location, release date, director, and producer. Every column except the date is text.

I tried to use RapidMiner's "Auto Model" to predict the genre of each movie. Whichever model I choose, it fails with this error: "The input data did not have any attributes left after preprocessing. One cause could be that the data only has text columns which have not been processed. Please check your data and settings."

I searched online but could not find anything about this error.

Answers

  • Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    Did you do text preprocessing prior to Auto Model, or did you select the option for Auto Model to do the text processing for you? Either way, the text needs to be transformed into a word vector before conventional ML algorithms can create predictions.
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
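
To illustrate what "transformed into a word vector" means, here is a minimal bag-of-words sketch in plain Python. It is not RapidMiner's implementation, and it is not how Auto Model processes text internally. The function names and the sample titles are hypothetical, chosen only to show how a text column can become numeric attributes that a conventional learner can use.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_vocabulary(documents):
    """Map every distinct token to a column index, in alphabetical order."""
    tokens = sorted({tok for doc in documents for tok in tokenize(doc)})
    return {tok: idx for idx, tok in enumerate(tokens)}

def to_word_vector(text, vocabulary):
    """Return term counts for `text`, one entry per vocabulary column."""
    counts = Counter(tokenize(text))
    # Dicts preserve insertion order, so columns follow the vocabulary order.
    return [counts.get(tok, 0) for tok in vocabulary]

# Hypothetical movie titles standing in for one text column of the dataset.
titles = ["The Dark Night", "Night of the Hunter", "Dark Passage"]
vocabulary = build_vocabulary(titles)
vectors = [to_word_vector(title, vocabulary) for title in titles]

for title, vector in zip(titles, vectors):
    print(title, vector)
```

Each title becomes a row of numeric counts with one column per distinct word. That is the kind of input a conventional algorithm expects in place of raw text. Real pipelines often add steps such as stop-word removal or TF-IDF weighting, which this sketch leaves out.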