Cannot parse data sets contents

nevs Member Posts: 3 Learner I
Hello, 
I want to use the following data set but when I tried to import it (Turbo Prep), I got an error (see screenshot). 
Is there any way to fix it? 

Data set: "FIFA 19 complete player dataset" from Kaggle (can't post any links).

Thanks!

Answers

  • David_A Administrator, Moderator, Employee-RapidMiner, RMResearcher, Member Posts: 297 RM Research
    Hi @nevs ,

    Did you do any pre-processing on the file?
    I have just downloaded the data set from Kaggle and it worked fine for me.
    The error you see states that somewhere in the column "Aggression" the value is "58-10", which is of course not a valid number format.
    What can occasionally happen is this: RapidMiner looks at the first hundred lines of the data set and guesses each column's type (number, date, string). If an entry like "58-10" then occurs further down in the file, the parser stumbles. If you are using the "Read CSV" operator, there are options for this case, such as "read all values as polynomials" and "read not matching values as missings".
    Best,
    David
  • nevs Member Posts: 3 Learner I
    Thanks for your answer, but I made a mistake, sorry!
    The name of the data set is "FIFA 18 Complete Player Dataset", not FIFA 19...
    It is from the user Aman Shrivastava on Kaggle.
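
For readers who want to see the mechanism David describes, here is a minimal plain-Python sketch of sample-based type guessing. It is not RapidMiner's actual code: the function names, the `non_matching_as_missing` and `all_as_text` flags, and the generated sample data are hypothetical and only loosely mirror the two "Read CSV" options mentioned above. The sample size of 100 is taken from David's description.

```python
import csv
import io


def guess_is_numeric(values):
    """Return True if every sampled value parses as a number."""
    try:
        for value in values:
            float(value)
        return True
    except ValueError:
        return False


def read_column(text, column, sample_size=100,
                non_matching_as_missing=False, all_as_text=False):
    """Read one CSV column, guessing its type from the first rows only."""
    rows = list(csv.DictReader(io.StringIO(text)))
    raw = [row[column] for row in rows]

    # Either the caller asked for text, or the sample does not look numeric.
    if all_as_text or not guess_is_numeric(raw[:sample_size]):
        return raw

    parsed = []
    # Data starts on file line 2 because line 1 is the header.
    for line_no, value in enumerate(raw, start=2):
        try:
            parsed.append(float(value))
        except ValueError:
            if non_matching_as_missing:
                parsed.append(None)  # treat the bad cell as a missing value
            else:
                raise ValueError(
                    f"line {line_no}: cannot parse {value!r} "
                    f"in column {column!r} as a number"
                )
    return parsed


# Hypothetical data: 100 clean rows, then one malformed entry further down.
SAMPLE = (
    "Name,Aggression\n"
    + "".join(f"p{i},{50 + i % 40}\n" for i in range(100))
    + "p100,58-10\n"
)

if __name__ == "__main__":
    try:
        read_column(SAMPLE, "Aggression")
    except ValueError as err:
        print("strict read failed:", err)

    lenient = read_column(SAMPLE, "Aggression", non_matching_as_missing=True)
    print("last value with missings enabled:", lenient[-1])

    as_text = read_column(SAMPLE, "Aggression", all_as_text=True)
    print("last value read as text:", as_text[-1])
```

The strict read fails because the guess was made before the malformed row was seen. Both workarounds avoid the error, but they differ in what they keep: reading bad cells as missing preserves the numeric column and drops the value, while reading everything as text keeps "58-10" so it can be cleaned up later.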
