The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

CSV File becomes very large when imported to RapidMiner

konmadkonmad Member Posts: 1 Learner I
edited August 2022 in Help
Hey everyone,

I have a .csv file with a size of 283MB, however when loading the file into RapidMiner it becomes roughly 6.7GB, which is too large for my system to perform text mining. The file contains 3 columns and around 220.000 rows. Two of the columns are simply ID's and one column is the actual text with around 100 words or so each. Maybe some of you have encountered the same issue and can help me fix it or at least understand what is going on here.

Thank you guys in advance!

Answers

  • ceaperezceaperez Member Posts: 541 Unicorn
    Hi @konmad

    This issue could be produced by the encoding and metadata. 
    have you tried to use the Store operator after the CSV importation and then import the data directly from Rapidminer using the Retrieve operator?

    Best, 
    Cesar
Sign In or Register to comment.