The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Data with special character getting ignored while training

vipul_kumarvipul_kumar Member Posts: 20 Contributor I
edited November 2018 in Help

Hello,

 

I have a training model and i train that using csv data set. 
I came across one particular observation where if the row in csv data set contains special characters like \n,\r ot ^M (which can come easily if files are generated on windows and transfered to linux.) , the training process just ignores all these lines.
My csv data contains these specical characters on almost every line and hence the model is not getting trained with all these data as is ignoring these rows with special characters.

 

What is the way to instruct rapidminer server to allow these characters.

 

Please Note: I dont have control over the special characters coming in the CSV. I can do a lot of regular expression replacement in the csv to remove those special characters  but that is the last option.

Tagged:

Answers

  • sgenzersgenzer Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager

    hello @vipul_kumar - I'd recommend posting your XML process here (see "Read Before Posting" on right when you reply) and attach your dataset. This way we can replicate what you're doing and help you better.

     

    Scott

     

     

Sign In or Register to comment.