The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Replacing special characters when text mining

robinrobin Member Posts: 100 Guru
edited June 2019 in Help
Hi

I am attempting to replace special characters inside of a dataset but not having much luck. 

André Sho
émaker
Adri
é Spéllman

It seems as though RapidMiner is unable to understand those characters and is simply marking them as 
�. Non of the replace operators can seem to fix this issue. 

Should I be doing this inside of a generate attributes operator to account for this or is there a better way?


Best Answers

Answers

  • BalazsBaranyBalazsBarany Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert Posts: 955 Unicorn
    Hi!

    Where is your dataset coming from? Operators like Read CSV that import external data have an "encoding" parameter. If you set the encoding correctly for your data import, you'll see all special characters.

    Regards,
    Balázs
  • rfuentealbarfuentealba RapidMiner Certified Analyst, Member, University Professor Posts: 568 Unicorn
    Another thing: -I is a capital "i".
Sign In or Register to comment.