The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Replacing whole words with dictionnary
Hi Rapid miner community,
I don't find the solution to replace whole words after a "read excel" operator. If I use a "Replace (dictionary)" operator linked with an excel file, words are partially substituted - as they are not tokenized - and sometimes part of the word is substituted and aggregated with the rest of the word. for instance, if in my dictionary I have many entries for the misspelling form of the word « application » (e.g app, apple, etc.) the result can be « applicationlicationncation » ... The reason is that, in my data set, I have many terms misspelled therefore I'd like to use such process to substitute the common misspelling forms.
Inside the « text processing » operator, after tokenization I could do it, but there’s no operator to handle this (as far I’ve seen). the « replace token could do the job, but I have to enter one by one all the entries that I presently have in my misspelling dictionary..
thanks for your help !
thanks for your help !
0
Best Answer
-
kayman Member Posts: 662 UnicornDid you tick the regex box on the replace operator? You also do not need the \b in your with, only in your substitute part.
Extending the match range is a question of modifying what you want to see in your boundaries, as in attached simple example.
Input : my andoid aplication isn't an androit app.
output : my android application isn't an android application.
5
Answers
best regards