The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Dictionary Approach: Avoid multiple count of words

FeliceFelice Member Posts: 3 Learner I
edited November 2019 in Help
Hi, I have a problem with the Dictionary Approach in Text-Mining. The dictionary contains the words digit and digital acceleration. My process counts the ouccurance of digital acceleration double, so once as digit and once as digit acceleration.
Can you recommend an operator which enables that only the occurence of digital acceleration is counted. So that in the end I have only one ouccurance. 

Tanks for helping! 

Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
    Hi,
    what operator did you use to do it? Can you maybe post the XML?

    Best,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • FeliceFelice Member Posts: 3 Learner I
    Hi Martin, thanks for your reply! Attached you can find the xml. 
  • MartinLiebigMartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
    Hi @Felice ,
    you can just switch to binary occurances? Then it is only counting if, not how often a word occurs.

    Cheers,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • FeliceFelice Member Posts: 3 Learner I
    Hi Martin, 
    thanks for your reply. But I need the quantity of occurences, not only if a word occurs. I just want to avoid that longer word combination like digit accel are counted also as digit. 

    Thanks for helping! 
Sign In or Register to comment.