The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Morphological stemming in RapidMiner?
batstache611
Member Posts: 45 Maven
Hi All,
Instead of using the Porter stemming algo or one of its variants, is it possible to stem words to their morphological root in RapidMiner?
Thank you.
Tagged:
0
Answers
Hi,
two options you might want to try include two new operators present in the Operator Toolbox extension.
One option includes using the Stem Tokens Using ExampleSet-Operator using a predefined ExampleSet as a source of potential word stems (similar to the Stem (Dictionary) operator, but using an ExampleSet as an input instead of a file). You'll find an example process included as a tutorial.
Another option would be trying to use the Levenshtein Distance. With it you could search for token with a low Levenshtein Distance and choose the shortest of them. You'll find a Generate Levenshtein Distance-Operator in the Operator Toolbox as well.
Best,
Philipp