The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Is it possible to see the stopword list from the extension Text Procession?"
I want to see the English word list from the stopword operator. Because I need to filter one stopword out, "no". In my case this is not a stopword and gives crucial information about something. For example: "no fault found" would be converted to "fault found". I've looked online for stop word lists but they differ a lot, so I want to see what the extension uses.
Tagged:
0
Best Answer
-
rfuentealba RapidMiner Certified Analyst, Member, University Professor Posts: 568 UnicornHello @Prentice,
For your case, I think it is better to get a list of words, see which ones are stopwords for your case and build a custom dictionary. I haven't been able to find the list of stopwords either, so that's what I do. I suspect that the list of stopwords basically includes all the common words that don't correspond to verbs, adjectives or nouns, and that the use case is to extract context, rather than interpreting meaning.
With context I mean: both sentences "food is good" and "food is not good", after being stripped from "is" and "not", are talking about food quality; Meaning is separating between "food is good" and "food is bad". I explain it here because I'm not a native English speaker and probably it's not the way these words are used.
Hope this helps,
Rodrigo.5
Answers