The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
[SOLVED] Filter text from a list of word
Hi everybody,
I build a process to search and count a list of keywords in thousands of files.
I built the keywords list from a Excel file after seraval operations in an example set with a keyword by example.
I would like to be able to do something like an inverse of "Filter stopwords (Dictionary)" using the attribute of my example set (or a word list if someone can explain me how to convert an example set attribute into a word list).
I found the following topics but I don't know if there is something new since:
Thanks in advance
Johan
I build a process to search and count a list of keywords in thousands of files.
I built the keywords list from a Excel file after seraval operations in an example set with a keyword by example.
I would like to be able to do something like an inverse of "Filter stopwords (Dictionary)" using the attribute of my example set (or a word list if someone can explain me how to convert an example set attribute into a word list).
I found the following topics but I don't know if there is something new since:
- http://rapid-i.com/rapidforum/index.php/topic,2754.0.html
- http://rapid-i.com/rapidforum/index.php/topic,6330.0.html
- http://rapid-i.com/rapidforum/index.php/topic,3719.0.html
- http://rapid-i.com/rapidforum/index.php/topic,3493.0.html
Thanks in advance
Johan
Tagged:
0
Answers
Thank you for your reply.
To begin my work I've a table looking like the following:
But "Filter Stopwords (Dictionary)" operator uses a txt file as dictionary.
Finally to solve my problem, I created a new operator "Filter Startword (Dictionary)" by removing the '!' in the class "StopwordOperator" at line 74.
Regarding the list of word (and not WordList) I used the following operator
- "Set Role" to remove the ID as special attribute
- "Select Attributes" with the "Single" parameter to keep only the keywords
- "Write CSV" with a space as column separator and I connected the "file" output
I hope I am not too confus in my explaination.Greetings
Johan