The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Text Mining Phrases"
Hi,
I recently created a simple dictionary-based process for locating and counting specific words (i.e. for sentiment the label was "positive" and the words were "good", "awesome" etc.) based on a text file with each line including a single word from the dictionary and applying the dictionary to a set of documents.
I would like to replicate the process but count specific multiple word phrases (i.e. "very good", "better than the best", etc.). I assume this involves a different tokenization and specifying the n-grams, but I cannot figure out the correct process.
Thanks for any assistance.
I recently created a simple dictionary-based process for locating and counting specific words (i.e. for sentiment the label was "positive" and the words were "good", "awesome" etc.) based on a text file with each line including a single word from the dictionary and applying the dictionary to a set of documents.
I would like to replicate the process but count specific multiple word phrases (i.e. "very good", "better than the best", etc.). I assume this involves a different tokenization and specifying the n-grams, but I cannot figure out the correct process.
Thanks for any assistance.
Tagged:
0