The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Answers
Are you asking if you can have terms of more than one word/token? If so, the answer is yes. After you tokenize, use the Generate n-Grams (Terms) operator. This will generate phrases of n sequential tokens. Note: you will still have the single terms in your term-by-document matrix too. For example, generating 2-grams you would have "heart", "attack", and "heart attack" in the matrix.
i think there is no way from preventing it to generate the table. There is the option however to use a clever Regex in Select Attributes and simply remove them.
~Martin
Dortmund, Germany