The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Entity Extraction with Process Documents
data:image/s3,"s3://crabby-images/e9e37/e9e376f86fc989f8be36462752cae2b4a4f55b06" alt="JeffChowaniec"
data:image/s3,"s3://crabby-images/dfa81/dfa81c99b23088ce6f6b14f776b3a4b9ce7da500" alt=""
Entity Extraction:
Using the Text mining extension and coupled process documents operators, we can build a process for entity extraction.
Requirements:
- Text Processing Extension
- Text file of entities to be extracted
- Text file to extract
For the entity file, a simple CSV where each line is an individual entity
Step 1:
Read CSV - Call our entity CSV file
Process documents from data - send the read CSV into the process documents
Inside the process documents from data we will need a filter tokens and a transform cases (lowercase)
Step 2:
Using the word list output from step one, we connect it to a Process Documents operator to extract the word list from the text.
1
Comments
On step 2: what process involved in the second 'process document and after that, how to do to create the document ?
Other question, after create the document (there is sentence that we want to extract).
How to do matching the sentence with database use?