"Reading Microsoft word documents (word count)"
Hi,
I did some searching on this topic and found almost nothing on reading DOC and DOCX documents with the 'Read Document' operator. Is this possible without first converting the MS Word documents to a supported format (e.g. CSV, PDF, RTF, HTML)? I have thousands of Word documents, so I would like to read them without pre-processing.
Regards,
Serge
Answers
I'm afraid that is currently not possible.
Regards,
Marco
I have the same problem.
Currently I use a bash script to convert the DOC and DOCX files, but I would like to avoid this pre-processing step.
Please let me know if you find something that can help.
Regards
Johan
You can run your conversion script from within your RapidMiner process with the Execute Program operator.
Best regards,
Marius
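As an illustration of the kind of conversion script that could be launched this way, here is a minimal Python sketch. It is not a RapidMiner feature and not the script mentioned above; the function names `docx_to_text` and `convert_folder` are made up for this example. It relies only on the fact that a DOCX file is a ZIP archive whose main text lives in `word/document.xml`, and it assumes plain body text: legacy binary DOC files are not handled, headers and footnotes are ignored, and text inside text boxes may be counted twice.

```python
import zipfile
import xml.etree.ElementTree as ET
from pathlib import Path

# Namespace used by WordprocessingML elements such as <w:p> and <w:t>.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_to_text(path):
    """Return the body text of a DOCX file, one line per paragraph."""
    with zipfile.ZipFile(path) as archive:
        xml_bytes = archive.read("word/document.xml")
    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for para in root.iter(f"{{{W_NS}}}p"):
        # A paragraph's text is split over several <w:t> runs; join them.
        runs = [node.text or "" for node in para.iter(f"{{{W_NS}}}t")]
        paragraphs.append("".join(runs))
    return "\n".join(paragraphs)


def convert_folder(src_dir, dst_dir):
    """Write a .txt file per .docx in src_dir and return word counts."""
    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    counts = {}
    for docx in sorted(Path(src_dir).glob("*.docx")):
        text = docx_to_text(docx)
        (dst / (docx.stem + ".txt")).write_text(text, encoding="utf-8")
        # Whitespace-separated tokens as a rough word count.
        counts[docx.name] = len(text.split())
    return counts
```

Calling `convert_folder("word_docs", "txt_docs")` (both folder names are placeholders) would leave a folder of plain-text files that a text-reading operator can pick up, so the conversion can run as the first step of the process instead of as a separate manual pass.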