The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
text processing pdfs
I am trying to build a word cloud from pdfs. Is there some sort of "demo" for this? Do I need to convert the pdfs to text first? I saw a video where he suggested converting to txt files and put them in a separate folder. ((92) Text Processing on Rapid Miner - YouTube)
I tried with a process (see attached xml) but I am getting gibberish for the output (see attached image). Any suggestions here? Thank you!
I tried with a process (see attached xml) but I am getting gibberish for the output (see attached image). Any suggestions here? Thank you!
Tagged:
0
Best Answer
-
MartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data ScientistHi,did you use read_document to read the pdf? it got a setting to read PDFs.Best,Martin- Sr. Director Data Solutions, Altair RapidMiner -
Dortmund, Germany0
Answers