Text mining breaks text into symbols
Hello,
I'm trying to do text mining on a large Excel table with many text entries (many words per cell). Unfortunately, my "Process Documents from Files" operator breaks my text into a mixture of symbols and letters.
I actually do not know why it does that, and my word list looks the same way.
Can you tell why this happens?
Thanks a lot
Imke
Answers
Hi,
Can you make sure that you tried the right encoding? It looks like this was stored as UTF-8 (the Mac/Linux standard) but read with a Windows encoding.
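To illustrate what such an encoding mismatch looks like, here is a minimal Python sketch (outside RapidMiner). The sample word and the choice of cp1252 as the Windows encoding are assumptions for illustration only; they are not taken from the actual files.

```python
# Hypothetical sample word containing non-ASCII letters.
original = "Größe"

# Bytes as a UTF-8 writer would store them on disk.
stored = original.encode("utf-8")

# Decoding those bytes with a Windows code page (cp1252 assumed here)
# turns each non-ASCII letter into two unrelated symbols.
misread = stored.decode("cp1252")

# Decoding with the matching encoding restores the original text.
correct = stored.decode("utf-8")

print("misread:", misread)
print("correct:", correct)
```

If the garbled text in the word list resembles the misread output, switching the encoding parameter of the reading operator to UTF-8 is the first thing to try.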
Br,
Martin
Dortmund, Germany
Hello,
Below you can see my process. Maybe you can tell me what is wrong and why it also crashes RapidMiner.
Thank you
Imke
For reference,
the issue was that the files in the folder were Excel files. Process Documents from Files can only handle plain text files. The attached process solved the issue.
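For anyone who wants to check a folder before pointing a text-only operator at it, the following is a rough Python sketch (not part of the attached process). It flags files whose first bytes match the ZIP signature used by .xlsx or the OLE2 signature used by legacy .xls. The function names are hypothetical, and the ZIP check also matches other ZIP-based formats such as .docx, so treat a hit as "not plain text" rather than "definitely Excel".

```python
from pathlib import Path

# Leading bytes of the two container formats Excel uses.
ZIP_MAGIC = b"PK\x03\x04"                          # .xlsx (ZIP archive)
OLE2_MAGIC = b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1"   # legacy .xls (OLE2)


def looks_like_excel_container(path):
    """Return True if the file starts with a ZIP or OLE2 signature."""
    with open(path, "rb") as handle:
        head = handle.read(8)
    return head.startswith(ZIP_MAGIC) or head.startswith(OLE2_MAGIC)


def find_binary_candidates(folder):
    """List the names of files in `folder` that are likely not plain text."""
    return sorted(
        p.name
        for p in Path(folder).iterdir()
        if p.is_file() and looks_like_excel_container(p)
    )
```

Calling `find_binary_candidates("path/to/folder")` returns the file names to convert or to read with an Excel reader instead; an empty list means no file carried either signature.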
Best,
Martin
Dortmund, Germany