The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Windows and UTF-8
eisioriginal
Member Posts: 4 Contributor I
Hello,
i currently try to use Rapidminer to crawl some chinese content. I use the crawl web operator and store the crawled pages to my file system. I also use a content filter within my process.
When i set some chinese words within the content filter those characters are ??? when i reload the process within rapid miner. I also have wrong characters in the resulting crawled pages in my folder because the files are stored in ANSI Format.
I already tried the encoding option of rpid miner with no success. How can i run RapidMiner on windows in a way that its storing utf-8 files and process files?
Thank you
Andreas
i currently try to use Rapidminer to crawl some chinese content. I use the crawl web operator and store the crawled pages to my file system. I also use a content filter within my process.
When i set some chinese words within the content filter those characters are ??? when i reload the process within rapid miner. I also have wrong characters in the resulting crawled pages in my folder because the files are stored in ANSI Format.
I already tried the encoding option of rpid miner with no success. How can i run RapidMiner on windows in a way that its storing utf-8 files and process files?
Thank you
Andreas
0