The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
How to import and label multiple txt datas?
Hello, I have 3 folders containing txt files. Each folder represents an attribute that I would like to use for naive bayes classification.(For example folder 1 represents spam, folder 2 ham, folder 3 Neither ham/spam)
How can I import the txt files,every txt file labeled correctly and apply naive bayes on it? I already installed the text processing extension. I can import every txt file from one folder(with "process documents from files") but they aren't labeled. Thank you
How can I import the txt files,every txt file labeled correctly and apply naive bayes on it? I already installed the text processing extension. I can import every txt file from one folder(with "process documents from files") but they aren't labeled. Thank you
Tagged:
0
Answers
There is a loop files operator that will enable you to read all the text files. I understood that you want to label all these files based on folder names. If that is correct, then you can look at the below thread where @KostasBonikos explained very well on extracting folder names and generating attributes with them
https://community.rapidminer.com/discussion/43826/loading-folder-names-for-text-processing
Hope this helps. @kayman might add more details.
Thanks
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
In essence all is described in the mentioned thread, use the foldername as reference, generate an attribute based on this foldername and set it to label using the role operator.