How to add filename to the wordlist output?
Hi,
I am processing text files and want to include each text file's name in the word list output.
I would like to see the output as follows:
TextFile_Name | Word | Occurrences
---------------------------------------------------
R1.doc | java | 2
R1.doc | oracle | 3
R1.doc | database | 1
R2.doc | sql | 1
Can you please suggest how to achieve this in RapidMiner?
Thanks a lot for your help!
Regards
Sridhar
Answers
This blog post has an example that uses the file name in a text processing context: http://rapidminernotes.blogspot.co.uk/2013/04/counting-words-in-lots-of-documents.html. You could use it as a starting point.
regards
Andrew
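
For reference, the target layout from the question (one row per file name and word, with its count) can be sketched outside RapidMiner in plain Python. This is only an illustration of the output shape, not the RapidMiner process from the blog post. The function names `word_rows` and `read_text_files` are made up for this sketch, the tokenizer is a simple lowercase alphanumeric split, and the input is assumed to be already-extracted plain text (real .doc files would first need a text extraction step, which is not shown).

```python
import re
from collections import Counter
from pathlib import Path


def word_rows(documents):
    """Yield (file_name, word, occurrences) for each distinct word per document.

    `documents` maps a file name to its plain text (an assumption of this
    sketch; extracting text from .doc files is not handled here).
    """
    for file_name, text in documents.items():
        # Simple tokenizer: lowercase runs of letters and digits.
        counts = Counter(re.findall(r"[a-z0-9]+", text.lower()))
        for word, n in sorted(counts.items()):
            yield file_name, word, n


def read_text_files(folder, pattern="*.txt"):
    """Hypothetical helper: load plain-text files into {file name: text}."""
    return {
        p.name: p.read_text(encoding="utf-8", errors="ignore")
        for p in sorted(Path(folder).glob(pattern))
    }


if __name__ == "__main__":
    # In-memory stand-ins for the files named in the question.
    sample = {
        "R1.doc": "java oracle java oracle database oracle",
        "R2.doc": "sql",
    }
    print("TextFile_Name | Word | Occurrences")
    for file_name, word, n in word_rows(sample):
        print(f"{file_name} | {word} | {n}")
```

The key point is that the file name travels with each document, so it can be attached to every word count row. In RapidMiner terms, this corresponds to keeping the document's file name as metadata alongside the word counts.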