Stuck at LDA process. No results are coming
lambamanika07
Member Posts: 24 Maven
in Help
I updated RapidMiner, and since then I cannot get any results from my LDA process. I am attaching a screenshot of the process and its sub-processes. I have been trying to run LDA for the last 2-3 days, but the results only show 'NA'. Kindly help.
Best Answer
MartinLiebig
Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
Hi @lambamanika07,
Your file is encoded in UTF-8. If you are using Windows, you should change the encoding of Read Document to UTF-8. Otherwise strange things happen with characters like é.
Further, you should use a Tokenize operator before your text mining operators. Operators like 'Stem' or 'n-grams' work on the tokens. This may have duplicated your data.
Lastly: can you quickly confirm that the number of topics you search for is less than the number of documents? If you search for 5 topics in 2 documents, that is doomed to fail.
Best,
Martin
- Sr. Director Data Solutions, Altair RapidMiner -
Dortmund, Germany
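The three checks in the answer above can be illustrated outside RapidMiner with a small Python sketch. This is not how the RapidMiner operators are implemented. The function names (`misread_utf8`, `tokenize`, `bigrams`, `topic_count_is_plausible`) are hypothetical, and `cp1252` is assumed as the Windows default encoding.

```python
import re


def misread_utf8(text: str, wrong_encoding: str = "cp1252") -> str:
    """Encode text as UTF-8, then decode the bytes with a different codec.

    Mimics reading a UTF-8 file with a Windows default encoding
    (cp1252 is an assumption here).
    """
    return text.encode("utf-8").decode(wrong_encoding)


def tokenize(text: str) -> list:
    """Split text into lowercase runs of letters (a simplified tokenizer)."""
    return re.findall(r"[^\W\d_]+", text.lower())


def bigrams(tokens: list) -> list:
    """Build 2-grams from a token list; this needs tokens, not raw text."""
    return [f"{a}_{b}" for a, b in zip(tokens, tokens[1:])]


def topic_count_is_plausible(n_topics: int, n_documents: int) -> bool:
    """Check that the topic count is positive and below the document count."""
    return 0 < n_topics < n_documents


# 1. Wrong encoding corrupts accented characters before any text mining runs.
print(repr(misread_utf8("é")))  # 'Ã©'

# 2. Tokenize first, then apply token-level steps such as n-grams.
tokens = tokenize("Topic models need tokens")
print(bigrams(tokens))

# 3. Five topics in two documents fails the sanity check.
print(topic_count_is_plausible(5, 2))  # False
```

In this sketch the corrupted character also changes the tokens themselves: tokenizing the misread form of "café" yields a different token from the correctly decoded word, which is one way a wrong encoding can degrade the topic model's input.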
Answers
Yes, I have checked many times. I tried with both text files and PDF files. I even tried different text samples, but had no luck. The results still came out as NA, as in the screenshot.
The text is in English. I ran the same samples a few weeks ago for testing and it worked fine. At that time I was using version 8 of RapidMiner. I have been facing this problem since I upgraded to the latest version 9. I do not think the upgrade itself is causing the problem, but I am mentioning it just in case.
I have sent you a personal message with the sample text and the process. Thank you in advance for your help.
I made both suggested changes to the process, selecting UTF-8 as the encoding and adding a tokenization operator, and then it worked.
With regards
Manika