The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Extract Topic form Data (LDA) operator and optimize hyperparameter settings

Chidi_Opara1Chidi_Opara1 Member Posts: 1 Learner I
I am working on a text mining using the Extract Topic from Data(LDA) operator and i am finding difficult understanding how to use the optimize hyperparameter setting? 
Specifically the "Optimize interval for hyperparameter" and " iterations" and how they affect the results
i am currenly using the default alpha and beta heuristics. what is the effect of changing these values?  

Answers

  • jwpfaujwpfau Employee-RapidMiner, Member Posts: 303 RM Engineering
    In general the number of sampling iterations should collerate with the model quality and runtime.

    Optimize interval for hyperparameter defines the number of iterations between hyperparameter optimizations, if you do this too often (low value) you might end up with instabilities due to alpha hyperparameters going to zero.

    α influences the number of topics per document
    β influences the number of words per topic

    If you want to have some more sound knowledge about LDA:
    Or maybe @mschmitz can correct me
  • MartinLiebigMartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
    edited March 2020
    Wow, @jwpfau. I didn't know that you as an engineer are so much into DS. Thats of course great!
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
Sign In or Register to comment.