The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

text classification with dialect in Arabic language

BadrBadr Member Posts: 1 Learner I
I made text classification for Arabic and it works very well with standard Arabic language but now I will use   with dataset with different dialect in Arabic ? can I use same operators( tokenize and stem(Arabic)  and stopword 

Best Answer

  • Telcontar120Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    Solution Accepted
    It should as long as the underlying characters are not different, even though the vocabulary, syntax and usage may vary in the dialect.  RapidMiner (or any NLP algorithm) doesn't really understand languages, it just transforms them all into numerical representations to manipulate.
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
Sign In or Register to comment.