The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Sentiment Prediction through Text Mining"
Hi there!
We would like to start building a sentiment predicting system. That system should be able to predict market humor about companies/products.
I know this is a complex task and I would like to get some tips about building such a system using Rapid Miner.
Would Text Classification with Cross Validation be suitable to separate good from bad news about a company X?
Do you recommend any books in this field?
Thank you very much and hope to get things going soon.
Braulio
We would like to start building a sentiment predicting system. That system should be able to predict market humor about companies/products.
I know this is a complex task and I would like to get some tips about building such a system using Rapid Miner.
Would Text Classification with Cross Validation be suitable to separate good from bad news about a company X?
Do you recommend any books in this field?
Thank you very much and hope to get things going soon.
Braulio
Tagged:
0
Answers
We would like to start building a sentiment predicting system. That system should be able to predict market humor about companies/products.
I know this is a complex task and I would like to get some tips about building such a system using Rapid Miner.
[/quote]
Well, building such a system is certainly possible with RapidMiner. In fact, RapidMiner is a very suitable tool for such tasks, as despite its wide coverage of data mining / bi / etl approaches it is highly intuitive and can therefore be excellently used for what we call Rapid Prototyping which means to set up initial data mining processes easily and in a relatively short time. Due to the fact that RapidMiner can also be easily integrated into your applications, you can build a full system around the processes you just set up. Yes, learning a text classification model would be an approach to separate good news from bad news. This of course means, you have to supply (or crawl from the internet) text passages which you can present to the learner to make it able to learn a model.
Coming to the second part, a cross validation builds a mandatory part of almost all supervised learning tasks in which the performance of a model should be validated. Hence, it is suitable and recommendable in your application area as well, if you want to know how good your models can separate the good from the bad news.
Hope that helps,
Tobias
I am just wondering how the sentiment could be predicted at an entity level. One thing is to get an overall sentiment about a text. Another thing (completely different) is to get different sentiments about different entities. In the same text, there can be a positive sentiment about a company X on the first paragraph and a bad sentiment about company Y on the second.
Any tips on how to handle such a task with Rapid-i would be greatly appreciated.
Thank you very much
Braulio
you could try to segment the data with the corresponding operators and see if the segments get different sentiment classifications.
Cheers,
Ingo
Thanks a lot.
Unfortunatelly I could not get to Dortmund to attend the seminars this week.
I am building a team to work in this area in Brazil (portuguese language) and I will certainly need some partners that have already the expertise in building such systems. Hope we can get in touch.
Vielen Dank
Braulio Medina
the text plugin contains an operator called "Segmenter". It it a bit hard to configure but it should do the trick. Otherwise, your could crawl and / or segment the data yourself and let RapidMiner do the mining stuff. Yes, it's a shame - but I am sure we can get in touch later.
Cheers,
Ingo
I will work at the Text Segmenter.
Thanks a lot