"Text Mining Questions"
MockingBird
Member Posts: 2 Contributor I
Hello,
I'm using RapidMiner for the first time and I'm currently struggling with the following issue:
- I have several texts which I want to split into sentences. Each of the texts is stored in a single cell of a column in an Excel file.
- After that, I want to extract frequently occurring terms from these sentences.
- As a third step, I want to automatically categorize the sentences based on the terms or on combinations of the terms.
- Finally, I want to be able to select a term, for example "colours", and then be shown all sentences containing that term.
Thanks a lot,
Adrian
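For readers who want to see the four steps above outside of RapidMiner, here is a minimal Python sketch using only the standard library. It is not the RapidMiner process discussed in this thread. The sample texts, the stop-word list, the `min_count` threshold, and all function names are assumptions made for illustration, and the regex-based sentence splitter is deliberately naive (it will mis-split abbreviations such as "e.g.").

```python
import re
from collections import Counter, defaultdict

# Hypothetical sample texts; in practice each would come from one Excel cell.
texts = [
    "The colours are bright. The fabric feels soft! Are the colours durable?",
    "Delivery was slow. The fabric arrived damaged.",
]

# Small illustrative stop-word list (an assumption, far from complete).
STOP_WORDS = {"the", "are", "was", "a", "an", "is", "of", "and"}


def split_sentences(text):
    """Step 1: naive split after '.', '!' or '?' followed by whitespace."""
    return [p for p in re.split(r"(?<=[.!?])\s+", text.strip()) if p]


def tokenize(sentence):
    """Lowercased word tokens with stop words removed."""
    return [w for w in re.findall(r"[a-z]+", sentence.lower())
            if w not in STOP_WORDS]


def frequent_terms(sentences, min_count=2):
    """Step 2: terms that occur at least min_count times overall."""
    counts = Counter(t for s in sentences for t in tokenize(s))
    return {t: c for t, c in counts.items() if c >= min_count}


def categorize(sentences, terms):
    """Step 3: group sentences by the combination of frequent terms they contain."""
    categories = defaultdict(list)
    for s in sentences:
        key = tuple(sorted(set(tokenize(s)) & set(terms)))
        categories[key].append(s)
    return dict(categories)


def build_index(sentences, terms):
    """Step 4: map each frequent term to the sentences containing it."""
    index = defaultdict(list)
    for s in sentences:
        for t in set(tokenize(s)) & set(terms):
            index[t].append(s)
    return dict(index)


sentences = [s for text in texts for s in split_sentences(text)]
terms = frequent_terms(sentences)
categories = categorize(sentences, terms)
index = build_index(sentences, terms)

# Show every sentence that contains the selected term.
for sentence in index.get("colours", []):
    print(sentence)
```

Sentences that contain no frequent term end up in the category with the empty key `()`, which makes them easy to inspect separately.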
Answers
You might have a look at this tutorial. It should help you get started with RapidMiner:
http://vancouverdata.blogspot.de/2011/02/how-to-web-scraping-xpath-html-google.html
If you have further questions, feel free to ask.
Cheers,
Martin
Dortmund, Germany
Thanks for the link.
I'm currently trying to split the texts, which I imported from an Excel sheet, into sentences, and I have absolutely no idea what I'm doing wrong. I tried the "Tokenize" operator of the Text Processing addon as well as the "SentenceTokenizer" of the Information Extraction addon. Neither of them works. You can find the code below. I'm grateful for any hint. Thank you,
Adrian
Usually, process documents from text with a split on linguistic sentences should work fine, so it is hard to diagnose anything without the data.
I will write you a mail on that matter.
Cheers,
Martin
Dortmund, Germany
We will take care of this.
Dortmund, Germany