Very strange - parameter changes during csv generation
Hi,
I'm rather new to this and am trying text mining with a few hundred documents. I tokenize, filter stopwords, apply the Porter stemmer, filter by length, transform to lower case, and write the result to CSV.
In the CSV file, some stemmed words have probabilities in the range of 10e+11, although they have probabilities well below 1 in the example set table in RapidMiner (as it should be, in my opinion).
This effect is reproducible and seems to occur cumulatively for certain parameters (words).
The effect also occurs if I copy and paste the example set matrix from RapidMiner into Excel.
What is going wrong?
Many thanks for any assistance.
Comments
May I ask which RapidMiner version you are using? I tested this against the latest version, and it runs fine. I created an example set with the Data Editor, and simply added 2 rows, one with 1E+11 and one with 1E-11, both in a numerical and in a nominal column. The CSV result is as expected:
Regards,
Marco
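To narrow down which words and documents are affected, the exported file can also be checked outside RapidMiner. The following is only a minimal Python sketch, not part of the original process: it assumes the CSV has a header row and uses `;` as the column separator, and the function name `find_out_of_range` and the sample data are made up for illustration. It lists every numeric cell whose value is above 1, so the raw text written to the file can be compared with what the example set table shows.

```python
import csv
import io


def find_out_of_range(csv_text, delimiter=";", upper=1.0):
    """Return (row_number, column_name, raw_value) for numeric cells above `upper`.

    Hypothetical helper: the delimiter and the threshold are assumptions,
    adjust them to match the actual export settings.
    """
    reader = csv.DictReader(io.StringIO(csv_text), delimiter=delimiter)
    hits = []
    for row_number, row in enumerate(reader, start=1):
        for column, raw in row.items():
            try:
                value = float(raw)
            except (TypeError, ValueError):
                continue  # skip non-numeric cells such as labels or file names
            if value > upper:
                hits.append((row_number, column, raw))
    return hits


# Made-up sample standing in for the exported word vector.
sample = "word_a;word_b;label\n0.12;0.03;doc1\n1.0E11;0.40;doc2\n"
for hit in find_out_of_range(sample):
    print(hit)
```

For a real export, the file contents could be read with `open(path, encoding="utf-8").read()` and passed in as `csv_text`. Seeing the raw strings of the flagged cells may help tell whether the large numbers are already in the file or only appear once it is opened in Excel.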