The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

"Bug in SparseFormatExampleSource?"

Legacy UserLegacy User Member Posts: 0 Newbie
edited May 2019 in Help

I read in some examples from a DB and vectorized them with the text tools. The resulting data set looked like this:

image

I saved it to a file with an ExampleSetWriter. Then I read it back in using a SparseFormatExampleSource. The result was:

image

Note that the word 'weight' has been _inserted_ into the Name column, pushing the other values down. But its value seems to have _overwritten_ the Statistics column.

Bug? Could it have to do with 'weight' being a special weighting column? I couldn't reproduce the problem with the sample newsgroup data. Maybe it didn't contain the word 'weight'.
Tagged:

Answers

  • Legacy UserLegacy User Member Posts: 0 Newbie
    To clarify, I didn't add any weighting column... Just sourced the text, applied the StringTextImput operators, saved, then tried to read in the file. Using both write and read operators creates two datasets, the original and the newly read one. So it's easy to see the differences.
  • IngoRMIngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Hi,

    did you try loading the data with and witout defining the .aml file?

    Cheers,
    Ingo
Sign In or Register to comment.