"Programmatic access to multiple Word Vectors?"

drstevekramer Member Posts: 7 Contributor II
edited June 2019 in Help
When using the Word Vector methods in the Text plugin (or the associated WVTool library), I know that I can create a single WordVector using one of the classes (like TFIDF) that implement the WVTVectorCreator interface. That gives me access to the internal double array of calculated weights.

If one instead uses the WVTool.createVectors method, i.e. createVectors(WVTInputList input, WVTConfiguration config, WVTWordList wordList), to create an entire set of WordVectors, is it possible to get direct programmatic access to the collection of WordVectors that were created? I want to be able to iterate over them and access the double arrays of the weights.

From what I have read so far, the WordVector results can be written to a text file, but I'd rather not have to write to and read from disk constantly. Any suggestions?

Thanks,
Steve Kramer

Answers

  • IngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Hi,

    I am not completely sure, but the class "TextInput" of the text plugin should contain something like this, probably somewhere near the RapidMinerOutputFilter code. Maybe Michael is reading this and can give more details...

    Cheers,
    Ingo
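
The reply above points at an output filter, which suggests one way to avoid the disk round trip: hand the vector-creation step a sink that keeps each vector in memory instead of writing it to a file. The sketch below only illustrates that pattern. SimpleWordVector, VectorSink, InMemoryVectorSink and the createVectors helper are hypothetical stand-ins written for this example, not WVTool classes; whether WVTool's output step can be replaced with a custom implementation in this way is an assumption that should be checked against the library's Javadoc.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for a word vector: a document id plus its weights.
final class SimpleWordVector {
    private final String documentId;
    private final double[] values;

    SimpleWordVector(String documentId, double[] values) {
        this.documentId = documentId;
        this.values = values.clone(); // defensive copy of the weights
    }

    String getDocumentId() { return documentId; }

    double[] getValues() { return values.clone(); }
}

// Hypothetical stand-in for an output step that receives one vector at a time.
interface VectorSink {
    void write(SimpleWordVector vector);
}

// Collects every vector it receives in a list instead of writing it to disk.
final class InMemoryVectorSink implements VectorSink {
    private final List<SimpleWordVector> vectors = new ArrayList<>();

    @Override
    public void write(SimpleWordVector vector) {
        vectors.add(vector);
    }

    List<SimpleWordVector> getVectors() {
        return Collections.unmodifiableList(vectors);
    }
}

final class VectorSinkDemo {
    // Stand-in for a batch vector-creation call: pushes each vector to the sink.
    static void createVectors(String[] ids, double[][] weights, VectorSink sink) {
        for (int i = 0; i < ids.length; i++) {
            sink.write(new SimpleWordVector(ids[i], weights[i]));
        }
    }

    public static void main(String[] args) {
        InMemoryVectorSink sink = new InMemoryVectorSink();
        createVectors(new String[] {"doc1", "doc2"},
                      new double[][] {{0.5, 0.0}, {0.1, 0.9}},
                      sink);

        // Iterate over the collected vectors and read their weight arrays.
        for (SimpleWordVector v : sink.getVectors()) {
            System.out.println(v.getDocumentId() + " has " + v.getValues().length + " weights");
        }
    }
}
```

If the real library allows a custom output step, the same idea would apply: implement its output interface with a class like the collecting sink here, register it in the configuration passed to createVectors, and read the list once the call returns.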