similarity

Bashar · January 2022

How are you .. How can I use data to similarity to calculate the similarity of a document with all the lines in a database and choose the most similar

Thank you

BalazsBarany · January 2022

Hi!

You would use these operators:
- Read Database for getting the data
- Process Documents from Data (from the Text Processing extension) to create a document vector
- A second Read Database (or different data source) for the data to compare
- Process Documents from Data, with the wor (Wordlist) output from the first one connected to the input of this one. This makes sure that the tables have the same structure
- Cross Distances

Then you would select the documents with the smallest distance (= largest similarity).

Regards,
Balázs

Howdy, Stranger!

Quick Links

Categories

Altair RapidMiner Community

GET HELP. LEARN BEST PRACTICES. NETWORK WITH YOUR PEERS.

similarity

Best Answer