The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Use AI Models for approximate matches

paolochippaolochip Member Posts: 4 Learner I
Hi,
many many times I'm struggling with joining/lookup values from different data sources that have sometimes slighlty differences hard to be coded or predicted. One of the classical example is two customer lists where the same customer can have capital vs not, commas, spaces and other differences in its name in a very random ways (e.g. CustomerA ltd vs customerA, ltd. or Customer B vs BCustomer, etc..) . I was wondering if there are existing AI models that can be used in RM that can solve this issue.

Thanks,

Paolo

Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
    Hey,
    last time I did this with a customer we ended up using Levenshein Distance. One may use word embeddings for it if you want to use some 'AI'.
    Cheers,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • paolochippaolochip Member Posts: 4 Learner I
    Hi Martin, thanks for your quick response, we will try and see

    Paolo
Sign In or Register to comment.