The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Newbe to Rapidminer: Want to apply unsupervised outlier detection
Saurabh_Sawant_24
Member Posts: 7 Learner II
Hi,
I am new to rapidminer and data science and want to apply unsupervised outlier detection on 10 million transactional dataset.
want to understand ideal data processing steps and alogrithms to perform the activity. Thanks.
I am new to rapidminer and data science and want to apply unsupervised outlier detection on 10 million transactional dataset.
want to understand ideal data processing steps and alogrithms to perform the activity. Thanks.
Tagged:
0
Comments
To get started, I'd download the free Anomaly Detection extension and take a look at the operators they have. One simple method you could start with would be the HBOS approach, which is non-parametric and pretty simple. It looks to identify outliers based purely on ranges. If that doesn't suit your needs you may want to look at some density-based measures like LOF.
Without more detailed information, it is hard to make a more concrete recommendation. One word of warning, though, with 10MM records you might want to take a sample first and try a few approaches to see what they are like. Otherwise you might have to wait a long time for RapidMiner to process that many records, depending on the hardware resources you have available!
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Scott
Tried auto model and followed the steps as suggested but giving an error "No key attributes are specified. Please adjust the parameter 'key attributes'" in the Results section
Please help.
Scott
Please see the attached error.