The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Outlier deection: How to change an outlier value to mean value of the attribute
data:image/s3,"s3://crabby-images/e9e37/e9e376f86fc989f8be36462752cae2b4a4f55b06" alt="Ecclesiastes"
data:image/s3,"s3://crabby-images/2b985/2b98579ca4abd9381f526177f3044c84ba7cc6a3" alt=""
Hi,
I'm working on a high dimensional data (>250 attributes) to compare the different outlier detection methods.
I have already tested CoF and teh Distance-based method. There prodoce total different reults, but that was expected.
However, for forther comarision I like to treat the detected outlier in a simple workflow like this:
1) run outlier detecion
2) replace detectet outlier value with mean value of the attribute
3) run a clasifier on the preprocessed data.
Both, CoF and Density based outlier detection creates a new boolean variable outlier = (true / false)
that means i need just something like a filter, which selcts teh affected value of the attribute and
a simle replacement with the mean value of the attribute.
I have just found a "replace missing value" function which offers the mean replacement,
but not for outlier.
Is there a way, to do this sort of value replacement in rapid miner?
I have used RapidMinder today for the first time, so Im no expert..
Any comments are appreciated
marvin
I'm working on a high dimensional data (>250 attributes) to compare the different outlier detection methods.
I have already tested CoF and teh Distance-based method. There prodoce total different reults, but that was expected.
However, for forther comarision I like to treat the detected outlier in a simple workflow like this:
1) run outlier detecion
2) replace detectet outlier value with mean value of the attribute
3) run a clasifier on the preprocessed data.
Both, CoF and Density based outlier detection creates a new boolean variable outlier = (true / false)
that means i need just something like a filter, which selcts teh affected value of the attribute and
a simle replacement with the mean value of the attribute.
I have just found a "replace missing value" function which offers the mean replacement,
but not for outlier.
Is there a way, to do this sort of value replacement in rapid miner?
I have used RapidMinder today for the first time, so Im no expert..
Any comments are appreciated
marvin
0
Answers
this is possible, but unfortunately a little bit complicated
Greetings,
Sebastian
For future reference, outlier detection operators based on neighbors should not take the parameter (number of neighbors = 1). Because the nearest neighbor (number of neighbors = 1) for a given example is the example itself. This would lead to make distance based outliers detection methods to detect outliers improperly.
Please correct me if I am wrong.
Regards,
--Motaz
I think you are right depending on the definition of neighbor
Greetings,
Sebastian
Warm Greetings
--Motaz