The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Impute Missing Values by KNN
derek_tsui
Member Posts: 4 Contributor I
Hi Experts,
I walked through the operator of 'Impute Missing Values' that the tutorial is using K-NN scheme, and the configuration of parameters with ticked "iterate" and "learn on complete cases". May I know the default of this parameter is using K-NN scheme for imputation?
Thanks,
Derek
Tagged:
0
Answers
Hi Derek,
For the tutorial process kNN with a default of 1 is useful because kNN simply selects the value from the nearest record (using distance measures) to the missing value. It's a pretty logical choice for default.
However, you are not limited to only kNN. Here is an example using a Decision Tree for nominal value attributes and a Neural Network for numerical attributes.
When I tried to apply this operator by using decision tree or knn, it also showed the same error message "Missing attributes: Input ExampleSet has no attributes. Learning schemes cannot be applied without at least one valide attribute." May I know if I missed anything to apply these algorithms?
Thanks,
Derek
Im also facing the same problem. In the Impute Missing Value operator, where I selected to only input a single attribute name "col-adj" in the process below. Seems like the operator only selects that attribute and pass it into the inner process of the Impute Missing Value operator and therefore returning an error.
Does it mean that we can only impute all missing values and don't get to select which column to impute?
Thanks!
@ecolix SO your setup will not produce any results because you only select one column thereby removing additional feature information for the K-nn algorithm to use to figure out what the missing values are. If you want to replace missing values on a single column, you might want to look at the generic Replace Missing Values and set them at a specific value. When using Impute Missing Values, it's best to use the entire data set and not just a single column.