The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

I have Missing Data (10K out of 40K) I need to use Self-Organized Map (SOM) as clustering method

asiddiqasiddiq Member Posts: 25 Contributor I
edited September 2020 in Help
I have Missing Data (10K out of 40K) I need to use Self -Organized Map (SOM) as clustering method, and I need an initial approach to fill my missing data. 
Draw example using ReapidMiner operators please; I will  appreciate it 
Tagged:

Answers

  • jacobcybulskijacobcybulski Member, University Professor Posts: 391 Unicorn
    25% of missingness is a lot of missing values, if your data has only few attributes, I suggest to discard all examples with missing values and build your clustering system first - 30K examples is a lot examples so may still struggle with building a SOM if you intend to use more than 2 dimensions. Then you could play with missing values, e.g. by creating an imputation model, and apply your clustering model to these examples only.
Sign In or Register to comment.