The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"DBSCAN Error: IndexoutofBoundexception"
hi,
I am not very familiar with rapidminer(yet!!) so I hope my problem is just a failure of a beginner who hasn´t found where to find the
right solution or gets a feeling for the error source.
I want to use DBSCAN Clustering in order to cluster image histograms with values for each Bin between 0.0 and 100 %.
My example source is an arff File.
The process is set up with the examplesource and then follows a NominalAttributeFilter before the exampleset is put into
DBSCAN.
I get an IndexOutOfBoundsException failure with the Message "IndexOutOfBoundsException caught: Index: 0 , Size: 0".
In Weka I don´t get such an error, so I think my example data seems to be valid.
Any suggestions? Thank you in advance!
I am not very familiar with rapidminer(yet!!) so I hope my problem is just a failure of a beginner who hasn´t found where to find the
right solution or gets a feeling for the error source.
I want to use DBSCAN Clustering in order to cluster image histograms with values for each Bin between 0.0 and 100 %.
My example source is an arff File.
The process is set up with the examplesource and then follows a NominalAttributeFilter before the exampleset is put into
DBSCAN.
I get an IndexOutOfBoundsException failure with the Message "IndexOutOfBoundsException caught: Index: 0 , Size: 0".
In Weka I don´t get such an error, so I think my example data seems to be valid.
Any suggestions? Thank you in advance!
Tagged:
0
Answers
unfortunatly you are right. There was a small bug, causing this exception if the parameters have been set to values which result in classifying all examples as noise.
This is now fixed in the repository (head). Nevertheless no sensefull results will be delivered, since every example is classified as noise.
DBScan is one of the most instable clustering algorithms I have seen so far, so tuning of parameter epsilon is very important to get a usefull result.
Greetings,
Sebastian
I will check out from the repository and try again but will consider your further hints on dbscan so that I play a little bit with the parameters but
perhaps look for an alternative for clustering.
Greets
Thomas