The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Visualizing Data Set with tons of small values.
Hi all, I'm new to rapidminer and just started to use it today. I have an extremely large dataset that I am trying to visualize. However most of the attributes contains lots of zeroes in them, as a result, when I visualize using a histogram, it ends up looking like this:
If I turn on the logarithmic scale in the y-axis it still looks pretty weird to me.
so my question is, are there anything to make the data look better? I thought of removing outliers, but due to the large amounts of smaller values, the outliers end up being most of the data that have higher values. Tried splitting zeroes and non zeroes but since most of them are small data and not just zeroes, it end up looking pretty much the same. Thanks in advance!
If I turn on the logarithmic scale in the y-axis it still looks pretty weird to me.
so my question is, are there anything to make the data look better? I thought of removing outliers, but due to the large amounts of smaller values, the outliers end up being most of the data that have higher values. Tried splitting zeroes and non zeroes but since most of them are small data and not just zeroes, it end up looking pretty much the same. Thanks in advance!
Tagged:
0
Best Answer
-
rjones13 Member Posts: 204 UnicornHi @gerby,
Unfortunately, this is the sort of answer in that it really depends what your end goal for visualizing the data is. If it's just to produce a visual overview for the data, then I think the logarithmic scale does a reasonable job of this. You'll see each line corresponds to a power of 10, so moving one tick mark up corresponds to a 10x increase. I'd also potentially filter out values above 10k to produce more granularity. What do you think?
Best,
Roland1
Answers