The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Answers
Hello
It depends on your data but when you import your data then you can see the number of each state for each column in RM.
I hope this helps
Sara
To understand the distribution of your data data Histograms and Bars are really useful since you'll be able to find outliers and the distribution of the attribute.
Scatter plots will let you understand relationships between two attributes (later you'll will validate through correlation matrix and PCA depending on your models if those attributes should or should not be included on the model)
Lines are really useful for Time Series analysis and trends.
If you are analyzing patterns on web clicks Sankey would be useful.
Boxplot is another useful graph since on it you could see the quartiles and outliers of an attribute and grouped by other attributes values.
In general you should expend some time visualizing your data because by doing it you may get some interesting insights and questions that could later be answered and explored during your ETL process.
You could find more about this on chapter 3 of this book:
https://rapidminer.com/resource/data-science-concepts-practice/