The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Easy way to calculate metrics of different status
Experts,
we have a data set like below. We wanted to gather some stats on how many days different client(id) stay on each status, before closing.. and also what %age of the total client stay in each stage. A visual representation would be phenomenal. I would greatly appreciate if you can share a sample process that would does gather these metrics.
id date status
1 1/1/18 proposal
1 1/8/18 analysis
1 1/20/18 accepted
1 1/30/18 closed
2 2/1/18 proposal
2 2/10/18 analysis
2 2/12/18 accepted
2 2/18/18 closed
...........
............
Many Thanks in Advance for your time
we have a data set like below. We wanted to gather some stats on how many days different client(id) stay on each status, before closing.. and also what %age of the total client stay in each stage. A visual representation would be phenomenal. I would greatly appreciate if you can share a sample process that would does gather these metrics.
id date status
1 1/1/18 proposal
1 1/8/18 analysis
1 1/20/18 accepted
1 1/30/18 closed
2 2/1/18 proposal
2 2/10/18 analysis
2 2/12/18 accepted
2 2/18/18 closed
...........
............
Many Thanks in Advance for your time
0
Answers
I propose the following process :
- plot the different durations according to the step for each Id client
- plot for example the average of each step according the step
Regards,
Lionel
OK let's get ready for a (long ?) feedback...
1. General
My general feeling is that advanced operations are more intuitive . In deed in previous releases of RM, building some charts in Charts
panel was very intuitive, but in Advanced Charts panel , that was sometimes tricky. RM 9.2 fixs that.
2. Possible improvments (from my humble point of view) :
a. the only thing that is is not immediatly obvious is to "change color of datapoints" : It tooks me 10 minutes before understand how it works…For me, this method is good… but it lacks contextual help for this operation (« drag the icons to define color order » for example).
b. It lacks a feature which just (re)auto-scale the chart after setting a « Axis min value » and/or a « Axis max value » in the « Axis » menus.
c. It lacks a "Add new Z-Axis" option with the Scatter 3D chart style (like it exists a "Add new Y-Axis"...)
3. Bug(s) report :
a. With Scatter 3D chart style, there is no effect when we set the Axis color to a new color :
b. Always with Scatter 3D chart style, Y-Axis graduation "disappears" when we set a new Axis min Value and a new Axis max value.
c. With Boxplot and Deviation chart styles, Y-axis name does not appear.
d. Sometimes (random bug), when switching between « Results » panel and « Design » panel,there is a lag of 4 / 5 sec with a black chart screen.
To conclude, I would say that there is (once again) a great work of RapidMiner's team.
Regards,
Lionel
Scott
The challenge with the actual data set is not every id will go through all stages.. some will skip stages and the date is not continuous for all Id's like below. Exact depiction of the true state of the data is below. So not sure if a Lag operator with 1 will help us here? how can we dynamically adjust the lag operator based on the id and available stages or perform a Lag with group by id? or can we completely avoid a lag operator here
id date status
1 1/1/18 proposal
1 1/8/18 analysis
1 ? accepted
1 1/30/18 closed
2 2/1/18 proposal
2 2/18/18 closed
3 1/2/18 proposal
3 1/29/18 analysis
3 ? accepted --> missing
3 2/5/18 closed
---
----
Also, I see you have converted to milliseconds "date_diff([date-1],date)/(1000*3600*24)" can you provide me the rational please.
Many Thanks
a bit off topic, but your data looks a lot like process mining data. you might want to have a look at ProM.
BR,
Martin
Dortmund, Germany
Thanks for your feedback! Let me reply to your points:
2 a) Yep, certainly not the best UX. It works better for color axes with linear gradients, I think it's more natural there but we tried to re-use it for the time being. The entire configuration is not final yet, we have some ideas on how to improve it in the future
2 b) The 'x' is a good idea for those boxes! Until that arrives, you can always just completely remove the entire content from any of the textboxes, and it will go right back to the default.
3 a) Hm indeed, seems like axis lines are not showing for 3D plots, ever.
3 b) Works for me, do you happen to have a data set for which it does not work?
3 c) Yep, same as in the old charts. To be honest, I had no idea what to put there, as the attributes are at the bottom already. If you have any good suggestions for Parallel Coordinates, Deviation, and Boxplot, I'd gladly take them.
3 d) Interesting. Never experienced that on our test systems. What operating system are you using exactly, if I may ask?
Regards,
Marco
First, thanks for your answers ! Let me precise several points :
3 b) Works for me, do you happen to have a data set for which it does not work?
I'm using the Iris Dataset (in the Samples repository), but I forgot to mention that is when we set a new "Axis min Value" and a new "Axis max value" of the X-Axis (and not the Y-Axis).
3 c) Yep, same as in the old charts. To be honest, I had no idea what to put there, as the attributes are at the bottom already. If you have any good suggestions for Parallel Coordinates, Deviation, and Boxplot, I'd gladly take them.
In deed, I forgot to mention that it comes when I used "Group by" option (for example with Boxplots). In this case, the attributes aren't anymore at the bottom...
3 d) Interesting. Never experienced that on our test systems. What operating system are you using exactly, if I may ask?
I'm executing Windows 10.
I'm rigorous, maybe too rigorous... , but I think that all these elements will improve the user experience.
And more seriously, once again, I have the feeling that there was a lot of work for this new visualization concept !
Regards,
Lionel
Scott
Does RM support "Partion by" ?