The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

How can the Generate Data operator be configured to create right or left skewed distributions?

michaelglovenmichaelgloven RapidMiner Certified Analyst, Member Posts: 46 Guru
How can the Generate Data operator be configured to create right or left skewed distributions? I see the normal distribution and would like to adjust this for skew and well as trimming negative tails.

Best Answer

Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist

    i think you cannot. What kind of distribution would you need?

    Best,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • michaelglovenmichaelgloven RapidMiner Certified Analyst, Member Posts: 46 Guru
    Hi Martin, we've created a Monte Carlo process in RM and have been able to successfully apply a single gaussian cluster to input data to our function. However, some of the input data by it's nature is a right or left skew distribution and\or must be non-negative. The practical use case is this - when we calculate the maximum burst pressure of steel pipe using a specified pipe property from the mill (let's say 60,000 psi), it is very unlikely the specified property is an average. The 60,000 psi more often represents maybe 2-3 standard deviations to the left and the actual average is probably 68,000 psi with a small tail to the right capped at some maximum like 70,000 psi. So, we're trying to figure out how to shape a normal distribution with an understanding what our tails look like. I found ways to do this on stack exchange but the math requires calculus (which I don't know how to do easily in RapidMiner), and we're not trying to get an exact solution, just something that better represents reality.
Sign In or Register to comment.