Input data with each cell containing an array instead of a single numerical or categorical entry.
Urgent!! I have posted this before under a different caption but have not received any response. I am trying to build a model which must take in all inputs as arrays (each cell would contain an array, and all arrays would be the same size). The numerical inputs have to be arrays and the categorical ones also have to be arrays. The reason is that the predicted output is provided for a "group", but each group has several members with separate decision variables. Each member contributes to the group output in a different way depending on its decision variables. Imagine, for example, that I have 1000 football matches as sample data and would like to predict the number of goals that a team will score. I know that the number of goals is based on teamwork and that each player contributes to the goals. So I have the decision variables for each player on the team (age, skill level, experience, role, etc.), but my predicted output (number of goals) is a group value, so I can't assign an output to each player; I can only assign an output to each team. I still need to be able to provide the input variables individually for every player of that team, as an array in each cell. How is this kind of problem solved in RapidMiner?
Best Answer
Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
You don't need to use arrays to do this, but you do need to think about how you structure your dataset for RapidMiner to produce the outcome you want. In your example, you would need to predict at the level of each team, so each row would be one game for one team, and the label would be the number of goals scored (again for the team). You would then have attributes for each individual player (as many as you need), plus indicators for which players were participating in each game for each team. So you could end up with hundreds of attributes, depending on whatever you are tracking for each individual player.
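To make the layout described above concrete, here is a minimal pandas sketch of reshaping player-level records into one row per game per team, with a participation indicator and one attribute column per player. The table names, column names (game_id, team, player, skill, goals) and values are all made up for illustration, and the reshaping is done outside RapidMiner here; treat it as one possible way to prepare the data, not the only one.

```python
import pandas as pd

# Hypothetical long-format data: one row per player appearance in a game.
appearances = pd.DataFrame({
    "game_id": [1, 1, 1, 2, 2],
    "team":    ["A", "A", "A", "A", "A"],
    "player":  ["p1", "p2", "p3", "p1", "p4"],
    "skill":   [7, 5, 8, 7, 6],
})

# Hypothetical label table: goals scored per team per game.
goals = pd.DataFrame({"game_id": [1, 2], "team": ["A", "A"], "goals": [3, 1]})

# One row per (game_id, team), one skill column per player.
# Cells are NaN where a player did not take part in that game.
skills = appearances.pivot_table(
    index=["game_id", "team"], columns="player", values="skill", aggfunc="first"
)

# Participation indicators: 1 if the player appeared in the game, else 0.
played = skills.notna().astype(int).add_prefix("played_")

# Fill the attribute with 0 for absent players; the indicator column
# tells the model whether that 0 is a real value or a placeholder.
skills = skills.fillna(0).add_prefix("skill_")

# Final example set: indicators + player attributes + team-level label.
wide = (
    pd.concat([played, skills], axis=1)
    .reset_index()
    .merge(goals, on=["game_id", "team"])
)
print(wide)
```

With more attributes per player (age, experience, role), the same pivot would be repeated per attribute, which is how the attribute count can grow into the hundreds.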
Answers
To my knowledge, there is no way to handle arrays the way you are requesting with native RapidMiner operators, but you might be able to accomplish it with R or Python scripting.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
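As a rough idea of what the Python scripting route could look like, the sketch below collapses each team's "array" of player values into a fixed set of summary attributes (mean, min, max, and counts per role), so every team ends up as one ordinary row. The function name summarise_teams, the column names, and the sample values are hypothetical, and this is a swapped-in simplification (aggregation instead of true array inputs); how you wire it into a RapidMiner process depends on your scripting setup and is not shown.

```python
import pandas as pd

# Hypothetical player-level data: one row per player per game.
players = pd.DataFrame({
    "game_id": [1, 1, 1, 2, 2],
    "team":    ["A", "A", "A", "B", "B"],
    "age":     [24, 30, 27, 22, 35],
    "role":    ["striker", "defender", "striker", "keeper", "striker"],
})


def summarise_teams(players: pd.DataFrame) -> pd.DataFrame:
    """Return one row per (game_id, team) with fixed-length summary features."""
    keys = ["game_id", "team"]

    # Numerical "array" -> summary statistics per team.
    numeric = players.groupby(keys).agg(
        age_mean=("age", "mean"),
        age_min=("age", "min"),
        age_max=("age", "max"),
        n_players=("age", "size"),
    )

    # Categorical "array" -> count of players per role per team.
    roles = pd.crosstab(
        [players["game_id"], players["team"]], players["role"]
    ).add_prefix("n_")

    return numeric.join(roles).reset_index()


team_rows = summarise_teams(players)
print(team_rows)
```

Summaries lose the per-player detail, so this only fits if the order and identity of the players do not matter for the prediction.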