The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

[Solved] Converting Numerical Values to Binomial

neildugganneilduggan Member Posts: 18 Contributor II
edited November 2018 in Help
Hi

Very new user here so apologies if this questions seems dumb!!

I have some survey data with answers to each question as follows:

Yes = 1
No = 2
Undecided = 3
Refused to answer = 4
etc

I want to remove all values not equal to 1 or 2 or set it to "N/A" and then set values = 1 to "True" and set values = 2 to "False"

At the moment, I'm using "Generate Attributes" to create new attributes called "answer_yes" (answer==1) and "answer_no" (answer==2)

I presume there is a much better way to do this?? Or is this way as simple as any??

Any help appreciated :-)

Thanks

Neil

Answers

  • neildugganneilduggan Member Posts: 18 Contributor II
    One other thing - I can't remove the rows containing the values 3 or 4 as these occur on different lines each time.
  • Marco_BoeckMarco_Boeck Administrator, Moderator, Employee-RapidMiner, Member, University Professor Posts: 1,996 RM Engineering
    Hi,

    does each row only contain the answer to a single question? Is the anwer column nominal? f so, you can use the Filter Examples operator to get ride of rows where the value is not 1 or 2. Then you can use Generate Attributes (on the very same answer column, the operator can overwrite existing values) with this expression:

    if (Answer=="1", "true", "false")
    otherwise you can do

    if (Answer=="1", "true", if (Answer=="0", "false", NaN))
    Regards,
    Marco
  • neildugganneilduggan Member Posts: 18 Contributor II
    Hi

    Thanks for your reponse Marco.

    Each row contains the answer the multiple questions (i.e. each column is a question and each row is a respondent)

    Each answer is nominal (i.e. 1, 2, 3, 4, etc) although is viewed by RapidMiner as numerical.

    I'll have a try when I get home and let you know how I got on

    Thanks
  • neildugganneilduggan Member Posts: 18 Contributor II
    Thanks Marco, that second piece of code did the trick  :)
Sign In or Register to comment.