
Use Python to read several RapidMiner ExampleSets

frasau Member Posts: 9 Learner I
Hi everyone!
I am a RapidMiner beginner.

I created a Loop in RapidMiner that reads a series of CSV files from a folder on my PC.
Each loaded ExampleSet has only two columns.
For just one of these columns, I want to find the larger values with Python, that is, the values that are at least as large as both of their neighbours.

How can this be done?

So far I have written Python code that reads ONLY ONE of these CSV files:

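values = []

with open(r'C:\Users\FR\Desktop\First.csv') as file:
    for line in file:
        # each line is expected to hold a single integer
        item = int(line)
        values.append(item)

results = []

# start at 1 and stop one short of the end so both neighbours exist
for i in range(1, len(values) - 1):
    a = values[i-1]
    b = values[i]
    c = values[i+1]
    # keep b when it is at least as large as both neighbours
    if b >= a and b >= c:
        results.append(b)

print(results)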

Is there a way to avoid repeating this code for each file?
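
Outside RapidMiner, one way to avoid the repetition is to wrap the logic in functions and call them once per file. This is only a minimal sketch, assuming every file holds one integer per line as in the snippet above. The names `read_values`, `local_maxima` and `maxima_per_file` are hypothetical, as is the folder path in the final comment.

```python
from pathlib import Path


def read_values(path):
    """Read one integer per line, skipping blank lines."""
    with open(path) as file:
        return [int(line) for line in file if line.strip()]


def local_maxima(values):
    """Return every inner item that is >= both of its neighbours."""
    return [
        values[i]
        for i in range(1, len(values) - 1)
        if values[i] >= values[i - 1] and values[i] >= values[i + 1]
    ]


def maxima_per_file(folder):
    """Map each CSV file name in the folder to its list of local maxima."""
    return {
        path.name: local_maxima(read_values(path))
        for path in sorted(Path(folder).glob("*.csv"))
    }


# Hypothetical usage; point the path at the folder that holds the CSV files:
# print(maxima_per_file(r"C:\Users\FR\Desktop\csv_files"))
```

Keeping the peak test in its own function means it can be checked on a plain list before it is run against real files.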

Answers

  • btamas Employee-RapidMiner, Member Posts: 11 RM Team Member
    edited June 2020
    Hi frasau,

    If you post your process here, someone may be able to give you more specific advice. Just copy the process XML into the post. If you are not already using the Execute Python operator from the Python Scripting Extension, I highly recommend it. With the Execute Python operator you can work on a pandas DataFrame. You also have to define an rm_main function in your code. Something like this, which stores the larger of columns "a" and "b" in a new column "c", should do the trick:

    import pandas

    def rm_main(df):
        df["c"] = df.apply(lambda x: x["a"] if x["a"] > x["b"] else x["b"], axis=1)
        return df


    Hope this helps!
  • frasau Member Posts: 9 Learner I
    Thank you very much, @btamas!
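
Building on the Execute Python suggestion above, the neighbour comparison from the question can also be written directly against the DataFrame. If the operator sits inside the Loop, it should run once per file, so the code would not need to be repeated. This is only a sketch under assumptions: the column name `value` is hypothetical and has to be replaced with the real attribute name, and the script keeps only the rows that are at least as large as both neighbours.

```python
import pandas as pd

# Hypothetical column name; replace it with the attribute name in your ExampleSet.
COLUMN = "value"


def rm_main(df):
    col = df[COLUMN]
    # A row qualifies when its value is >= both the previous and the next value.
    # shift() produces NaN at the edges, so the first and last rows never qualify,
    # which mirrors skipping the items without two neighbours.
    is_peak = (col >= col.shift(1)) & (col >= col.shift(-1))
    # Return only the qualifying rows as the operator's output.
    return df[is_peak].reset_index(drop=True)
```

To try it outside RapidMiner, call `rm_main(pd.DataFrame({"value": [1, 3, 2, 5, 5, 4]}))` and inspect the returned rows.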