The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

"UTF-8 Strings generated by Execute R not displaying properly"

aruberutouaruberutou Member Posts: 23 Contributor II
edited June 2019 in Help
Hi,

I am using Rapidminer with "Execute R". I have a script that is downloading and parsing JSON documents.

In Rapidminer, this process is looped and then appended and displayed.

The problem is that UTF-8 strings are not being displayed correctly in Rapidminer; e.g.; "Study objective is to assign patients<U+2019> management with". As you can see the UTF-8 part is not being encoded correctly.

I have already changed my encoding preferences to "UTF-8", and check that the script displays as intended in R.

I am also converting polynomial strings that are being returned by "Execute R" to text via "Nominal to String".

I'm just guessing, but perhaps it may have something to do with how Rapidminer handles character strings being output from "Execute R"?

Any help would be appreciated. Thanks,
Tagged:

Answers

  • Marco_BoeckMarco_Boeck Administrator, Moderator, Employee-RapidMiner, Member, University Professor Posts: 1,996 RM Engineering
    Hi,

    due to various R components ignoring all our efforts to globally set the encoding to UTF-8 for the entire script, communication with R is sadly limited to the system default encoding at this point in time. Characters not representable in this encoding will break when processing in R.

    Regards,
    Marco
  • aruberutouaruberutou Member Posts: 23 Contributor II
    Hi,

    Thanks for the quick response. My system default settings should allow UTF-8. It works in RStudio without having the locale set.

    I noticed, however, that text/character columns in dataframes (R) are output (to Rapidminer) as polynomials rather than text.

    I was wondering if this, perhaps, was responsible?

    If so, is there a data-type in R that that Rapidminer will encode as text upon output?

    Thanks,
Sign In or Register to comment.