The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Access e-mails with the Text Processing Extension

David_ADavid_A Administrator, Moderator, Employee-RapidMiner, RMResearcher, Member Posts: 297 RM Research
edited November 2018 in Knowledge Base

Nowadays e-mails might be the most common source of text you encounter every day. They are also a common source of data for text mining applications, e.g., spam detection or sentiment analysis.

Having a direct access to an e-mail account and processing mails automatically can be a great boon when putting text mining tasks into production. New incoming mails can be processed and their content scored, which can even trigger more reactions.


Read Documents (Mail) 

The Text Processing Extension available at the RapidMiner marketplace includes an Operator that allows exactly that:

Read Documents (Mail).

It provides access using the common IMAP protocoll or the older POP3 standard. All you need to know is the host address, the log-in data and the used protocol. Also quite important is the folder parameter, as you seldom want to download the complete inbox folder.

 

Connection properties

Unfortunately it is not always as simple as it could be. Some e-mail provider requires you to set additional parameters to access their services and it is sometimes hidden at their sites.

Below are the settings for the two most common providers: Microsoft Outlook (Office365) and Gmail.


Outlook 365 

 

Parameter settings:

 

 Outlook_parameterspng

Connection properties:

ReadDocuments_Outlook_propertiespng

 

Gmail

Parameter settings:

Gmail_parameterspng

 

Connection properties:

ReadDocuments_GMail_propertiespng

 

 

Similar is the Process Documents from Mail Store Operator, which requires the same properties settings. Whith this Operator it is possible to directly work on single e-mails. The Operator also allows to download attached files

Comments

  • osadasliitosadasliit Member Posts: 3 Learner III

    Hi 

    I am new to rapidminer how do you access Gmail parameter settings forms. in send mail operator only provide 

    to 

    supbject

    html check box

    body text

    headers 

     

    Gmail

    Parameter settings:  form

  • osadasliitosadasliit Member Posts: 3 Learner III

    Hi 

    if I want to send mail (not read mail) how do we setup smtp information in rapidminer studio for gmail let say we use send mail operator 

    to 

    supbject

    html check box

    body text

    headers 

     

    thanks

     

Sign In or Register to comment.