"[SOLVED]Process Documents from files does not produce anything"
Hi,
I'm a new RapidMiner user and I have run into a few problems.
1) I can't update the program directly from the GUI because of my workplace's proxy settings. Does this cause any harm? For example, is this warning a problem: "PM WARNING: Could not open http://rapid-i.com/wiki/index.php?title=Text:Tokenize: connect timed out"?
2) I've watched some videos on how to process textual data and I'm trying to implement something similar. I've installed the Text Processing extension. The problem is that "Process Documents from Files" doesn't produce anything. The only thing that appears is this warning: "PM WARNING: Error creating renderer: java.lang.ArrayIndexOutOfBoundsException: 0", which tells me nothing. The results say that there are no examples. Any solutions or suggestions?
This is the process; the text files are in .txt format.
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<process version="6.0.002">
  <context>
    <input/>
    <output/>
    <macros/>
  </context>
  <operator activated="true" class="process" compatibility="6.0.002" expanded="true" name="Process">
    <process expanded="true">
      <operator activated="true" class="text:process_document_from_file" compatibility="5.3.002" expanded="true" height="76" name="Process Documents from Files" width="90" x="179" y="30">
        <list key="text_directories">
          <parameter key="Fail" value="D:\Logs\fail.null"/>
          <parameter key="Pass" value="D:\Logs\pass.null"/>
        </list>
        <process expanded="true">
          <operator activated="true" class="text:tokenize" compatibility="5.3.002" expanded="true" height="60" name="Tokenize" width="90" x="112" y="30"/>
          <operator activated="true" class="text:stem_snowball" compatibility="5.3.002" expanded="true" height="60" name="Stem (Snowball)" width="90" x="313" y="30"/>
          <operator activated="true" class="text:transform_cases" compatibility="5.3.002" expanded="true" height="60" name="Transform Cases" width="90" x="514" y="30"/>
          <connect from_port="document" to_op="Tokenize" to_port="document"/>
          <connect from_op="Tokenize" from_port="document" to_op="Stem (Snowball)" to_port="document"/>
          <connect from_op="Stem (Snowball)" from_port="document" to_op="Transform Cases" to_port="document"/>
          <connect from_op="Transform Cases" from_port="document" to_port="document 1"/>
          <portSpacing port="source_document" spacing="0"/>
          <portSpacing port="sink_document 1" spacing="0"/>
          <portSpacing port="sink_document 2" spacing="0"/>
        </process>
      </operator>
      <connect from_port="input 1" to_op="Process Documents from Files" to_port="word list"/>
      <connect from_op="Process Documents from Files" from_port="example set" to_port="result 1"/>
      <connect from_op="Process Documents from Files" from_port="word list" to_port="result 2"/>
      <portSpacing port="source_input 1" spacing="0"/>
      <portSpacing port="source_input 2" spacing="0"/>
      <portSpacing port="sink_result 1" spacing="0"/>
      <portSpacing port="sink_result 2" spacing="0"/>
      <portSpacing port="sink_result 3" spacing="0"/>
    </process>
  </operator>
</process>
Br,
p3kka
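For anyone debugging a similar "no examples" result, one quick check outside RapidMiner is to list what the process is actually pointed at. The sketch below is only an illustration: it embeds a trimmed copy of the process above, pulls out the `text_directories` entries, and reports whether each path exists and how many .txt files it holds. The helper names `text_directories` and `describe` are made up for this example, and nothing here claims that the paths were the cause of the problem in this thread.

```python
import os
import xml.etree.ElementTree as ET

# Trimmed copy of the posted process; only the text_directories part matters here.
PROCESS_XML = r"""<process version="6.0.002">
  <operator activated="true" class="process" name="Process">
    <process expanded="true">
      <operator activated="true" class="text:process_document_from_file" name="Process Documents from Files">
        <list key="text_directories">
          <parameter key="Fail" value="D:\Logs\fail.null"/>
          <parameter key="Pass" value="D:\Logs\pass.null"/>
        </list>
      </operator>
    </process>
  </operator>
</process>"""


def text_directories(process_xml):
    """Return {class_name: path} for every text_directories list in the process."""
    root = ET.fromstring(process_xml)
    dirs = {}
    for lst in root.iter("list"):
        if lst.get("key") == "text_directories":
            for param in lst.findall("parameter"):
                dirs[param.get("key")] = param.get("value")
    return dirs


def describe(path):
    """Say whether a configured path is an existing folder and count its .txt files."""
    if not os.path.isdir(path):
        return "not an existing directory"
    count = sum(1 for name in os.listdir(path) if name.lower().endswith(".txt"))
    return f"{count} .txt file(s)"


if __name__ == "__main__":
    for label, path in text_directories(PROCESS_XML).items():
        print(f"{label}: {path} -> {describe(path)}")
```

If a path is reported as missing or as holding zero .txt files, that is worth ruling out before looking at the installation or the license.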
Answers
Br,
p3kka
No, this should not be a license problem. I can execute your process just fine without any license.
To answer your questions:
1) No, you can safely ignore these warnings. Note that you can set up a proxy in the RapidMiner preferences, which might help in your case.
2) We just released RapidMiner Studio 6.0.003, which will fix the error you mentioned. However, the process should produce results just fine: I set both folders to a test folder I had just created, containing only one file, test.txt, which consists of nothing more than "Hello world!". The process works fine for me. Can you try that as well? If it still does not work, please post the contents of rapidminer-studio.log, which can be found inside the USER_HOME/.RapidMiner/ folder.
Regards,
Marco
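The test setup described above (one folder containing a single test.txt with "Hello world!") can also be created with a short script, which rules out typos in the file name or encoding surprises. This is a minimal sketch; the function name `make_test_folder` and the temporary-folder prefix are assumptions made for the example.

```python
import tempfile
from pathlib import Path


def make_test_folder(base=None):
    """Create a folder holding a single test.txt with the text 'Hello world!'."""
    # Use the given folder, or a fresh temporary one when none is given.
    folder = Path(base) if base else Path(tempfile.mkdtemp(prefix="rm_text_test_"))
    folder.mkdir(parents=True, exist_ok=True)
    (folder / "test.txt").write_text("Hello world!", encoding="utf-8")
    return folder


if __name__ == "__main__":
    folder = make_test_folder()
    print("Point both text directories at:", folder)
```

Both entries of the "text directories" parameter can then be set to the printed folder, as in the test described above.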
I did what you suggested (created a text file that contains just "Hello world!" and put it in both directories). It produces the same result: nothing. I also tried uninstalling and re-installing the program, but that didn't help either. I still suspect that something is wrong with the installation (or with the user), so I'm going to do a clean install of the program.
I couldn't find the log file that you asked for. Where is it located on Windows?
Br,
p3kka
The log I am talking about can be found in C:\users\YOURNAME\.RapidMiner\ and is called rapidminer-studio.log. Can you please close RapidMiner Studio, delete the file, start Studio again, re-run the process, and then post the fresh log? The one you posted does not contain the information I was looking for.
Regards,
Marco
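To locate the log without browsing for it, a small script can resolve the path under the user's home folder and print the last lines. This is a sketch that assumes the location named above (the .RapidMiner folder in the user's home directory); `studio_log_path` and `tail` are hypothetical helper names, not part of RapidMiner.

```python
from pathlib import Path


def studio_log_path(home=None):
    """Path of rapidminer-studio.log inside the .RapidMiner folder of the home directory."""
    home = Path(home) if home else Path.home()
    return home / ".RapidMiner" / "rapidminer-studio.log"


def tail(path, lines=50):
    """Return the last `lines` lines of the file, or an empty list if it does not exist."""
    path = Path(path)
    if not path.is_file():
        return []
    text = path.read_text(encoding="utf-8", errors="replace")
    return text.splitlines()[-lines:]


if __name__ == "__main__":
    log = studio_log_path()
    print("Looking for:", log)
    for line in tail(log):
        print(line)
```

An empty result means the file is not at that location, in which case the folder name may differ on that installation.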
There is no folder with exactly that name, but I think I found the equivalent one. It contains, for example, launcher.log, a licenses folder, etc. Anyway, here is the content of the log (rapidminer-studio.log) after re-installation:
And by the way, I have never seen anyone help this fast and this accurately in any forum or community. So thank you for that.
Br,
p3kka
This is a bit strange. The process you posted works just fine for me. Can you do me a favor and check which version of the Text Processing Extension you are using? You can do so by selecting "Help" in the top menu, then "About Installed Extensions", and then "About Text Processing Extension...".
Regards,
Marco
It is the newest version (5.3.002).
I have to say that the whole process worked before (about three weeks ago), when I tried this application for the first time. At that time I was using the free trial license, which has since expired. When I tried again now, this problem occurred. That's why I asked about the license.
Also, Create Document (or Read Document) followed by Process Documents works. The problem seems to be only with Process Documents from Files.
Br,
p3kka
I have removed my license to test this, thus running the Starter version. The process still works for me. I must say, I have no clue what might be the problem :-[
Regards,
Marco
I have come back to this issue once again.
I installed the software on my home computer and the same thing happens there. Now I'm seriously thinking that I'm doing something wrong or that something is wrong with my account.
-p3kka
After I got the Internet connection working with RapidMiner, everything started working as expected.
-p3kka
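Since the fix turned out to involve the Internet connection, it can help to test reachability from the same machine outside RapidMiner. The thread does not explain why connectivity mattered, so the sketch below only checks whether a URL can be opened, optionally through a proxy. The proxy address `http://proxy.example:8080` is a placeholder, and `build_opener` and `can_reach` are names invented for this example.

```python
import urllib.error
import urllib.request


def build_opener(proxy_url=None):
    """Build a URL opener; route HTTP/HTTPS through proxy_url when one is given."""
    if proxy_url:
        handler = urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
    else:
        # No explicit proxy: fall back to the system/environment proxy settings.
        handler = urllib.request.ProxyHandler()
    return urllib.request.build_opener(handler)


def can_reach(url, opener, timeout=5):
    """Return True if a server answered the request, False if the connection failed."""
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server replied with an error status, so the connection itself works.
        return True
    except (urllib.error.URLError, OSError, ValueError):
        return False


if __name__ == "__main__":
    # URL taken from the warning quoted in the first post.
    url = "http://rapid-i.com/wiki/index.php?title=Text:Tokenize"
    # Pass e.g. "http://proxy.example:8080" (placeholder) to test through a proxy.
    opener = build_opener()
    print("reachable" if can_reach(url, opener) else "not reachable")
```

If the URL is reachable from the script but not from RapidMiner, the proxy settings in the RapidMiner preferences are the next thing to compare.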