Read PDF Error BouncyCastleProvider

johan_CG Member Posts: 19 Contributor II
edited August 2019 in Help
Hi everybody,

I get an unknown error when I run my RapidMiner process on some folders.
To make a long story short, the goal of my process can be described as follows:
 Starting points:
  • Excel file .xls with "Open File"
  • the path of a directory to scan with "Set macro"
 Outputs of the process:
  • a PDF report
  • several ExampleSets
The process counts words and keywords, calculates ratios, and so on.
It works very well on some folders, but on others I get the following pop-up message:

Process failed
org/bouncycastle/jce/provider/BouncyCastleProvider
Reason: org.bouncycastle.jce.provider.BouncyCastleProvider
The process works on HTML and PDF files, but all of the folders also contain xls, csv, and doc files.

Can somebody help me please?
Thanks in advance
Johan

P.S.: I can't post the process because it's too long.

Answers

  • johan_CG Member Posts: 19 Contributor II
    Hi Everybody,

    I've got some new information about my problem.
    The error occurs when I try to analyze a PDF that is "protected".

    Can somebody help me to solve this problem?

    Thanks in advance
    Johan
  • Marco_Boeck Administrator, Moderator, Employee-RapidMiner, Member, University Professor Posts: 1,996 RM Engineering
    Hi,

    Unfortunately, protected PDFs are currently not supported by the Text Extension, so you will have to make sure there are no protected PDFs in any folder you're trying to process.
    Thank you for reporting it, though. I have created an internal ticket for this issue.

    Regards,
    Marco
  • johan_CG Member Posts: 19 Contributor II
    Hi Marco,

    Thank you for your reply.
    I found a small freeware tool that converts PDFs to text and also works on protected files.
    http://www.colorpilot.com/extract-pdf-text.html
    Moreover, this software is open source, so maybe it can be useful for a future release of the Text Processing Extension  ;)

    Regards,
    Johan
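
Marco's workaround (making sure no protected PDFs are in the folder being processed) could be automated with a pre-check that runs before the RapidMiner process. The following is only a minimal sketch in Python, not part of RapidMiner or the Text Extension. It assumes that a protected PDF can be recognized by an `/Encrypt` entry somewhere in the file, which is how encrypted PDFs normally declare their encryption dictionary. This is a heuristic and may occasionally misclassify a file. The function names `looks_encrypted` and `split_pdfs` are hypothetical.

```python
from pathlib import Path


def looks_encrypted(pdf_path):
    """Heuristic check: encrypted PDFs normally declare an /Encrypt entry.

    Assumption: finding the byte string "/Encrypt" anywhere in the file is
    treated as "protected". This is not a full PDF parse.
    """
    data = Path(pdf_path).read_bytes()
    return b"/Encrypt" in data


def split_pdfs(folder):
    """Return (readable, protected) lists of the PDF paths found in folder."""
    readable, protected = [], []
    for path in sorted(Path(folder).iterdir()):
        # Only look at regular files with a .pdf extension (case-insensitive).
        if path.is_file() and path.suffix.lower() == ".pdf":
            if looks_encrypted(path):
                protected.append(path)
            else:
                readable.append(path)
    return readable, protected
```

Calling `split_pdfs` on the directory that the process scans gives two lists. The files in the second list could then be moved out of the folder, or converted to text with a separate tool, before the process is started.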