How to get the date out of a document
I have PDF files (articles), and each of them has a date near the top of the page, though not at the very top. The date format is like 19 April 2012. I want to get the first date that shows up and set it as an attribute called "Mydate". Is that possible in RapidMiner, and how would I go about doing it? Thank you.
Answers
You probably need to use Read Document, Process Documents, and Keep Document Part, together with a clever regex. It is hard to say exactly which without seeing the document itself.
Cheers,
Martin
Dortmund, Germany
rapid-i.com/rapidforum/index.php/topic,8874.msg29914.html
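For illustration only, here is a minimal sketch of the kind of regex the answer hints at, written in Python rather than as a RapidMiner process. It assumes the text has already been extracted from the PDF, that dates look like "19 April 2012" with English month names spelled out in full, and that the first valid match is the one wanted. The names `DATE_PATTERN`, `first_date`, and `sample` are hypothetical. The pattern itself should carry over with little change to a regex-based operator, assuming it accepts this syntax.

```python
import re
from datetime import date

# English month names in calendar order; position + 1 gives the month number.
MONTHS = [
    "January", "February", "March", "April", "May", "June",
    "July", "August", "September", "October", "November", "December",
]

# Day (1 or 2 digits), whitespace, full month name, whitespace, 4-digit year.
DATE_PATTERN = re.compile(
    r"\b(\d{1,2})\s+(" + "|".join(MONTHS) + r")\s+(\d{4})\b"
)


def first_date(text):
    """Return the first valid 'D Month YYYY' date in text, or None."""
    for match in DATE_PATTERN.finditer(text):
        day, month_name, year = match.groups()
        try:
            return date(int(year), MONTHS.index(month_name) + 1, int(day))
        except ValueError:
            # Something like "31 February 2012": skip it and keep looking.
            continue
    return None


# Hypothetical extracted text standing in for the top of an article.
sample = "Some Journal\nVolume 3\n19 April 2012\nUpdated 2 May 2012"
mydate = first_date(sample)
print(mydate)
```

The value returned for the first match is what would be stored in the "Mydate" attribute. If the real documents use abbreviated months or a different order, the pattern would need adjusting.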