"Extract data from a website to Excel-file"
Hi!
I want to extract some data from the following website:
http://ec.europa.eu/sanco_pesticides/public/index.cfm?event=substance.selection
I want to generate a list in Excel with an ID-column and a pesticide name-column. Both should be extracted from the website, section "2) Select Pesticide".
As I am an RM beginner, I did not manage to cut the HTML document and generate the two columns... The only thing I was able to do was to get the web page onto my computer.
Could anyone give me some hints I could start with?
Thanks in advance!
Currant
Answers
This seems to be an easy task. After retrieving the website, I would suggest using XPath, since the desired data is structured within an HTML select element. If you need to handle multiple XPath or RegEx matches, the one operator of interest is "Cut Document" (contained in the Text Processing extension).
Here is a small working example. You just have to add the id and convert the collection into a table ("Documents to Data").
Best regards
Matthias
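For readers who want to see the XPath idea outside RapidMiner, here is a minimal Python sketch. It is not the process posted in this thread. The markup, ids and names in it are invented placeholders, and it assumes the snippet is well-formed XML, which the real page may not be (in that case a tolerant HTML parser would be needed). It uses only the limited XPath subset of the standard library's ElementTree.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed stand-in for the page's "2) Select Pesticide" list.
# The ids and names below are invented for illustration only.
SAMPLE_HTML = """<html><body>
  <select name="substance">
    <option value="">-- select --</option>
    <option value="101">Example substance A</option>
    <option value="102">Example substance B</option>
  </select>
</body></html>"""


def extract_options(markup, xpath=".//select/option"):
    """Return (id, name) pairs for every <option> matched by the XPath."""
    root = ET.fromstring(markup)
    rows = []
    for option in root.findall(xpath):
        option_id = option.get("value", "").strip()
        name = (option.text or "").strip()
        if option_id:  # skip placeholder entries that carry no id
            rows.append((option_id, name))
    return rows


rows = extract_options(SAMPLE_HTML)
for option_id, name in rows:
    print(option_id, name)
```

Each match yields one row, which mirrors what "Cut Document" followed by "Documents to Data" produces: one document per match, then one table row per document.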
Now I managed to get the HTML files onto my computer, and I could extract the HTML information from the file. See the script below.
Code:
Now, my question is:
How can I export the attributes "Namen" and "Animal" to Excel?
(It's probably an easy job... but I could not work it out.)
currant
You can select the desired attributes via the "Select Attributes" operator: choose "subset" as the attribute filter type and pick the attributes you want. Then add a "Write Excel" operator, specify the file, and voilà.
Your modified process (you might have to change the selected attributes):
Regards,
Marco
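The same two steps (keep a subset of attributes, then write them out) can be sketched in Python as well. This is only an illustration, not the RapidMiner process. It writes CSV, which Excel opens directly, because the standard library has no .xlsx writer. Only the attribute names "Namen" and "Animal" come from this thread; the rows and the "Other" column are invented.

```python
import csv
import io

# Hypothetical rows standing in for the example set; only the attribute
# names "Namen" and "Animal" come from the thread, the values are invented.
records = [
    {"Namen": "Example A", "Animal": "Example X", "Other": "ignored"},
    {"Namen": "Example B", "Animal": "Example Y", "Other": "ignored"},
]


def write_selected(records, attributes, out):
    """Write only the chosen attributes of each record as CSV to `out`."""
    # extrasaction="ignore" drops every key that is not in `attributes`.
    writer = csv.DictWriter(out, fieldnames=attributes, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(records)


buffer = io.StringIO()
write_selected(records, ["Namen", "Animal"], buffer)
csv_text = buffer.getvalue()
print(csv_text)
```

To produce a file instead of an in-memory string, pass a handle opened with `open(path, "w", newline="")`, where `path` is whatever file name you choose.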
Thanx a lot for your support!
currant