Web Scraping - dynamic content

Hyperrick · May 2020

Hello folks,

we're trying to create a Text Mining project for our university class. Our goal is to scrape the data of our university courses description and look up Udemy for best matching courses. So far so good, now I realized that a saved HTML file of an Udemy course is missing relevant information like the "price" tag. Do you have an idea how to scrape those missing information?

Best regards

Patrick

JEdward · May 2020

To save you time and effort I recommend you could use Parsehub to build the scraper for Udemy.

Instructions to use it are really simple and the free version allows you to pull in 200 pages per run (should be enough for your task, but you can also contact them to ask about their academic program.

Best of all when you build your scraper it has a RestAPI which means you can then call it from your RapidMiner process and get the results back directly.

https://www.parsehub.com/

Hyperrick · May 2020

Thank you that solved it!

Howdy, Stranger!

Quick Links

Categories

Altair RapidMiner Community

GET HELP. LEARN BEST PRACTICES. NETWORK WITH YOUR PEERS.

Web Scraping - dynamic content

Best Answer

Answers