Sunday, January 26, 2014

Web Scraping.


Web Scraping is annoying! No one wants to do it.

But Web Scraping is a real business, and a multitude of people hire another multitude of people to scrape the Web for them: line by line, manually, copying data, pasting data, and so on. It's the worst job ever.

But as always, there is a better way to do things. Automate your web scraping process!

All you need is a basic understanding of HTML, which you can get from here:

http://www.w3schools.com/html/     (It's very basic)

Once you know basic HTML, you can have a look at a website's source code, and then you can use Visual Basic (yes, you need to know a little VBA too) to pull data out of that source code.

With VBA you can treat Internet Explorer as an object and then write code to pull the data from the source code of that IE object. Search for "IE automation" and "web scraping using VBA" to get started.
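
To give a rough idea of what that looks like, here is a minimal sketch, not the code from my video. It assumes you are running it from an Excel module (it writes to the active sheet), and the URL and the choice of `<a>` tags are only placeholders that you would swap for whatever the site you are scraping actually uses.

```vb
Option Explicit

' Minimal sketch: drive Internet Explorer from VBA and copy link text
' and link targets out of the page's HTML into the active worksheet.
' Assumptions: runs inside Excel; the URL and the "a" tag are placeholders.
Sub ScrapeLinks()
    Dim ie As Object
    Dim links As Object
    Dim link As Object
    Dim rowNum As Long

    ' Late binding, so no reference to "Microsoft Internet Controls" is needed
    Set ie = CreateObject("InternetExplorer.Application")
    ie.Visible = False
    ie.Navigate "http://www.w3schools.com/html/"

    ' Wait until the page has finished loading (4 = READYSTATE_COMPLETE)
    Do While ie.Busy Or ie.readyState <> 4
        DoEvents
    Loop

    ' Collect every <a> element from the loaded document
    Set links = ie.document.getElementsByTagName("a")

    rowNum = 1
    For Each link In links
        ' Column A: visible text, column B: target URL
        ActiveSheet.Cells(rowNum, 1).Value = link.innerText
        ActiveSheet.Cells(rowNum, 2).Value = link.href
        rowNum = rowNum + 1
    Next link

    ' Close the browser and release the object
    ie.Quit
    Set ie = Nothing
End Sub
```

The part you will change most from site to site is the line that picks elements out of the document, since that depends entirely on how the page's HTML is laid out.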

You can't write generic code that will work on every darn website, because every website is written in its own fashion, so its source code is different.

But once you learn to scrape data using the source code, you can make your own web scrapers. It's easy! Here, check out this video I made:


In case you want the code for the scraper shown in the video, e-mail me. I will be happy to help!

contact: vardhanam.daga@gmail.com