Jamie Dixon walks us through scraping a webpage using F#:
I need to go through all 8 pages of the grid and download the .pdfs that are associated with the “View Report” link. The challenge in this particular site is that they didn’t use any URL parameters, so there is no way to go through the grid via the URI. Looking at the page source, they are using ASP.NET and, in typical enterprise-derpy manner, named their table “GridView1”.
The way to get to the next page is to press on the “Next” link defined like this:
They over-achieved in the bloated View State for a simple page category though.
#Sigh
The code is straightforward and available as a Gist in the post.
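For readers who want a feel for the mechanics before opening the Gist, here is a minimal sketch of the paging step. It is written in Python rather than the F# Jamie uses, and it is not his code. It assumes the “Next” link is a standard ASP.NET `__doPostBack` link and that the page keeps its state in hidden inputs such as `__VIEWSTATE`. The sample markup, the field values, the `Reports.aspx` name, and the helper names are all hypothetical.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urlencode

# Matches the target and argument inside javascript:__doPostBack('target','argument')
POSTBACK = re.compile(r"__doPostBack\('([^']*)','([^']*)'\)")


class PageParser(HTMLParser):
    """Collects hidden form inputs and a map of link text -> href."""

    def __init__(self):
        super().__init__()
        self.hidden = {}
        self.links = {}
        self._href = None   # href of the <a> currently being read, if any
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "input" and (attr.get("type") or "").lower() == "hidden":
            if attr.get("name"):
                self.hidden[attr["name"]] = attr.get("value") or ""
        elif tag == "a":
            self._href = attr.get("href") or ""
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links["".join(self._text).strip()] = self._href
            self._href = None


def next_page_form(html, link_text="Next"):
    """Return the form fields to POST for the paging link, or None if absent."""
    parser = PageParser()
    parser.feed(html)
    match = POSTBACK.search(parser.links.get(link_text, ""))
    if match is None:
        return None  # no postback link with that text: assume the last page
    form = dict(parser.hidden)  # echo the view state back to the server
    form["__EVENTTARGET"], form["__EVENTARGUMENT"] = match.groups()
    return form


# Hypothetical markup standing in for the real page.
SAMPLE = """
<form method="post" action="./Reports.aspx">
  <input type="hidden" name="__VIEWSTATE" value="dDwtMTIz" />
  <input type="hidden" name="__EVENTVALIDATION" value="wEWAgK" />
  <a href="javascript:__doPostBack(&#39;GridView1&#39;,&#39;Page$Next&#39;)">Next</a>
</form>
"""

form = next_page_form(SAMPLE)
body = urlencode(form)  # form-encoded body for the POST request
```

To walk the grid, you would POST `body` back to the same page URL as `application/x-www-form-urlencoded`, parse the response the same way, and repeat until `next_page_form` returns `None`. Each response carries a fresh view state, so the hidden fields have to be re-read on every page rather than reused.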