0

I have to download the html code from a website and parse it (unfortunately, parsing the html is the only way to get the necessary information). Of cause I could write my program using WebRequest, but I wonder whether there's a finished library for this.

I want to define the url and the header (cookies and POST-data) and would like to get the html code. It would be perfect if the library would return the answer-header as well.

Christopher
  • 2,005
  • 3
  • 24
  • 50

1 Answers1

2

You probably need to use WebRequest to do the communications (posting and retrieving data). For parsing HTML, there is the HTML Agility Pack. I have not used it myself though.

Srikanth Venugopalan
  • 9,011
  • 3
  • 36
  • 76
chue x
  • 18,573
  • 7
  • 56
  • 70
  • 1
    The HTML agility pack link leads to an msdn page about smalldatetime - I think you pasted the wrong link? – CorrugatedAir Mar 20 '13 at 16:07
  • @all - thanks for the corrections. yes, trying to answer multiple questions at the same time will do that ;) – chue x Mar 20 '13 at 16:40
  • Not what I hoped to hear, but thanks for the reference to the HTML Agility Pack! Maybe I'll upload my script to BitBucket so it can be reused... – Christopher Mar 21 '13 at 20:12