Questions tagged [html-agility-pack]

HTML Agility Pack is an open-source HTML parser that builds a read/write DOM and supports LINQ, plain XPath, or XSLT.

It is a .NET code library for parsing HTML files as they are found on the web. The parser is very tolerant of malformed HTML. The object model is very similar to the one System.Xml offers, but for HTML documents or streams.
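
As a rough illustration of the object model described above, here is a minimal sketch that loads a deliberately malformed fragment and reads link targets, once with LINQ and once with XPath. The class name `LinkSketch`, the helper `ExtractHrefs`, and the sample markup are assumptions made up for this example; only `HtmlDocument`, `LoadHtml`, `DocumentNode`, `Descendants`, `GetAttributeValue`, and `SelectNodes` come from the library.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using HtmlAgilityPack; // NuGet package: HtmlAgilityPack

public static class LinkSketch
{
    // Hypothetical helper (not part of the library):
    // returns every non-empty href found in the given markup.
    public static List<string> ExtractHrefs(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html); // tolerant parser: unclosed tags do not throw

        // LINQ style: walk all <a> descendants and read their href attribute.
        return doc.DocumentNode
                  .Descendants("a")
                  .Select(a => a.GetAttributeValue("href", string.Empty))
                  .Where(href => href.Length > 0)
                  .ToList();
    }

    public static void Main()
    {
        // Deliberately malformed input: no <html>/<body>, and the <p> is never closed.
        const string html = "<p>See <a href=\"/one\">one</a> and <a href=\"/two\">two</a>";

        foreach (var href in ExtractHrefs(html))
            Console.WriteLine(href);

        // XPath style: SelectNodes returns null (not an empty collection)
        // when nothing matches, so check before iterating.
        var doc = new HtmlDocument();
        doc.LoadHtml(html);
        var images = doc.DocumentNode.SelectNodes("//img[@src]");
        Console.WriteLine(images == null ? "no images" : images.Count + " image(s)");
    }
}
```

The null check on `SelectNodes` is worth keeping in real code, since iterating the result directly is a common source of `NullReferenceException` when an XPath expression matches nothing.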

The easiest way to install HTML Agility Pack is through its NuGet package:

Install-Package HtmlAgilityPack

Latest stable release: 1.11.3 / 18 April 2019

GitHub page: https://github.com/zzzprojects/html-agility-pack

3466 questions
1
vote
2 answers

select next node html Agility pack C#

I'm a student working with HtmlAgilityPack for the first time. I'm currently filtering the HTML to get the values using foreach (HtmlNode link in bodyNode.SelectNodes("//span[@class='content-b']")) { if (link.InnerText.Contains("Name")) { …
DriverBoy
1
vote
1 answer

Screen scraping with htmlAgilityPack and XPath

[This question has a relative that lives at: Selective screen scraping with HTMLAgilityPack and XPath ] I have some HTML to parse, which has the following general appearance: ... Text Data here (1) Text Data…
user900360
1
vote
1 answer

HTMLAgilityPack: C# how to add strings with tags using CreateNode

I'm trying to build HTML output with data that might contain HTML tags (simple ones, link or ). If I add such a string to my HTML using CreateNode, some unwanted filtering happens. How can I fix this? Example: string myText =…
Marcel
1
vote
2 answers

how to use html agility pack to extract all url from html text

I often extract file names from HTML text using regex, but I have heard that the HTML Agility Pack is good for parsing HTML. How can I use the HTML Agility Pack to extract all URLs from HTML data? Can anyone guide me with sample code? Thanks. This is my…
Mou
1
vote
2 answers

HtmlAgilityPack xPath scraping

I am trying to scrape this site: http://www.gotickets.com/calendar.php?Display=Daily&Date=2013-03-12&EventTypeID=2&EventID=0&GenreID=159&VenueID=0&MarketAreaID=0 This is what interests me. The data is hierarchically organized as follows
Mickey Perlstein
1
vote
1 answer

Parsing javascript HTML using HTMLAgilityPack

I have the following HTML that I'm trying to parse using the HTML Agility Pack. This is a snippet of HTML code: ...
....