Questions tagged [html-agility-pack]

HTML Agility Pack is an open-source HTML parser that builds a read/write DOM and supports Linq, plain XPATH or XSLT.

HTML Agility Pack is an open-source HTML parser that builds a read-and-write DOM and supports Linq, plain XPath or XSLT.

It is a .NET code library that allows parsing out of the web HTML files. The parser is very tolerant to malformed HTML. The object model is very similar to what proposes System.Xml, but for HTML documents or streams.

Installing HTML Agility Pack can most easily be done using its NuGet package:

Install-Package HtmlAgilityPack

Latest stable release: 1.11.3 / 18 April 2019

GitHub page: https://github.com/zzzprojects/html-agility-pack

3466 questions

vote

2 answers

select next node html Agility pack C#

I'm a student working with htmlagilitypack for the 1st time. I'm currently filtering the html to get the values using foreach (HtmlNode link in bodyNode.SelectNodes("//span[@class='content-b']")) { if (link.InnerText.Contains("Name")) { …

c# regex filtering html-agility-pack

asked Mar 16 '13 at 06:46

DriverBoy

3,047
3
19
21

vote

1 answer

Screen scraping with htmlAgilityPack and XPath

[This question has a relative that lives at: Selective screen scraping with HTMLAgilityPack and XPath ] I have some HTML to parse which has general appearance as follow: ... Text Data here (1) Text Data…

c# xpath html-agility-pack

asked Mar 14 '13 at 08:47

user900360

vote

1 answer

HTMLAgilityPack: C# how to add strings with tags using CreateNode

I'm trying to build HTML output with data that might contain HTML tags (simple ones, link or ). If I add such a string to my HTML using CreateNode, some unwanted filtering happens. How can I fix this? Example: string myText =…

c# html-agility-pack

asked Mar 13 '13 at 11:27

Marcel

vote

2 answers

how to use html agility pack to extract all url from html text

Often I extract file names from html text data using regex but I heard the html agility pack is good for parsing html data. how can I use html agility pack to extract all url from html data. Can any one guide me with sample code. Thanks. This is my…

c# html-agility-pack

asked Mar 12 '13 at 15:11

Mou

15,673
43
156
275

vote

2 answers

HtmlAgilityPack xPath scraping

I am trying to scrape this site http://www.gotickets.com/calendar.php?Display=Daily&Date=2013-03-12&EventTypeID=2&EventID=0&GenreID=159&VenueID=0&MarketAreaID=0 This is what interests me The data is heirarchly organized as follows

xpath web-scraping html-agility-pack

asked Mar 12 '13 at 13:02

Mickey Perlstein

2,508
2
30
37

vote

1 answer

Parsing javascript HTML using HTMLAgilityPack

I have the following HTML that I'm trying to parse using the HTML Agility Pack. This is a snippet of HTML code: ...

....