Questions tagged [html-parser]

HTML Parser is a Java HTML parsing library. It features filters, visitors, custom tags and easy to use JavaBeans.

211 questions
-1
votes
1 answer

nested div innertext returns Object reference not set to an instance of an object in HtmlAgilitypack

I am trying to get the value of a nested div using HtmlAgilitypack.my html code is like this : here is my code : var val = resultat1.DocumentNode.SelectSingleNode("//div[@class='journal-content-article']/div[@class='news_page…
Ehsan Akbar
  • 6,977
  • 19
  • 96
  • 180
-1
votes
1 answer

how to parse html

I have downloaded the Java HtmlParser but I dont know how to use the API for extracting the HTML data. Can you give some example so that I can work on it?
Vipul
  • 2,637
  • 6
  • 25
  • 28
-1
votes
1 answer

What does "abs:%s" regex means?

I am using Jsoup in my project and i am try to get understand what these lines of code in my HTMLparser.java is step by step doing: static List toLinkNodeObject(LinkNode parentLink, Elements tagElements, String tag) { List
medo0070
  • 511
  • 1
  • 5
  • 21
-1
votes
3 answers

Extracting the main product image from a ecommerce product page

I am looking for options to extract the main image from a product page on a retailer website, the problem is there are multiple images in a product page (related images) , one approach I thought would work would be to extract all the image links,…
user1965449
  • 2,849
  • 6
  • 34
  • 51
-1
votes
1 answer

Cant extract tables from a html code

I am working to parse a html table given below(its a section of complete html code) But the code is not working. Can some one please help me.There is an error saying "table has no attribute findall". The code is: import re import HTMLParser from…
user3235169
  • 33
  • 1
  • 7
-1
votes
5 answers

Parse HTML using perl regex

I created a Perl script that would use an online website to crack MD5 hashes after the user inputs the hashes. I am partially successful as I am able to get the response from the website, though I need to parse the HTML and display the hash, and…
bAd bOy
  • 33
  • 1
  • 9
-1
votes
1 answer

Parse and Modify the HTML file using Java

I have to parse a given HTML and modify its content and save the modified version. My HTML Input:

@MarcoS had an excellent solution using a NodeTraversor to make a list of nodes to change at

Ramanan
  • 1,000
  • 1
  • 7
  • 20
-1
votes
1 answer

Undefined value HTML::TokeParser

I'm trying to find tags in a html with a perl script. use WWW::Mechanize::Firefox; use Crypt::SSLeay; use HTML::TokeParser; $ENV{PERL_LWP_SSL_VERIFY_HOSTNAME}=0; my $mech = WWW::Mechanize::Firefox->new; $mech->get_local('salida2.html'); my…
Malincy Montoya
  • 87
  • 1
  • 13
-1
votes
1 answer

In which situations, the elements will be attached to Head element of DOM during HTML parsing?

Obviously, the contents/tags within pair will be attached to Head element. Is there any situation in which during the parsing of , some elements would be attached to Head element, instead of Body element?
JackWM
  • 10,085
  • 22
  • 65
  • 92
-2
votes
1 answer

React-html-parser crushing the deployment

I am trying to deploy my next app on vercel and the react-html-parser is throwing error I thought about downloading an older version of react but there are other dependencies that required newest version Is there a solution for this case? npm ERR!…
ofek levi
  • 31
  • 3
-2
votes
2 answers

How to select the previous tag when re finds the str

I have an HTML file like this:(More than 100 records)

John Smith

Center - VAR - Employee I
user15109593
  • 105
  • 5
-2
votes
2 answers

help me with xpath please

Possible Duplicate: if you know xpath then please help? hi, so here is the html code Test
Google
test1
Tushar Chutani
  • 1,522
  • 5
  • 27
  • 57
-2
votes
2 answers

Crawl a news website and getting the news content

I'm trying to download the text from a news website. The HTML is:

"My Text"…

-3
votes
2 answers

Is there a GEM for HTML parsing in Ruby, like Cheerio in NodeJS?

Is there a Ruby GEM available for HTML parsing, like Cheerio in NodeJS? Cheerio is an easy to use HTML parser, works like jQuery.
dineshsprabu
  • 165
  • 3
  • 4
-6
votes
2 answers

HTML Table parsing in PHP Error

I am parsing table data via this code $table = $myurl->find('table', 3); $rowData = array(); foreach($table->find('tr') as $row) { $flight = array(); foreach($row->find('td') as $cell) …
1 2 3
14
15