HTML Parser is a Java HTML parsing library. It features filters, visitors, custom tags and easy to use JavaBeans.
Questions tagged [html-parser]
211 questions
2
votes
3 answers
Extract data from "e under title tag using BeautifulSoup?
I want to extract title of a link after getting its HTML via BeautifulSoup library in python.
Basically, the whole title tag is
Imaan Z Hazir on Twitter: "Guantanamo and Abu Ghraib, financial and military support to dictators in Latin…

Amar
- 855
- 5
- 17
- 36
2
votes
1 answer
Object moved this document may be found here php
I'm redirecting my web page to another url. It works fine on localhost but when i host it to my web server then it give me the message which says "Object Moved This document may be found here". I don't know what is the issue here is my…

Tashen Jazbi
- 1,068
- 1
- 16
- 41
2
votes
1 answer
“html agility pack” like solutions for C/Objective-c/iPhone
I need a powerful HTML parser and manipulator for Objective-C/C, like HTML Agility Pack.
Can anyone tell me some optimal solution? One solution is libxml2, but it seams is not the best.
Thanks in advance!

mxg
- 20,946
- 12
- 59
- 80
2
votes
2 answers
HTMLParser misunderstands entities in href. Is it a bug or not? Should I report it?
I don't want to know how to solve the problem, because I have solved it on my own. I'm just asking if it is really a bug and whether and how I should report it.
You can find the code and the output below:
from html.parser import HTMLParser
class…

StackUser
- 587
- 6
- 26
2
votes
1 answer
Extract data using HTMLParser
Sam
John

Vinay
- 470
- 1
- 5
- 18
2
votes
1 answer
JSoup check if , and tags are present
Hi I am using JSoup to parse a HTML file. After parsing, I want to check if the file contains the tag. I am using the following code to check that,
htmlDom = parser.parse("
My First Heading
clk"); Elements pe =…
Nemin
- 1,907
- 6
- 24
- 37
2
votes
1 answer
A HTML Extraction API using RegEx or HTML Parser
I am aware that it is public opinion to not use RegEx for parsing HTML; however I do not see how it would be harmful to use RegEx (alike functions have been added in previous Scripting Languages using RegEx such as _StringBetween( ) in AutoIt3) for…

Katja
- 23
- 4
2
votes
0 answers
HTML Page and Python: Extracting the Body and Dividing Text Within It
Big story
I want to improve a Python application that reads EPUB files. I want to add the option to "memorize" the last place where the reader stopped. Here is the link to this application on github
At the moment, I can save the last words where…

nurgasemetey
- 752
- 3
- 15
- 39
2
votes
2 answers
Converting HTML list to nested Python list
If I have a nested html (unordered) list that looks like this:
2
votes
1 answer
2
votes
2 answers
How to split value from a string in ruby
My example string is listed here. i want to split every value result in array or hash to process value of each element.
accno: 123232323
id: 5443534534534
name: …
id: 5443534534534
name: …

Galet
- 5,853
- 21
- 82
- 148
2
votes
1 answer
java html parser multi page table
i am using Jsoup as html parser to get all the details from the table in this website. With the code below am only able to get the data on the first page only. Any advise?
public static void main(String[] args) {
String html =…

Shann
- 660
- 1
- 6
- 19
2
votes
1 answer
Cannot get all matched nodes while using htmlparser to parse a website
I'm using htmlparser for parsing a website, but I've trapped in a really weird problem. I'm trying to get all nodes at a webpage and my code is such as:
String url =…

user3115708
- 41
- 3
2
votes
2 answers
Parse HTML table in php
I have a database table which consists the following format of data in one column.
Calories… ![]()
Noor
2
votes
1 answer
Speeding up CsQuery selectors by using html substringI want to parse some complex/heavy HTML pages. I recently read about CsQuery and checked the performance comparation of CsQuery Vs Html Agility Pack and Fizzler . According to these tests, CsQuery turns to be slower when creating the DOM due to its…
![]()
VMh
|