Questions tagged [sgml]

Standard Generalized Markup Language (SGML) is the precursor to XML and HTML. It is an ISO standard (ISO 8879) that was used for electronic publishing of documents and books.


134 questions
2
votes
1 answer

PHP strip non-SGML characters from a string?

I've got nonstandard characters coming out of my database (due to line breaks). My HTML validator is complaining about them. Since my HTML validator is a direct extension of my ego, I'd like to keep the thing happy and green-ok-arrow-y. Does…
Ben
  • 54,723
  • 49
  • 178
  • 224
2
votes
0 answers

PowerShell - Exporting XML without closing tags - OFX format SGML?

I'm trying to create an OFX file for import into Sage300 using PowerShell. This is to provide automated bank reconciliation. I'm not too familiar with the OFX structure, but it appears to be SGML v1.02. Input will be from a SQL query. The target OFX…
ColinA
  • 99
  • 1
  • 9
2
votes
3 answers

Java code to read SGML files

I'm doing my project on Text Categorization. I've got a text categorisation test collection called Reuters-21578 for my Information Retrieval project. It is distributed in 22 files. Each of the first 21 files (reut2-000.sgm through reut2-020.sgm)…
user386633
  • 29
  • 1
  • 2
2
votes
2 answers

Comments in DTD definitions

When looking at the HTML 4 DTD (https://www.w3.org/TR/html4/sgml/dtd.html) I see a lot of descriptive comments within the DTD definitions.
wirrbel
  • 3,173
  • 3
  • 26
  • 49
2
votes
3 answers

SGML Parser in Python

I am completely new to Python. I have the following code:

    class ExtractTitle(sgmllib.SGMLParser):
        def __init__(self, verbose=0):
            sgmllib.SGMLParser.__init__(self, verbose)
            self.title = self.data = None

        def handle_data(self, data):
            if…
afg102
  • 361
  • 2
  • 4
  • 4
2
votes
2 answers

Regular expression for converting SGML to XML

I am converting SGML content to XML content with the help of this link. Using the sgmlString.replaceAll("<(([^<>]+?)>)([^<>]+?)(?=<(?!\\1))", "<$1$3"); regular expression I am almost close to the expected result, but for the following file when…
Shailesh Saxena
  • 3,472
  • 2
  • 18
  • 28
2
votes
2 answers

DTD character data validation error due to linebreak

I have the following fake.dtd file: And the following SGML document: Document Title 1234 …
ChrisP
  • 5,812
  • 1
  • 33
  • 36
2
votes
3 answers

Convert .sgm to .txt

I have some files in .sgm format and I have to evaluate them (apply a language model and obtain the perplexity of the text). The main problem is that I need these files in plain format, i.e. in txt format. However I have been searching into the…
lucasrodesg
  • 638
  • 1
  • 6
  • 22
2
votes
0 answers

emacs SGML screwed up by quotes?

I'm troubleshooting a podcast feed. I'm not sure the best tool for a job like this, but I started by opening the feed in emacs sgml-mode and running sgml-pretty-print hoping to get a better look. The pretty printing and syntax highlighting do not…
Dave Cohen
  • 1,277
  • 2
  • 15
  • 21
2
votes
1 answer

DTD. Element Type Declaration. Difference between 'contentspec' and 'content model'

I read the Extensible Markup Language (XML) 1.0 (Fifth Edition) W3C Recommendation of 26 November 2008. Section 3.2, Element Type Declarations, has: An element type declaration takes the form: Element Type Declaration elementdecl ::=
user3771600
  • 33
  • 1
  • 6
2
votes
1 answer

How does empty start tag work in HTML4?

The HTML4 specification mentions various SGML shorthand markup constructs. While I understand what the others do, with the help of an HTML validator, I cannot understand why anyone would want an empty start tag. It cannot even have attributes, so it's…
Konrad Borowski
  • 11,584
  • 3
  • 57
  • 71
2
votes
2 answers

Usage of SGML (Standard Generalized Markup Language)

SGML is the superset of HTML and XML. There are rich HTML and XML parsers available. Could you please explain the usage of SGML (a sample business scenario) in current business domains? Is it when dealing with legacy systems? There are HTML and…
Kasun
  • 561
  • 11
  • 22
2
votes
1 answer

Perl: Can't Build SGML::Parse::OpenSP in Mac OSX and Proxy Errors

I am running Perl 5.16.2 on OSX 10.9.1 and I am having difficulty installing modules from CPAN. I successfully installed sudo cpan App::cpanminus However, when I try something like curl -L http://cpanmin.us | perl - --sudo App::cpanminus I get…
Mark Clements
  • 465
  • 7
  • 25
2
votes
1 answer

Set up PSGML Mode on Emacs 24

I am desperately trying to set up the PSGML/XML Mode for Emacs 24. I did everything according to this explanation: http://www.lysator.liu.se/~lenst/about_psgml/psgml.html I did ./configure ./make ./make install after unpacking the latest package of the…
Simon Fromme
  • 3,104
  • 18
  • 30
2
votes
1 answer

Sgml returns some warnings

I use the sgml library of Prolog to extract information about a web page. I use this instruction to extract all: load_structure('file.html', List, [dialect(sgml), shorttag(false), max_errors(-1)]) The system loads the page but I have some warnings,…
Joachim Low
  • 277
  • 1
  • 2
  • 11