Questions tagged [tagged-pdf]

Tagged PDF is a form of embedded metadata for PDFs that allows support for bookmarks and copy-and-paste on text, and gives a namespace for Adobe's DOM that is leveraged by such technologies as layers, interactive PDFs, and RichMedia Annotations

Tagged PDF is a form of embedded metadata for PDFs that allows support for bookmarks and copy-and-paste on text, and gives a namespace for Adobe's DOM that is leveraged by such technologies as layers, interactive PDFs, and RichMedia Annotations.

FAQ

  1. What is "Tagged PDF"?
  2. PDF - Are Bookmarks and Hyperlinks equivalent?

PDF generation

  1. Can MikTeX create tagged PDFs?
  2. How to create a tagged PDF from a "complex" XML file
  3. Create destinations for all bookmarks in a PDF file with iText API

Manipulating tagged PDFs

  1. How to read PDF bookmarks programmatically
  2. Recover PDF to LaTeX
  3. Split a PDF by Bookmarks?
32 questions
2
votes
0 answers

How to add the tags to existing pdf file?

Could anyone explain how to add the tags to the existing pdf file using iText library in java, is there any sample code for adding the tags to pdf files ?
2
votes
2 answers

Can a tagged PDF file have a presentation table?

In HTML, I can have which indicates the table isn't a real table and is just used for layout purposes. This helps with screen reader navigation to real tables. Is there a similar construct for tables in tagged PDF? As…
slugolicious
  • 15,824
  • 2
  • 29
  • 43
2
votes
1 answer

Find tagged content in PDF/A-1a using pdfbox

I have what I presume to be a PDF/A-1a file that was generated by apache fop and has an overlay letterhead put on using OverlayPDF from pdfbox. preflight recognizes the file as ok (but obviously only PDF/A-1b) and Acroreader says it is "PDF/A" mode…
Stefan Hegny
  • 2,107
  • 4
  • 23
  • 26
2
votes
0 answers

Jaws screen reader scrolling function issue when reading pdf

when using Jaws screen reader (version 15 on windows 7) to read the contents of a 3 page tagged pdf, it reads the content correctly but it scrolls down to the last page once it starts reading the contents of the first page. The pdf contains a single…
user1479847
  • 107
  • 11
1
vote
1 answer

Tagged PDF NOT able to Read HyphenSpan

Here I have 2 PDF's where we have Hyphen used in the text of the document at the end of the fourth line of the PDF (see below.) Where we have a text de - (cember) in the next line. In both the PDF's the '-' has been tagged as Hyphen Span. But while…
1
vote
1 answer

Issue with xsl-fo :footnote when generating pdf/ua-1 document with fop.: "tagged PDF note id is missing"

I have an issue with when generating pdf/ua-1 document with fop. The resulting pdf displays correctly the footnote in the page but don’t pass the pdf-ua validation. A severe error on pdf tag Note “id is missing” is raised so the…
1
vote
1 answer

Java-PDFbox: Creating the artifact tag for lines and underlines in tagged PDF

I am creating the accessibility PDF from the tagged pdf. It shows a "path object is not tagged" error. The PDF has lines and underlined text. So, I am trying to add an "ARTIFACT" tag for the untagged line items. I am able to get the lines from…
Dilli
  • 15
  • 4
1
vote
1 answer

Tag structure initialization failed, tag structure is ignored, it might be corrupted

I created a fillable PDF using Adobe Acrobat pro and tagged the input fields. Tagged Content, Tagged annotations, Tagged Form Fields passed in the Accessibility check for the document I am trying to fill using itext. I am getting this error when i…
1
vote
0 answers

Manipulate tag tree from tagged pdf

Question: I was working on tags for accessibility with iText7 and I want to manipulate the structure of the tag tree. The task would be like. First of all, extract the tag tree from tree. Then, move the node to it's parent node. The demonstrate…
1
vote
1 answer

Tag content in pdf

I have a pdf which looks like below. I would want to tag the paragraph as 'paragraph'. I have searched a lot about this, and there are ways to create a tagged pdf from scratch, or convert html content to tagged pdf, but I have not had success in…
SuperNova
  • 25,512
  • 7
  • 93
  • 64
1
vote
0 answers

PdfContentByte not adding text Tagget Structure

I am trying to add text using ItextSharp Version 5.5.6.0 On one of the segments we are using PdfContentByte to write the text Assume that pdfPageContents is the instance of PdfContentByte with the content byte pdfPageContents.SaveState(); …
0
votes
0 answers

C# iTextSharp to manipulate tagged pdf structure

I want to manipulate (modify) a fully tagged pdf. Using below code i am able to get the document tree structure PdfDictionary catalog = reader.Catalog; PdfDictionary struc = catalog.GetAsDict(PdfName.STRUCTTREEROOT); now I want to modify the below…
0
votes
1 answer

PDFBox I need to wrap the link tags with reference tag

I need to wrap the link tags with the reference tag. private static PDStructureElement wrapInElement(String structType, PDStructureElement pdStructureElement) { PDStructureNode parentNode = pdStructureElement.getParent(); PDStructureElement…
Dilli
  • 15
  • 4
0
votes
1 answer

Java PDFBox: Remove the parent element in tagged PDF

I got the Possibly inappropriate use of a "Table" structure element warning in PAC3 validation. Table structure in PDF as follows, In order to pass the PAC3 I now drag out the tables from the parent tag to an element on its own as like below, I…
Dilli
  • 15
  • 4
0
votes
1 answer

Accessible Tagged PDF with BIRT

We are using BIRT to create PDF-files from our website, but now we need to change the PDFs to meet the accessibility demands. I have tried to search for an answer how to make accessible tagged PDF with BIRT, but haven't found any answer. Even this…
Piia
  • 1
  • 2