Questions tagged [pdfclown]

PDF Clown is a free/libre open source implementation of the Portable Document Format (PDF) specification published by Adobe® Systems Inc., written as a class library in multiple languages (Java™ 6 and C#/.NET 4.0).

PDF Clown is a free/libre open source implementation of the Portable Document Format (PDF) specification published by Adobe® Systems Inc., written as a class library in multiple languages (Java™ 6 and C#/.NET 4.0).

Java version represents the reference implementation of PDF Clown.

C# version was developed upon the Mono flavour of the .NET framework, which allows it to run over multiple platforms (BSD and UNIX variants, including OS X and Solaris, and Microsoft® Windows®). As such code adheres to the related ECMA spec, it may also be run upon other .NET implementations, including MS'.

It's hosted here at SourceForge.

95 questions
2
votes
2 answers

Designing PDF component for easy access

I have seen open source and commercial PDF components which support Dot net implementation, I think almost every available component in market,but the strange to identify a document that is protected or not, every one is showing in the form of…
Jay
  • 1,869
  • 3
  • 25
  • 44
1
vote
1 answer

PDFClown nuget package cannot load assembly

I've added the PDFClown nuget package to a .net core project and ran into the following error at runtime: Could not load file or assembly 'PDFClown, Version=0.1.2.0, Culture=neutral, PublicKeyToken=null' or one of its dependencies. A …
EmpZol
  • 11
  • 1
1
vote
1 answer

Unable to extract cmyk colorspaces from pdf

I'm trying to extract colorspace data from pdf. I have a file with Pantone and CMYK colorspaces. When I extracted the colorspaces from PDF using any pdf library (I used pdfclown, pdfbox and icePdf), the output data consists only of Pantone…
ksa
  • 47
  • 8
1
vote
1 answer

pdf clown- not highlighting specific search keyword

I am using pdf-clown with pdfclown-0.2.0-HEAD.jar.I have written below code for highlighting search the keyword in Chinese language pdf file and same code is working fine with english pdf file. import java.awt.Color; import java.awt.Desktop; import…
seshadri p
  • 29
  • 7
1
vote
0 answers

PDFClown Extract table How to extract a table from PDF using PDFClown

How would extract an Table from a PDF using PDFClown. I want to take the Table Data and save it to a DB. I am open to use different framework/package as long it is free Thanks
Valter
  • 2,859
  • 5
  • 30
  • 51
1
vote
1 answer

PDF Clown does not render text

I use PDF Clown to create PDF files containing text and different shapes. Saving the files brings the desired result. But when I print the pages or render them to bitmaps, only the shapes are visible and the text elements are missing. I tried…
Exponent
  • 94
  • 5
1
vote
1 answer

PDFClown NullPointerException in GlyphMapping

I recently switched my code from PDFClown v0.1.2 to v0.1.2.1. There were some differences which I think I have already met now (changed constructors of TextMarkup() and switching from CallOutNote to StaticNote) but unfortunately I've run in some…
grml
  • 23
  • 1
  • 4
1
vote
0 answers

PDFClown Detect empty text location

I am able to use the PDFClown library in C# to parse and extract the text from a daily report in PDF. The issue I am having is detecting when a text value is missing. Using the TextExtractor, there is no place holder in the text value as I expected.…
David Vice
  • 33
  • 1
  • 6
1
vote
1 answer

Flattening form using PDFClown throws IndexOutOfBounds exception

I'm using PDFClown-0.2.0 to flatten this pdf file. This is the code I have: import org.pdfclown.documents.Document; import org.pdfclown.files.File; import org.pdfclown.files.SerializationModeEnum; import org.pdfclown.tools.FormFlattener; public…
drunkenfist
  • 2,958
  • 12
  • 39
  • 73
1
vote
1 answer

Detecting Headers and Borders in PDF Tables using PDF Clown

I am using PDF Clown's TextInfoExtractionSample to extract a PDF table into Excel and I was able to do it except merged cells. In the below code, for object, "content" I see the scanned content as text, XObject, ContainerObject but nothing for…
1
vote
1 answer

Extracting vector graphics (lines and points) with pdfclown

I want to extract vector graphics (lines and points) out of a pdf with pdfclown. I have tried to wrap my head around the graphics sample but i cannot figure out how the object model works for this. Please can anyone explain the relationships?
1
vote
2 answers

Get font and size of an acroform field

I need to get access to the font style and size of an acroform text field. I have access to the Field object via pdfclown but can't work out how to access the font. Anyone know how to do this?
jax
  • 37,735
  • 57
  • 182
  • 278
1
vote
1 answer

pdfclown highlighting doesn't work for some pdf files

I am using the pdfclown library to highlight some text inside the pdf file but for some reason, I get nullpointerexception error when I run TextHighlightSample. [java] java.lang.NullPointerException [java] at…
1
vote
0 answers

How to add annotations and highlights to an encrypted PDF file?

I need to write an application, which highlights some text in a PDF file and add annotations to them. I've used the PDF Clown library and it works well with non-encrypted files in the following way: I read the PDF file A. Add some annotations and…
Glory to Russia
  • 17,289
  • 56
  • 182
  • 325
1
vote
1 answer

PdfClown nullpointer on Font access

I have something strange going on with text writing using PdfClown 0.1.2. PrimitiveComposer composer = new PrimitiveComposer(page); BlockComposer blockComposer = new BlockComposer(composer); addHeader(document, composer, blockComposer); …
sandris
  • 1,478
  • 2
  • 18
  • 34