Native C# pdf reader

Question

I need to extract text from a PDF file. I've found iTextSharp and PDFBox, but both of them are only Java ports, and to make them work I need to use big additional DLLs.

So, my question is: is there a native C# library for extracting text from PDF files? If there isn't one, is it hard to write one?

"If there is no any, is it hard to write it?" If it wasn't hard, someone would've written one already. — BoltClock, Apr 16 '11 at 22:38
possible duplicate of [PDF Reader](http://stackoverflow.com/questions/905683/pdf-reader) — Hans Passant, Apr 16 '11 at 22:39
If iTextSharp does not fill your needs, then you will probably need to go with a commercial (paid) product. And yes, iTextSharp is a port from Java, but it was rewritten in C#, so it is managed code. — Jim, Apr 17 '11 at 00:26
@Jim iTextSharp/iText are also paid products unless used in open source projects. — Bobrovsky, Apr 17 '11 at 06:30

Bobrovsky · Accepted Answer · 2020-08-07T11:28:57.887

3

The Docotic.Pdf library can be used to extract text from PDF files.

The library has no external dependencies and is written in C#. Docotic.Pdf comes in four editions.
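
As a rough illustration of what text extraction with the library might look like, here is a minimal sketch. It assumes the BitMiracle.Docotic.Pdf NuGet package is referenced and uses a placeholder file name, `input.pdf`. License setup is omitted and may be required depending on the edition and version.

```csharp
using System;
using BitMiracle.Docotic.Pdf;

class ExtractTextSketch
{
    static void Main()
    {
        // "input.pdf" is a placeholder path, not a file from the original post.
        using (PdfDocument pdf = new PdfDocument("input.pdf"))
        {
            // GetText() returns the plain text of all pages in the document.
            string text = pdf.GetText();
            Console.WriteLine(text);
        }
    }
}
```

The quality of the extracted text depends on how the PDF was produced, so it is worth checking the output against a few representative files.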

Disclaimer: I work for Bit Miracle.

edited Aug 07 '20 at 11:28

answered Apr 17 '11 at 06:36

Bobrovsky


Wow, you sure you guys are charging enough for that? Perhaps you could ask for a non-preferential limb as well... – Niall Connaughton Jul 12 '14 at 05:58
Well yeah, there was a free version, but now it's $595+. Too expensive! – xZ6a33YaYEfmv Nov 07 '14 at 13:07

score 3 · Answer 2 · answered Apr 17 '11 at 09:47


There's PdfSharp.


erikkallen


PdfSharp doesn't support text **extraction**! – xZ6a33YaYEfmv Apr 17 '11 at 11:30
