15

Does anyone think it is possible to build a Google Docs style PDF document viewer, which will convert a document to a format that doesn't require Adobe Reader on the client machine?

If so, any references to point to? Either a place that had done it, or an explanation of how to do it.

pearcewg
  • 9,545
  • 21
  • 79
  • 125

9 Answers9

13

I've done a lot of research regarding this matter and I hope I can help.

Good old Macromedia used to market Flash Paper, which was supposed to be a PDF Adobe Reader killer as it allowed any webmaster to embed and display PDF docs online using Flash. But that was before they sold out to Adobe and Flash Paper was soon put on a shelf and forgotten in favor of Adobe's priorities.

However, Today there are a so many ground-breaking alternatives...

As a user has mentioned above you can use Scribd.com (the wanna-be YouTube for documents). But they're not the only service (and certainly not the ones most ahead of the curve).

Here are my two favorites:

  1. Issuu (http://www.issuu.com)
  2. Mygazines (http://www.mygazines.com/)

I enjoy Mygazines's flash user interface the most (it's also faster) but it costs $99. It's pretty impressive. Depending on what you want to do that price tag can be worth it.

Issuu however, has won me over recently with their Smartlook Platform: http://issuu.com/smartlook

Here's a sample of Smartlook setup on a website:

http://www.ismartlook.com/

Plus it's completely free, which is nice.

A third alternative, which I've considered using myself is this free and open source code made by this guy named samurajdata. He calls it psview (PostScript Viewer). Anyone can download the source code and see it in action here:

http://view.samurajdata.se/

The converted PDFs losses quality as it converts to image fie, but it's fast and easy to setup.

I hope this helps!

Marko Bonaci
  • 5,622
  • 2
  • 34
  • 55
3

You may try Doconut.com looks pretty same as Google Docs viewer. It is available for asp.net 4.0, apart from PDF it can also show all office formats, tiff, dwg, psd etc.. However it is a paid library.

qqx
  • 18,947
  • 4
  • 64
  • 68
Mark
  • 39
  • 1
1

You can just use the Google Docs Viewer which also supports PDF documents. It allows you to embed it in your web page and point to the URL where the PDF is located (which doesn't have to be on the Google servers).

Example:

http://docs.google.com/viewer?embedded=true&url=http%3A%2F%2Fwww.domain.com%2Fdocument.pdf
Tom van Enckevort
  • 4,170
  • 1
  • 25
  • 40
1

If I understand you correctly you only want to view these files and not edit them.

Google already makes a best effort at providing PDF files found in it's search results as HTML. This doesn't always work. You can try it out by setting up a gmail account, mailing all your PDF files to it, and then using all the "View attachment as HTML" links in the messages.

Your other options are to take the source material and make it into HTML as say LaTeX2HTML does for LaTeX documents, or to convert the PDF into one of: a raster image (tiff, DjVu, etc), or a vector image (PostScript, SVG, SWF).

If the input to this process starts with the PDF files, you have very limited options, especially if the contents of the PDFs are just raster images (say scanned pages).

Personally I'd advocate for creating the PDFs from their source and trying to use Flash Paper to create an SWF out of them too as Flash Paper will pretend to be a printer. Because some 98% of browsers have Flash 9 or greater.

Have you seen Scribd?

dlamblin
  • 43,965
  • 20
  • 101
  • 140
1

There is the Internet Archive BookReader available. It's a nice book viewer implemented in javascript (jQuery), so the client doesn't need a PDF reader nor Flash. Though it needs images for the book pages, you can easily connect it to your own image server, so you may try to convert a PDF to images via ASP.NET (or any other tool like XPDF). I found that this is simpler to implement than actually implementing an images viewer.

Also, it seems to support search highlighting (try it here), but I haven't investigated exactly which metadata are needed and in what format.

The last release file contains a simple example on how to use it. More details and examples can be found in the first link.

kepler
  • 1,712
  • 17
  • 18
0

Try converting them from PDF to TIFF. Tiff supports multiple pages and is widely supported.

If formatting isn't that important, and your PDFs are structured right (ie actually contain text, not images of text), an alternate could be to convert to HTML. The tools from Aspose are pretty good.

Robert Wagner
  • 17,515
  • 9
  • 56
  • 72
0

I'm wondering why you would want to do that. PDF is such a general and widely supported format that if you try to avoid it you're limited to:

  • A more obscure or less well supported format (dvi, svg until it gets better support)
  • Converting to text/HTML like Google does with less than perfect results
  • Converting to an image format like TIFF which bumps up file sizes and removes all the niceties of PDF like real, selectable text and hyperlinks

If you don't want your users to have to install Adobe Reader (understandable), there are many free lightweight PDF viewers available (Foxit Reader for example), I'm sure many of these have browser embedding capabilities.

Mark Pim
  • 9,898
  • 7
  • 40
  • 59
0

Am I missing something here? Google Docs DOES support PDF. Simply upload the PDF file.

  • 2
    In many scenarios, it is not appropriate to upload and therefore host the PDFs on Google's servers for security and privacy reasons. To have a more controlled environment a self-hosted version of the Google Docs viewer is preferred. – RyanW Aug 23 '11 at 21:06
0

Some other alternatives depending upon what you're looking to do:

  • RAD PDF - ASP.NET component for displaying PDF documents, forms, etc. Also allows PDF searching, bookmarks, text selection, and basic editing.
  • Atalasoft - ASP.NET component for image viewing, but also allows PDF use as an image. Doesn't support any PDF features beyond simple viewing.
userx
  • 3,769
  • 1
  • 23
  • 33
  • (I work at Atalasoft) -- actually we support annotations, PDF page reordering/removing/adding, bookmarks, embedded links and more – Lou Franco Aug 19 '10 at 14:14