8

I am looking to create a Facebook App and the data I want to use is not available except for in CHM format.

Is it possible to convert CHM files into CSV or SQL? Do we have any existing tools for this?

Thanks for the help as always.

Community
  • 1
  • 1
Atif
  • 10,623
  • 20
  • 63
  • 96
  • 6
    lol @ close voters. Looking for an excuse to try "too localized"? – xyz Apr 14 '10 at 15:42
  • Did You find something similar for KU'AN a.z. I want to make XML Kur'an translation to Bosnian – adopilot Apr 14 '10 at 15:43
  • No I didnt look for Quran. Actually I am making a facebook app to randomly post a hadeeth on users wall. so just needed them to fill the database, i guess i need to go with @mjv procedure and then use regular expressions on the files to make my own database – Atif Apr 19 '10 at 02:12

2 Answers2

3

BTW, you should specify the language (English, Arabic...) in which you wish to build this dataset, as this affect both the selection of book sources and the conversion utilities.

Identifiying data content sources:
Interestingly, and for all the [interactive] online Hadeeth Search tools such as the one on the
CRCC's Compendium of Muslim Texts site (original from MSA West, but somehow not available/working at MSA site anymore), there doesn't seem to be any download-able version of the underlying databases!
There are several online versions of books themselves, in particular the popular ones you mention, but you would then need to parse and index them properly in order to retain the references etc. Also, going "back" to the books, you would have to relate them yourself.

With regards to converting CHM files...
There's no open source or freeware program that I'm aware of, but the shareware ABC Amber CHM converter (c. $25.00) appears to be the gold standard for that purpose.
I only had passing exposure to this software a couple of years ago, for a one-time conversion job similar to the one you are contemplating. The Amber converter "did the trick"; Luckily the underlying structure of the help pages exposed much regularity which allowed a relatively straight forward tabulation into CSV/database fields.
ABC Amber converter supports many languages, including Arabic (but I used it for English only).

mjv
  • 73,152
  • 14
  • 113
  • 156
1

There is a wellknown open source *nix lib called libchm on which most *nix chm viewers are based. It comes with a tool to decompress .chm to its .html originals (from which you then would have to convert the html tables to .csv)

Lazarus/Free Pascal also has libs, that even can write, but afaik no readily made "chm compiler" tool.

Marco van de Voort
  • 25,628
  • 5
  • 56
  • 89