0

I am trying to insatll Python Boilerpipe in my Ubuntu 14. It fails with the following error:

 Traceback (most recent call last):
  File "setup.py", line 27, in <module>
    download_jars(datapath=DATAPATH)
  File "setup.py", line 21, in download_jars
    tar = tarfile.open(tgz_name, mode='r:gz')
  File "/usr/lib/python2.7/tarfile.py", line 1678, in open
    return func(name, filemode, fileobj, **kwargs)
  File "/usr/lib/python2.7/tarfile.py", line 1730, in gzopen
    raise ReadError("not a gzip file")
tarfile.ReadError: not a gzip file

These are the steps I am following:

najeeb
  • 813
  • 12
  • 25

2 Answers2

3

Found the issue, so in the setup.py they are looking for boiler-pipe tar file. And they download it from googlecode, which is not there any more.

def download_jars(datapath, version=boilerpipe_version):
    tgz_url = 'https://boilerpipe.googlecode.com/files/boilerpipe-{0}- bin.tar.gz'.format(version)

So I replaced the same line with the new file location:

tgz_url='https://storage.googleapis.com/google-code-archive-downloads/v2/code.google.com/boilerpipe/boilerpipe-1.2.0-bin.tar.gz'

This worked for me.

najeeb
  • 813
  • 12
  • 25