Boilerpipe is a great Java program for cleaning web pages and I've used it in the past. I note today that many users are not able to install the Python wrapper version and get 404 and other errors. Here is one of my attempts which I copied from conda.
/Users/duncan>sudo -H pip install https://pypi.python.org/packages/source/b/boilerpipe-py3/boilerpipe-py3-1.2.0.0.tar.gz Collecting https://pypi.python.org/packages/source/b/boilerpipe-py3/boilerpipe-py3-1.2.0.0.tar.gz Downloading boilerpipe-py3-1.2.0.0.tar.gz (1.3MB) 100% |████████████████████████████████| 1.3MB 436kB/s Complete output from command python setup.py egg_info: Traceback (most recent call last): File "", line 1, in File "/tmp/pip-r6swd0hy-build/setup.py", line 33, in download_jars(datapath=DATAPATH) File "/tmp/pip-r6swd0hy-build/setup.py", line 26, in download_jars urlretrieve(tgz_url, tgz_name) File "/Users/duncan/anaconda/lib/python3.5/urllib/request.py", line 188, in urlretrieve with contextlib.closing(urlopen(url, data)) as fp: File "/Users/duncan/anaconda/lib/python3.5/urllib/request.py", line 163, in urlopen return opener.open(url, data, timeout) File "/Users/duncan/anaconda/lib/python3.5/urllib/request.py", line 472, in open response = meth(req, response) File "/Users/duncan/anaconda/lib/python3.5/urllib/request.py", line 582, in http_response 'http', request, response, code, msg, hdrs) File "/Users/duncan/anaconda/lib/python3.5/urllib/request.py", line 510, in error return self._call_chain(*args) File "/Users/duncan/anaconda/lib/python3.5/urllib/request.py", line 444, in _call_chain result = func(*args) File "/Users/duncan/anaconda/lib/python3.5/urllib/request.py", line 590, in http_error_default raise HTTPError(req.full_url, code, msg, hdrs, fp) urllib.error.HTTPError: HTTP Error 404: Not Found
----------------------------------------
Command "python setup.py egg_info" failed with error code 1 in /tmp/pip-r6swd0hy-build/
I've seen several solutions that didn't work for me (ex: google changed its URL for a module) If anyone has a solution I would be very grateful!
My platform here is a 16GB El Capitan but I've seen this reported on Ubuntu and other platforms. Thank you for any help here!