Questions tagged [python-newspaper]

Newspaper is a Python library which delivers Instapaper style article extraction.

Newspaper is a Python library which delivers Instapaper style article extraction. Newspaper is inspired by requests and powered by lxml.

Useful links

111 questions
0
votes
2 answers

Unable to parse multiple files in a directory

I have html files on my local harddrive that I am trying to open in a webpage by sending a http request. Once the http request is created, I am trying to parse the stored html file by passing the url:(parsing is successful when passing one file at…
0
votes
0 answers

Python import error on one machine, but not on the other

I'm using a library called newspaper and deploying my app to elastic beanstalk. It runs fine locally, but on the server it doesn't work and throws this error: File "/opt/python/current/app/application.py", line 7, in from api.v01.api…
krishan711
  • 873
  • 1
  • 12
  • 27
0
votes
1 answer

Trouble parsing URLs from a list imported with pickle using newspaper library

I've been trying to pass a list of urls to extract articles from the pages. Extraction(with newspaper) works just fine if I build an actual list of urls (e.g. lista = 'http://www.zeit.de', ...). Taking the list from another file does not work,…
blub123
  • 31
  • 5
-1
votes
2 answers

Python Newspaper function not reading Article URL during loop?

Apologies if this is a stupid question - I'm new to Python and am more familiar with excel VBA. I am trying to have Python loop through multiple article URLs housed in an excel document and create summaries of the various URLs. The goal would be to…
Mondy77
  • 1
  • 1
-2
votes
4 answers

remove all possible unwanted characters from python string at once

I'm using python module newspaper3k and extracting article summary using its web url. As, from newspaper import Article article = Article('https://www.abcd....vnn.com/dhdhd') article.download() article.parse() article.nlp() text =…
Learner
  • 800
  • 1
  • 8
  • 23
-2
votes
1 answer

i want to get CNN news article all link

i want to get CNN news article all link for example in this link https://edition.cnn.com/search/?q=%20news&size=10 i can show lastest 10 news to get news link i tried two methods. html_page = urlopen(url) soup =…
Visa De
  • 21
  • 3
1 2 3 4 5 6 7
8