Questions tagged [feedparser]

A Python library that parses feeds in all known formats, including Atom, RSS, and RDF.

Universal feed parser, handles RSS 0.9x, RSS 1.0, RSS 2.0, CDF, Atom 0.3, and Atom 1.0 feeds.

Links:

362 questions
0
votes
1 answer

How to parse all post's url's from blogspot.com blog?

This post is similar to How can I retrieve all posts from a Blogger (blogspot.com) blog? I do import feedparser url = feedparser.parse('https://www.blogger.com/feeds/1020819187099399113/posts/default?max-results=1000') for x in url.entries: …
Edward
  • 4,443
  • 16
  • 46
  • 81
0
votes
2 answers

Email title and link from rss-feed and email them

I'm doing a bit of an experiment in Python. I'm making a script which checks a rss-feed for new items, and then sends the title and link of the items via email. I've got the script to work to a certain level: when it runs it will take the link+title…
HankSmackHood
  • 4,673
  • 7
  • 29
  • 30
0
votes
2 answers

Storing Media RSS and iTunes podcast RSS feeds in the database

I want to be able to store media RSS and iTunes podcast RSS feeds into the database. The requirement here is that I don't want to miss out on ANY element or its attributes in the feed. It would make sense to find all most common elements in the feed…
Chirantan
  • 15,304
  • 8
  • 49
  • 75
0
votes
1 answer

feedparser missing entry description, includes phantom entry summary? python3

when using feedparser for reading rss feeds such as business insider's at this url: businessinsider.com/rss feedparser in python3 seems to properly handle some of the attributes for each entry in the rss feed, 'transform' others, and ignore/delete…
10mjg
  • 573
  • 1
  • 6
  • 18
0
votes
1 answer

Unreachable xml feed entries

I'm working on a python application supposed to make a request on a phonebook search api and format the received data. The entries are sent back as an xml feed looking like the exemple at the bottom. I'm using feedparser to split the…
joel
  • 3
  • 2
0
votes
1 answer

Python script's feedparser works, but fails in docker container

Running Docker Version 17.12.0-ce-mac49 (21995) on mac. Converting an existing working python script that performs some feed parsing on a password protected feed. Standard alone python 2.7 script works perfectly. Created an alpine based container…
J. Taylor
  • 31
  • 3
0
votes
0 answers

Problems with feedparser on python

Below is my code with the functions I have written, but I have a problem. When passing a string with the URL in the function get_feedparser('FEED_URL'), since I do not parse it well when giving me the error: {'encoding': 'utf-8', 'bozo': 1, 'feed':…
0
votes
1 answer

Python: Download/parse *Solely* the pubdate tag of rss entry

how do I download only the specific attribute of a metadata of an rss? I'm using the universal feedparser library feedparser.parse(linkstr).entries[0].published I have this line, which returns something like this if printed: "Sat, 02 Dec 2017…
xland44
  • 9
  • 5
0
votes
0 answers

Get exception when just started py script, but all work

Interesting situation! I has parser.py script, link to it below in 1, with feedparser library. When i just started it, i get such exceptions: Connected to pydev debugger (build 172.3757.67) pydev debugger: process 20017 is connecting Traceback…
Gudsaf
  • 289
  • 3
  • 12
0
votes
1 answer

Parse Stackoverflow RSS job feed for same name elements, with Feedparser in Python

Every job item on the Stackoverflow RSS feed has certain tags, with the key "category". Looking basically like this:…
Felix
  • 667
  • 14
  • 28
0
votes
1 answer

AttributeError: module 'feedparser' has no attribute 'FeedParserDict'

I am trying to import feedparser in Python, and want to call FeedParserDict from the library feedparser, i.e., feedparser.FeedParserDict. But it leads to the following error: "AttributeError: module 'feedparser' has no attribute…
xha
  • 39
  • 1
  • 8
0
votes
1 answer

Feedparser returns a SAXParseException

I try to read RSS feeds using python. I use feedparser, and it works well for every feed, except one : https://www.frenchweb.fr/feed import feedparser feed = feedparser.parse("https://www.frenchweb.fr/feed") print(feed) Output : { 'feed': {}, …
iero
  • 401
  • 3
  • 14
0
votes
1 answer

Python and Comparing File Changes

d = feedparser.parse('somerssfeed/rss.xml') message = {} smessage = {} for post in d.entries: message[post.link] = post.title fwrite = open("db.txt", "a") for k, v in message.items(): if k in open("db.txt", "r"): print("already…
ggnoredo
  • 801
  • 1
  • 13
  • 33
0
votes
3 answers

Different results for the same RSS feed fetching from different user agents

If I add a feed URL to Google Reader or to a desktop feed aggregator, I receive nice results. The URL is: http://estaticos03.marca.com/rss/futbol_1adivision.xml But when I fetch the same URL from a script (python script, using feedparser library) I…
nabucosound
  • 1,283
  • 1
  • 12
  • 23
0
votes
2 answers

feedparser google appengine deferred no entries?

I'm using feedparser in a deferred task in google app engine like this: class RSSFetchHandler(webapp.RequestHandler): def get(self): deferred.defer(parse_dk_indeed_com, feed) and then in parse_dk_indeed_com I have the following code…
kristian nissen
  • 2,809
  • 5
  • 44
  • 68