Questions tagged [feedparser]

A Python library that parses feeds in all known formats, including Atom, RSS, and RDF.

Universal feed parser, handles RSS 0.9x, RSS 1.0, RSS 2.0, CDF, Atom 0.3, and Atom 1.0 feeds.

362 questions
2
votes
1 answer

Retrieving raw XML for items with feedparser

I'm trying to use feedparser to retrieve some specific information from feeds, but also retrieve the raw XML of each entry (i.e. <item> elements for RSS and <entry> for Atom), and I can't see how to do that. Obviously I could parse the XML by hand, but that's not…
a3nm
  • 8,717
  • 6
  • 31
  • 39
2
votes
4 answers

NSDateFormatter with RSS pubDate

I have some problems converting an RSS pubDate string into an NSDate object. Fri, 09 Sep 2011 15:26:08 +0200 NSDateFormatter *dateFormatter = [[NSDateFormatter alloc] init]; [dateFormatter setDateFormat:@"EEE, d MMM yyyy…
Philippxp
  • 367
  • 2
  • 6
  • 16
2
votes
0 answers

Feedparser pubdate gives error when trying to store into a table with asyncpg

I am trying to store feedparser data into a Postgres database via asyncpg, and I am getting an error while storing pubdate, which is of type timestamptz in the database. I have a Postgres table test with the following…
PirateApp
  • 5,433
  • 4
  • 57
  • 90
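The excerpt above is cut off before the actual error, so the following is only a sketch under an assumption: that the value being stored is not a timezone-aware `datetime`. feedparser exposes parsed dates such as `published_parsed` as a UTC `time.struct_time`, which can be converted with the standard library alone. The helper name `struct_time_to_aware_datetime` and the sample timestamp are hypothetical and used purely for illustration.

```python
import calendar
import time
from datetime import datetime, timezone


def struct_time_to_aware_datetime(parsed):
    """Convert a UTC time.struct_time into a timezone-aware datetime.

    Hypothetical helper, not part of feedparser or asyncpg.
    """
    # calendar.timegm treats the struct_time as UTC; time.mktime would
    # instead assume local time and shift the result.
    return datetime.fromtimestamp(calendar.timegm(parsed), tz=timezone.utc)


# Stand-in for something like entry.published_parsed (assumed to be UTC).
example = time.strptime("2011-09-09 13:26:08", "%Y-%m-%d %H:%M:%S")
published = struct_time_to_aware_datetime(example)
```

The resulting `published` value carries `tzinfo`, which is the kind of object a timestamptz column is normally fed from Python.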
2
votes
4 answers

Accessing duplicate feed tags using feedparser

I'm trying to parse this feed: https://feeds.podcastmirror.com/dudesanddadspodcast The channel section has two entries for podcast:person
Brenden
  • 8,264
  • 14
  • 48
  • 78
2
votes
1 answer

Error using cx_freeze with a feedparser import: ModuleNotFoundError: No module named 'sgmllib'

I am trying to use cx_freeze to freeze an app that uses Feedparser. The app works perfectly if I launch it from the command line using python. However, when I try to freeze it using cx_freeze, I don't get an error until I try to launch the app. Then…
Ricky Kresslein
  • 372
  • 1
  • 13
2
votes
1 answer

How to query arXiv for a specific year?

I'm using the code shown below to retrieve papers from arXiv. I want to retrieve papers that have the words "machine" and "learning" in the title. The number of papers is large, so I want to slice the results by year (published). How…
Fluxy
  • 2,838
  • 6
  • 34
  • 63
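One possible way to slice by year, sketched under assumptions since the asker's code is not shown: the arXiv API documents a `submittedDate:[YYYYMMDDTTTT TO YYYYMMDDTTTT]` range filter that can be combined with title terms in `search_query`. The function name `arxiv_query_url`, the exact date bounds, and the paging defaults are hypothetical choices for illustration.

```python
from urllib.parse import urlencode

# Public arXiv API endpoint.
ARXIV_API = "http://export.arxiv.org/api/query"


def arxiv_query_url(year, start=0, max_results=100):
    """Build a query URL for titles containing 'machine' and 'learning'
    submitted during the given year (hypothetical helper)."""
    # Date range in YYYYMMDDTTTT form, covering the whole calendar year.
    date_range = f"submittedDate:[{year}01010000 TO {year}12312359]"
    search = f"ti:machine AND ti:learning AND {date_range}"
    params = {"search_query": search, "start": start, "max_results": max_results}
    return ARXIV_API + "?" + urlencode(params)


url = arxiv_query_url(2020)
```

The resulting URL can be passed to `feedparser.parse`, and `start` can be advanced in steps of `max_results` to page through one year at a time.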
2
votes
2 answers

What could cause a UnicodeEncodeError exception to creep into a working Python environment?

I have a method in my script that pulls a Twitter RSS feed, parses it with feedparser, wraps it in TwiML (Twilio-flavored XML) using the twilio module, and returns the resulting response in a CherryPy method via str(). This works fine in…
Sean W.
  • 4,944
  • 8
  • 40
  • 66
2
votes
1 answer

How to use feedparser with proxy in python 3?

I'd like to use a proxy with feedparser. This is how I'd do it with urllib2: import urllib2, feedparser proxy = urllib2.ProxyHandler({"http":"proxy:port"}) d = feedparser.parse('http://www.shop.inonit.in/RSSFeedDetails.aspx?PID=801', handlers =…
dranxo
  • 3,348
  • 4
  • 35
  • 48
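In Python 3, `urllib2.ProxyHandler` lives in `urllib.request`. A minimal sketch of one approach, assuming it is acceptable to download the feed yourself and hand the bytes to the parser: the helper names `build_proxy_opener` and `fetch_feed_bytes` and the proxy address are hypothetical.

```python
import urllib.request


def build_proxy_opener(proxy_url):
    """Return an opener that routes HTTP and HTTPS requests via proxy_url."""
    handler = urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
    return urllib.request.build_opener(handler)


def fetch_feed_bytes(opener, feed_url, timeout=10):
    """Download the raw feed through the opener and return its bytes."""
    with opener.open(feed_url, timeout=timeout) as response:
        return response.read()


# Placeholder proxy address; nothing is contacted until fetch_feed_bytes runs.
opener = build_proxy_opener("http://proxy.example:8080")
```

The bytes returned by `fetch_feed_bytes` can then be given to `feedparser.parse`, which accepts raw feed content as well as URLs.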
2
votes
2 answers

How to modify an atom/rss feed in python?

Here is what I want to do in Python: take a feed, add data to the feed, and re-publish the modified feed. Feedparser does a good job at parsing, but it doesn't seem to provide a way to compose an XML document from a FeedParserDict object. Is there a…
emm
  • 265
  • 3
  • 11
2
votes
1 answer

Serializing a FeedParser object to Atom

I use feedparser http://www.feedparser.org/ to parse Atom feeds and I do some manipulation on the resulting Python objects. After that, I would like to serialize the objects back to Atom. But feedparser does not seem to offer a way to do so? I…
bortzmeyer
  • 34,164
  • 12
  • 67
  • 91
2
votes
1 answer

Unexpected error when using feedparser.py

I have had great success parsing RSS feeds from the National Hurricane Center using the feedparser module: import feedparser feedparser.parse('https://www.nhc.noaa.gov/gis-at.xml') #Works Fine feedparser.parse('https://www.nhc.noaa.gov/gis-ep.xml')…
2
votes
0 answers

feedparser object has no attribute 'etag' and 'modified'

I am trying to run an example from the feedparser documentation. What I have tried so far: import feedparser d = feedparser.parse('http://feedparser.org/docs/examples/atom10.xml') print d.etag I am getting this error for etag: AttributeError: object has no…
Solaman Raji
  • 170
  • 9
2
votes
1 answer

How to make Universal Feed Parser only parse feeds?

I'm trying to get content from external feeds on my Django web site with Universal Feed Parser. I want to have some user error handling, e.g. if the user supplies a URL that is not a feed. When I tested how feedparser responds to faulty input, I was…
Benjamin Wohlwend
  • 30,958
  • 11
  • 90
  • 100
2
votes
2 answers

How to parse an XML feed using feedparser in Python?

I'm trying to parse a feed in Python using feedparser, but all I get is None returned. I'm not sure what I'm missing. Here is my code: import feedparser def rss(self): rss = 'https://news.google.com/news?q=fashion&output=rss' feed =…
kevinabraham
  • 1,378
  • 4
  • 28
  • 55
2
votes
1 answer

Feedparser SAXParseException, bozo:1

I'm using feedparser in a script that's generally working for RSS URLs, but there's one URL that's giving me a headache: tabbforum.com/feed.atom I get a SAXParseException('not well-formed (invalid token)',). import feedparser def…
malla
  • 1,618
  • 1
  • 17
  • 23