Questions tagged [feedparser]

A Python library that parses feeds in all known formats, including Atom, RSS, and RDF.

Universal feed parser, handles RSS 0.9x, RSS 1.0, RSS 2.0, CDF, Atom 0.3, and Atom 1.0 feeds.

Links:

362 questions
0
votes
0 answers

Refresh json thread to parse data on specific intervals

Ideally I want to implement Method level synchronization where whole method is synchronized every 1minute Not Block level synchronization where only some set of statements are synchronized But I don't know why when my new runnable handler execute…
0
votes
1 answer

Rss feed not working online

I m working on a website which extracts news feed from my blog to the website. It workd fine in my localhost but when the content is online, the same code does not work. Here are my code.
0
votes
1 answer

Trace/BPT trap when running feedparser inside a Thread object

I am trying to run a Thread to parse a list of links using the universal feed parser, but when I start the thread I get a Trace/BPT trap. Here's the code I am using: class parseRssFiles(Thread): def __init__ (self,rssLinks): …
simao
  • 14,491
  • 9
  • 55
  • 66
0
votes
1 answer

MySQL in Python complaining about placeholders

I've been trying to use python's MySQLdb to execute SQL on a MySQL Database from SSH on my webhost. This program i wrote (on a mac) should print a table, but it doesn't. Here's my code: import feedparser import time import MySQLdb topnews =…
CJ Goldshine
  • 135
  • 1
  • 2
  • 11
0
votes
1 answer

feedparser cannot get namespace values

hi i am trying to find the best way to extract values from a string returned from feedparser, basically its returning the following, i can of course do a regex match but im sure there is a better way. xml looks like this:- Size:…
binhex
  • 374
  • 4
  • 13
0
votes
1 answer

Is there a way in feedparser to retrieve only X number of entries from a feed URL?

I need to fetch the last 3 feeds from a feed url and display. The code that i'm currently using is : import feedparser feedUrl = "" feed = feedparser.parse( feedUrl ) length = len(feed['entries']) stop = (length-4) if length > 3 else -1 for i…
Dhruv
  • 1,079
  • 2
  • 13
  • 26
0
votes
2 answers

Retrieve info between paragraph tags with feedparser

I've been reading through the documentation for feedparser and haven't been able to find a solution to this: I would like to retrieve only the string between

. An example of an excerpt from a feed I'd like to retrieve this from is:
user3050397
  • 65
  • 1
  • 6
0
votes
1 answer

Python extract links

I am getting a feed by parsing this URL with feedparser which is supplied at the bottom of the post. In the URL supplied above there is a list of zip files on links on the page. The tutor (slide 8) wants to use this code below to extract all the…
ExoticBirdsMerchant
  • 1,466
  • 8
  • 28
  • 53
0
votes
1 answer

FeedparserDict object doesn't have 'content' attribute

I am trying to get familiar with the feedparser library, but I don't seem to be able to access the content attribute of entries in the feedparser object: d = feedparser.parse('http://www.reddit.com/r/python/.rss') post =…
0
votes
0 answers

error with feed excel in drupal

After installing Feeds Excel Parser and Feeds XLS, I get the error Fatal error: require_once(): Failed opening required 'sites/default/libraries/PHPExcel/PHPExcel/IOFactory.php' (include_path='.;C:\xampp\php\PEAR') in…
0
votes
2 answers

How do I retrieve all RSS entries that are no more than X days old

I am using Python and the RSS feedparser module to retrieve RSS entries. However I only want to retrieve a news item if it is no more than x days old. For example if x=4 then my Python code should not fetch anything four days older than the current…
timebandit
  • 794
  • 2
  • 11
  • 26
0
votes
1 answer

Why aren't there any dates in the Netflix.com NewWatchInstantlyRSS feed entries (when parsed with feedparser)?

The output from the following: import feedparser d = feedparser.parse('http://www.netflix.com/NewWatchInstantlyRSS') d.entries[177].keys() is: ['summary_detail', 'links', 'title', 'summary', 'guidislink', 'title_detail', 'link', 'id'] According to…
Daryl Spitzer
  • 143,156
  • 76
  • 154
  • 173
0
votes
1 answer

How to check if Rss has been updated when there's no Etag support

I've tried this code but it doesn't seem to work: import feedparser import time c = feedparser.parse('http://www.tsa-algerie.com/feed/') constID = c['entries'][0].id print c['entries'][0]['title'] while True: time.sleep(300) if constID…
Bilel_a
  • 163
  • 1
  • 3
  • 8
0
votes
1 answer

Feedparser newbie questions

After a break from Python(and I knew very little then!) I'm coming back to it for a project(hopefully!). I want to do some parsing using Feedparser & need a few hints to start. Before anyone shouts, I have searched Google and read the docs, but…
James Wanchai
  • 2,861
  • 4
  • 21
  • 16
0
votes
2 answers

Python Feedparser and Multi-threading

I have a list (nearly 500) of RSS/ATOM feeds urls to parse and fetch the links. I am using python feedparser libary to parse the url. To parse the list of urls parallely, I thought of using threading library in python. My code looks something like…
Prashere
  • 167
  • 1
  • 1
  • 7