Questions tagged [feedparser]

A Python library that parses feeds in all known formats, including Atom, RSS, and RDF.

Universal feed parser, handles RSS 0.9x, RSS 1.0, RSS 2.0, CDF, Atom 0.3, and Atom 1.0 feeds.

Links:

362 questions
0
votes
0 answers

Script for FeedPaser to Regularly Gather RSS, then Storing Data in Database

I'm learning Python. To teach myself I've decided to try to build a tool which gathers RSS feeds and stores the output, title, URL and Summary in a database (I will later build a tool to access the data and scrape the pages) So far, I have created a…
Nick Duddy
  • 910
  • 6
  • 20
  • 36
0
votes
1 answer

Feedparser returns only first entry of ATOM feed

I updated my (already) working code from python2.7 to python3.5 and the following problem suddenly appears. By parsing the given ATOM feed with many entries (correct syntax), feedparser 5.2.1. returns only the first entry of the feed and of course…
zypro
  • 1,158
  • 3
  • 12
  • 33
0
votes
1 answer

FeedParser, Removing Special Characters and Writing to CSV

I'm learning Python. I've set myself a wee goal of building a RSS scraper. I'm trying to gather the Author, Link and Title. From there I want to write to a CSV. I'm encountering some problems. I've search for the answer since last night but can't…
Nick Duddy
  • 910
  • 6
  • 20
  • 36
0
votes
0 answers

cannot parse gmail Atom feed using python feedparser library

I don't succeed to parse gmail Atom feed using feedparser with python 3.6 on windows 10 and neither on a raspberry pi running python 3.5 with ubuntu. For the gmail account I'd like to access atom feed, I have previously setup 2 step authentification…
0
votes
2 answers

Parser line break if string contains –

My NSXMLParser breaks on this string: AAA – BCDEFGQWERTYUIO I parsed it in this way, hope is the right way: - (void) parser: (NSXMLParser *) parser foundCharacters: (NSString *) string{ [...] if ([currentElement…
elp
  • 8,021
  • 7
  • 61
  • 120
0
votes
1 answer

python feedparser return only limited number of entries

I'm new to feedparser and would like to retrive the full number or entries/articles from a feed. myfeed = feedparser.parse(feedurl) print len(myfeed['entries']) For instance if I use feedurl = 'http://www.korben.info/feed' It only returns only 30…
chris3389
  • 53
  • 2
  • 8
0
votes
1 answer

Parse multiple RSS feeds (Python)

Using Feedparser to parse multiple RSS feeds but this function doesn't work. How can I create a function to parse feeds and get the link out of the feed entry to further push to a tweet? def get(): rss_url = [ …
0
votes
1 answer

Feedparser cause no attribute error for geo only when looping

I am using feedparser to parse RSS feed from spotcrime However, I am getting "no attribute" error whenever I'm trying to loop through the entries to get the 'geo_long' and 'geo_lat' attribute. If I don't loop than it works fine: f =…
Kim G Pham
  • 145
  • 6
0
votes
0 answers

deleting common words in python

How to delete common words from two documents thats extracted from two websites? I already extracted the news from two sites now I want to delete the common words from the two documents. I used the following code to extract news from two different…
Anila
  • 1
0
votes
1 answer

Getting a webpage using requests is taking too long while dealing with Cloudflare server, but works quickly in browser and curl

I am trying to read a rss feed using feedparser. But the program gets stuck when using requests, feedparser or urlopen. The page I am trying to get is here (mangastream.com rss feed) I can view it in browser or get it using curl in terminal. >>>…
Mark Evans
  • 974
  • 1
  • 11
  • 29
0
votes
1 answer

Simple Yahoo Pipe not working

All I want to do is mix three RSS feeds into one, remove duplicates (based on the title) and sort the items by date of publish. So I built this pipe (http://pipes.yahoo.com/nerdvanagr/olympiacosbc) which is just refusing to work: As you can see, for…
user103798
0
votes
0 answers

RSS scraping from several websites

I am trying to fetch some RSS feeds from a list of blog sites.But, most of these sites have older Feeds entries. And some of these sites do not have any etag information.so i cant check the latest published feed with etag. I figured out a logic to…
Kxplorer
  • 28
  • 8
0
votes
1 answer

Feedparser doesn't work, gives AttributeError

I have this code: import jinja2 import webapp2 import os from google.appengine.ext import db import feedparser from xml.dom import minidom from google.appengine.api import memcache template_dir = os.path.join(os.path.dirname(__file__),…
Sidharth Samant
  • 714
  • 8
  • 28
0
votes
3 answers

Is it possible to display hyperlinks in a Tkinter message widget?

I am creating a news feed program that uses the Feedparser module to read the Yahoo! RSS API, write key data to a text file, and then display the data organised in a Tkinter GUI. I was enquiring if it is possible to have clickable hyperlinks in a…
Nick
  • 63
  • 2
  • 9
0
votes
1 answer

Separate RSS feed link/s

I am using the feedparser module to create a news feed in my program. The Yahoo! Finance API link element actually has two links: the Yahoo link, and the actual article link (external site/source). The two are separated by an asterisks, with the…
Nick
  • 63
  • 2
  • 9