Questions tagged [scraper]

Synonym of [web-scraping]

349 questions
0 votes, 1 answer

Quotes Messing Up Python Scraper

I am trying to scrape all the data within a div as follows. However, the quotes are throwing me off.
14955 Shady Grove Rd.
Rockville, MD 20850
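
It is hard to say what the quotes are breaking without the full snippet, but a minimal sketch of the usual approach (BeautifulSoup, with a made-up "address" div) shows that the parser handles quoting in attributes and text on its own, so nothing needs escaping by hand:

from bs4 import BeautifulSoup

html = '''<div class="address" title='Shady Grove "Main" Office'>
14955 Shady Grove Rd.<br/>Rockville, MD 20850</div>'''

soup = BeautifulSoup(html, "html.parser")
div = soup.find("div", class_="address")
print(div.get_text(" ", strip=True))   # 14955 Shady Grove Rd. Rockville, MD 20850
print(div["title"])                    # Shady Grove "Main" Office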
0 votes, 1 answer

Trying to update Twitter status with scraper data using Twython. Unsure what to do

So I have these two scripts. redditScraper.py:

# libraries
import urllib2
import json
# get remote string
url = 'http://www.reddit.com/new.json?sort=new'
response = urllib2.urlopen(url)
# interpret as json
data = …
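
A rough sketch of gluing the two halves together, assuming the part the excerpt cuts off parses the JSON and hands a post title to Twython (the credentials are placeholders, and the ['data']['children'] path is reddit's listing layout):

# fetch the newest reddit posts, as in redditScraper.py
import json
import urllib2

url = 'http://www.reddit.com/new.json?sort=new'
data = json.load(urllib2.urlopen(url))
title = data['data']['children'][0]['data']['title']        # title of the newest post

# post it with Twython
from twython import Twython
APP_KEY, APP_SECRET = 'app-key', 'app-secret'               # placeholder credentials
OAUTH_TOKEN, OAUTH_TOKEN_SECRET = 'token', 'token-secret'   # placeholder tokens

twitter = Twython(APP_KEY, APP_SECRET, OAUTH_TOKEN, OAUTH_TOKEN_SECRET)
twitter.update_status(status=title[:140])                   # stay under the tweet limit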
0 votes, 1 answer

cURL times out when web scraping: "PHP Fatal error: Call to a member function find() on a non-object"

I've created this function that basically scrapes Technorati for blog posts and URLs to those posts. Btw, I tortured myself to find an API for this, and couldn't find one. I do feel ashamed for this scraper, but there should be an API!…
user796443
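
The "find() on a non-object" error usually means the fetch came back empty (for example, the cURL timeout hit) and the parser object was never created. This is not the original PHP, just the same guard pattern sketched in Python with requests and BeautifulSoup (the container class is made up):

import requests
from bs4 import BeautifulSoup

def scrape_posts(url):
    try:
        resp = requests.get(url, timeout=10)     # don't let the request hang forever
        resp.raise_for_status()
    except requests.RequestException:
        return []                                # fetch failed: nothing to parse

    soup = BeautifulSoup(resp.text, "html.parser")
    results = soup.find("ol", class_="search-results")   # hypothetical container
    if results is None:                          # skipping this check is the Python
        return []                                # equivalent of the PHP fatal error
    return [a["href"] for a in results.find_all("a", href=True)]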
0 votes, 1 answer

Fixing a 'sqlite3.InterfaceError: Error binding parameter 0 - probably unsupported type. Try converting types or pickling.'

I'm stuck on this scraper in ScraperWiki. I just want the text from the li-elements in the ul with dir='ltr'. I run this script every week, and sentences can be similar to each other while still being completely new ones. That's why I want to…
Jerry Vermanen
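
That InterfaceError usually means one of the bound values is not a type sqlite3 accepts (str, bytes, int, float, None) — typically a whole lxml element rather than its text. A minimal sketch of the fix, with a stand-in ul instead of the real page:

import sqlite3
from lxml import html

doc = html.fromstring('<ul dir="ltr"><li>First sentence.</li><li>A new one.</li></ul>')

conn = sqlite3.connect('scrape.db')
conn.execute('CREATE TABLE IF NOT EXISTS sentences (text TEXT)')
for li in doc.xpath('//ul[@dir="ltr"]/li'):
    value = li.text_content().strip()        # plain string, not an HtmlElement
    conn.execute('INSERT INTO sentences (text) VALUES (?)', (value,))
conn.commit()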
0 votes, 1 answer

XPath to select between two HTML comments is not working?

I'm trying to select some content between two HTML comments, but I'm having some trouble getting it right (as seen in "XPath to select between two HTML comments?"). There seems to be a problem when comments are on the same line. My…
Thomas
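
One way around it, sketched with lxml and made-up START/END markers: grab the first comment, then walk its following siblings until the closing comment appears, which works even when the comments and the content sit on the same line.

from lxml import html

page_source = '<div><!-- START --><p>keep me</p><p>and me</p><!-- END --><p>not me</p></div>'
doc = html.fromstring(page_source)

start = doc.xpath('//comment()[contains(., "START")]')[0]
between = []
for node in start.itersiblings():
    if isinstance(node, html.HtmlComment) and 'END' in (node.text or ''):
        break                                    # reached the closing comment
    between.append(node)

print([n.text_content() for n in between])       # ['keep me', 'and me']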
0 votes, 1 answer

ScraperWiki scrape frequency

This might be a stupid question, but I am currently scraping Twitter using ScraperWiki. The ScraperWiki run frequency is rather low, though. Is there a way to force ScraperWiki to run more frequently without touching Python, since my…
0 votes, 1 answer

BeautifulSoup4 - All links within 1 div on multiple pages

For a school project we need to scrape a 'job-finding' website, store the results in a DB, and later match these profiles with companies that are searching for people. On this particular site, all the URLs to the pages I need to scrape are in 1 div…
rockyl
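
A sketch under two assumptions made up here — the listing is paginated as ?page=N and the links sit in a single div with id "joblist":

import requests
from bs4 import BeautifulSoup

BASE = 'http://example.com/jobs'              # placeholder for the real site

def links_on_page(page_number):
    resp = requests.get(BASE, params={'page': page_number}, timeout=10)
    soup = BeautifulSoup(resp.text, 'html.parser')
    container = soup.find('div', id='joblist')
    if container is None:
        return []
    return [a['href'] for a in container.find_all('a', href=True)]

all_links = []
for page in range(1, 6):                      # however many pages the site has
    all_links.extend(links_on_page(page))
print('%d links collected' % len(all_links))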
0 votes, 1 answer

Chrome shows different HTML than my RequestJS & CheerioJS app

My scraper app is searching a Vimeo URL with a query string attached to it, which is 'http://vimeo.com/search?q=angularjs'. When I load that URL in Chrome I can see a number of elements that do not show up when I request() that URL from my scraper.…
user883807
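
A quick way to confirm the difference comes from client-side JavaScript rather than a bug in the scraper: fetch the raw HTML (which is all request()/cheerio ever sees) and check whether the elements Chrome shows are actually in it. Python is used here only for the check; the idea is the same in Node:

import requests

raw = requests.get('http://vimeo.com/search?q=angularjs', timeout=10).text

# compare what came over the wire with Chrome's "view source"; anything visible
# in the Elements panel but absent here was added by JavaScript after page load
print(len(raw))
print('q=angularjs' in raw)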
0 votes, 1 answer

Scraping Tags Using JSOUP

I'm attempting to extract the values from the following table using JSOUP: …
0 votes, 1 answer

How to handle NILs with Anemone / Nokogiri web scraper?

def scrape!(url)
  Anemone.crawl(url) do |anemone|
    anemone.on_pages_like %[/events/detail/.*] do |page|
      show = {
        headliner: page.doc.at_css('h1.summary').text,
        openers:   page.doc.at_css('.details h2').text …
GN.
0 votes, 1 answer

Python site scraper fails with socket.error 104

I feel like I am missing something very basic here about the limits of python processes. I have a screen scraper that is supposed to go to a password-protected site once a week, filling out a form to update existing records and then grabbing new…
user1046162
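
socket.error 104 is "connection reset by peer", which long-running weekly jobs hit fairly often. A common mitigation (a sketch, not the asker's code) is to retry the request a few times with a growing pause:

import socket
import time
import urllib2

def fetch_with_retries(url, attempts=3, delay=5):
    for attempt in range(1, attempts + 1):
        try:
            return urllib2.urlopen(url, timeout=30).read()
        except (socket.error, urllib2.URLError):
            if attempt == attempts:
                raise                        # give up after the last attempt
            time.sleep(delay * attempt)      # back off a little longer each time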
0 votes, 4 answers

How to get all the URLs of a website using a crawler or a scraper?

I have to get many URLs from a website and then copy them into an Excel file. I'm looking for an automatic way to do that. The website is structured with a main page of about 300 links, and inside each link there are 2 or 3 links that…
giogix
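
A sketch matching the structure described (a main page whose roughly 300 links each contain a few more links), collecting everything into a CSV that Excel opens directly; the start URL is a placeholder:

import csv
from urllib.parse import urljoin

import requests
from bs4 import BeautifulSoup

START = 'http://example.com/'                 # placeholder for the real site

def links(url):
    soup = BeautifulSoup(requests.get(url, timeout=10).text, 'html.parser')
    return [urljoin(url, a['href']) for a in soup.find_all('a', href=True)]

rows = []
for first_level in links(START):
    for second_level in links(first_level):
        rows.append((first_level, second_level))

with open('urls.csv', 'w', newline='') as f:
    csv.writer(f).writerows(rows)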
0 votes, 1 answer

How to extract text with lxml in this scraper program?

I am trying to scrape the text data from a specific element on this page (using ScraperWiki):

import requests
from lxml import html
response = requests.get('http://portlandmaps.com/detail.cfm?action=Assessor&propertyid=R246274')
tree = …
u'i
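
The usual continuation of that snippet — parse the response and pull text out with an XPath — would look roughly like this; the span id used here is a placeholder, since the real element on the page would need to be inspected:

import requests
from lxml import html

url = 'http://portlandmaps.com/detail.cfm?action=Assessor&propertyid=R246274'
tree = html.fromstring(requests.get(url).content)
values = tree.xpath('//span[@id="total_value"]/text()')   # hypothetical element id
print([v.strip() for v in values])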
0 votes, 1 answer

How to scrape the name of a class from a web page?

This is the HTML code of the site I want to scrape:
This is the XPath I'm using in dynamic django scraper, but it's not working: //div[@class="ayah…
user4650611
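
An exact @class match fails whenever the element carries more than one class; the usual XPath workaround is sketched below with lxml (the "ayah" class name is taken from the truncated excerpt above, so treat it as a guess):

from lxml import html

doc = html.fromstring('<div><div class="ayah w4">In the name of God...</div></div>')

exact  = doc.xpath('//div[@class="ayah"]')    # misses elements with extra classes
robust = doc.xpath('//div[contains(concat(" ", normalize-space(@class), " "), " ayah ")]')
print(len(exact), len(robust))                # 0 1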
0 votes, 1 answer

Django Dynamic Scraper project does not run on Windows even though it works on Linux

I am trying to make a project in Django Dynamic Scraper. I have tested it on Linux and it runs properly. When I try to run the command syndb, I get this…
user4650611