Questions tagged [scrape]

DO NOT USE THIS TAG. It is under an active cleanup: https://meta.stackoverflow.com/q/305314 Use [web-scraping] if your question is about scraping information from web resources (there is also [screen-scraping]) or use [pdf-scraping] if your question is about scraping information from pdf files. Use [data-extraction] if you need to extract data from other resources.

1204 questions
-2
votes
1 answer

Scrape Javascript page with Python or other

I want to scrape the team grids for each game on the following website: http://mc.championdata.com/nrl/ and I believe the code below is for away teams:
brad58
  • 1
-2
votes
1 answer

Can php save scraped data into a cache file with a file name as CURRENT DATE?

I'm asking because I'd like to have a graph with historical data, including from the most updated date. I know how to save scraped data as a certain name, and have it saved into a cache folder. If it's possible instead of saying...save current file…
Masteryogurt
  • 199
  • 4
  • 14
-2
votes
1 answer

How can I scrape data from a function which includes loops

I'm learning Web Scraping (with Python and Beautiful Soup) and I encountered with a problem on how to scrape data within a function which contains loops. The data I'm trying to get is within the condition of if,else statement as shown below.(Page…
Sreekant Shenoy
  • 1,420
  • 14
  • 23
-2
votes
1 answer

Is Python a suitable tool for automating data scraping?

I am working on a project which involves working with a large amount of data. Essentially, there exists a large repository on some website of excel files that can be downloaded. The site has several different lists of filters and I have several…
niccalis
  • 134
  • 1
  • 7
-2
votes
1 answer

Where can I find the list of web scraping projects to practice?

I am looking to practice the web scraping projects. Can anyone tell me where can I find such list or give me some examples of web scraping projects?
nitinvijay23
  • 1,781
  • 3
  • 13
  • 11
-2
votes
1 answer

Vb.Net Scraped Data MsgBox Shows Line Breaks but RichTextBox Does Not

So I'm scraping data and I've got my xpath parsing out the line I need. I go line by line and dump out the data so I know I'm looking in the right place and for every listing it's showing several lines which is good. Exactly what I expected. Example…
Max Better
  • 61
  • 10
-2
votes
1 answer

scrape the data-title and data-video-id on single page

I want to scrape the data-title and data-video-id but i don't know how to write the script with imacros The Photos
-2
votes
2 answers

writing a list with multiple data to a csv file in separate columns in python

import csv from itertools import izip if l > 0: for i in range(0,l): combined.append(str(questionList[i]).encode('utf-8') + str(viewList[i]).encode('utf-8')) # viewcsv.append(str(viewList[i]).encode('utf-8')) # …
Rohit Raj
  • 1
  • 1
  • 2
-2
votes
1 answer

undefined method `[]' for nil:NilClass while using Nokogiri

I am using Nokogiri to scrape data from a HTML document, but I'm running into the following error: `block in
': undefined method `[]' for nil:NilClass (NoMethodError) This is the code to reproduce the problem: require 'rubygems' require…
Anshul Kalra
  • 198
  • 3
  • 13
-2
votes
2 answers

Scrape data from HTML

This is the page I'm trying to scrape from, http://www.footballlocks.com/nfl_point_spreads_week_1.shtml , I want to end up with a simple data.frame with 4 columns so I can perform further analysis. I've have tried using the XML package but with not…
RcodeNFL
  • 9
  • 1
  • 1
  • 1
-2
votes
2 answers

Scrape content from a webpage

I'm trying to scrap content from this webpage: www.motorcyclemonster.com/motorcycle-events.html, the code I'm using is:
TrevTech
  • 49
  • 6
-2
votes
1 answer

Scrape urls from google's top page in python

I need to scrape google for the first 10-20 URL results for a keyword, I found a lib called pygoogle but when I run it I get a syntax error: here is my code: from pygoogle import pygoogle g = pygoogle('quake 3 arena') g.pages = 5 print '*Found %s…
Ken Dotx
  • 1
  • 1
-2
votes
1 answer

How to scrape all data from website with Jsoup?

I need to scrape all data from those sites: http://outfit7.com/privacy-policy/ http://www.gameloft.com/privacy-notice/ I use JSOUP. And program must scrape all text from site. As you see those sites have different structures. So I should use…
user3661720
  • 27
  • 12
-2
votes
1 answer

Combining string (URL) with Array

I am trying to scrape some data. B is the number of pages required for 1000 items to be scraped - there are 22 per page. usernum = 1000 b = usernum.to_i/22 Array.new(b) {|b| b+1} I have an array for the number of pages (1000/22 = 45 int.) -…
user2208607
  • 35
  • 2
  • 5
-2
votes
1 answer

How to scrape relative images

If i look at Amazon Button to add items to lists on their site - you can see it here: http://www.amazon.co.uk/wishlist/get-button How does it work? I'm pretty sure it scrapes the page somehow but it seems to get every image whether its a flash…
StudioTime
  • 22,603
  • 38
  • 120
  • 207