Highest Voted 'scraper' Questions

2

votes

0 answers

Flipkart Review Scraping - Data not coming after certain pages, why?

I was trying to build a crawler for Flipkart using python and bs4. I was successful in doing so as well, but there is a problem while trying to scrape more than 13-14 pages of review. Till 13 pages the scraper works fine, but once I reach page no.…

asked Mar 18 '18 at 07:28

Prateek

185
1
3
12

2

votes

1 answer

missing a piece of the "set-Cookie" from a GET response (scraper)?

I'm trying to authenticate with: https://sso-prod.sun.ac.za/cas/login but the cookies I receive on my initial get request seem to be incomplete here is what firefox receives: but the cookies I get from my request: Cookies: "" …

java authentication web-crawler httprequest scraper

asked Jun 09 '17 at 12:10

SimpleJack

151
1
12

2

votes

2 answers

How can I extract text from a span tag using beautiful soup 4?

how can I scrape text through span tags using beautful soup? scrape faculty members informations from bs4 import BeautifulSoup import requests r = requests.get("http://www.uoj.ac.ae/ContentBan.aspx?m=15&p=4&sm=4") soup = BeautifulSoup(r.content,…

python-2.7 beautifulsoup scraper

asked May 19 '16 at 06:39

Kt7

71
1
4

2

votes

1 answer

Selenium webdriver with python to scrape dynamic page cannot find element

So there are a lot of questions that have been asked around dynamic content scraping on stackoverflow, and I went through all of these, but all the solutions suggested did not work for the following problem: Context: Using Selenium webdriver with…

python selenium selenium-webdriver web-scraping scraper

asked Mar 28 '16 at 23:20

Audrey Bascoul

23
4

2

votes

1 answer

Scrape Google News with lxml and python

I'm trying to scrape Google News using python and lxml. Everything is going well but when I try to print each div data using a for loop everything mess up. Here my code: # -*- coding: utf-8 -*- from stem import Signal from stem.control import…

python dom web-crawler lxml scraper

asked Mar 05 '16 at 12:04

JJack_

859
9
30

2

votes

1 answer

What Python tools can I use to write a scraper of a password-protected webpage?

Suppose there is a password-protected website that I want to access to scrape some info from it and put it into a spreadsheet. For example, it could be my personal credit card account page and I would be scraping info about the latest transactions.…

python password-protection scrape scraper

asked Sep 21 '15 at 01:19

I Z

5,719
19
53
100

2

votes

1 answer

How to Parse this HTML with Web::Scraper?

I am trying to use Web::Scraper to parse the following HTML:

TITLE1
DESCRIPTION1

TITLE2
DESCRIPTION2

TITLE3
DESCRIPTION3

into 'test' => [ …

html perl dom web-scraping scraper

asked Sep 15 '15 at 08:12

user1768233

1,409
3
20
28

2

votes

1 answer

Web Scraper won't fill in a child window that my VBA code launches

I have the following code in VBA which opens up an IE page, fills it in and then clicks on a button that opens up a new IE window. However, my code is not able to fill in the first dropdown of the new window. Any help would be greatly…

vba bots scraper

asked Sep 03 '15 at 23:22

chris

155
2
3
17

2

votes

1 answer

Scraping .aspx page in node

I've been working on scraping the following site: http://www.fightingillini.com/schedule.aspx?path=softball I've had extensive experience using node/cheerio/scraperjs to scrape both static and dynamic content in the past, but I'm not having any luck…

javascript asp.net node.js scraper cheerio

asked Jul 10 '15 at 00:26

Mark

51
1
9

2

votes

0 answers

Can DOMPath be used to get javascript variables inside HTML document?

I want to get javascript variable printed out in the HTML code using DOMXPath,here is the HTML code

Questions tagged [scraper]