DO NOT USE THIS TAG. It is undergoing active cleanup: https://meta.stackoverflow.com/q/305314 Use [web-scraping] if your question is about scraping information from web resources (there is also [screen-scraping]), or use [pdf-scraping] if your question is about scraping information from PDF files. Use [data-extraction] if you need to extract data from other resources.
Questions tagged [scrape]
1204 questions
-2
votes
1 answer
Facebook like is broken, adds a trailing # to the final url
I'm running my site on Cargo Collective and trying to have likes per page. I cannot modify the code in the head tag, only within the body tag.
When I debug a page I get the following:
Response Code 206
Fetched URL…

iamneuron
- 11
- 3
-2
votes
1 answer
screen scrape and ajax
I want three tables from a page that reloads itself every two minutes. I have screen scraped the three tables using PHP. Now I want to put these three tables in a div on my page. What is the best Ajax way to do this?
Should I save the three tables as…

pouya
- 3,400
- 6
- 38
- 53
-3
votes
1 answer
Scraping HTML file in R to extract specific lines
I am trying to develop an R script that can extract specific lines of downloaded HTML files. Here is a file example:
ARMS Email System

benjamin1989
- 1
- 2
-3
votes
1 answer
Get Data from Javascript codes with Python Selenium
I am trying to get JavaScript data from a website with Selenium. However, I could not achieve what I wanted. Below is a screenshot of the JavaScript code. I want to extract barcode numbers and variation values (such as 36 numbers, 37 numbers) among…

tarikdelibas
- 3
- 2
-3
votes
1 answer
Loop through List of urls and pulling off stock available
I have 15,000 web pages, all in the same format, listed in column A. I need to pull off the available stock for each product and paste it into column H. I have tried multiple solutions that I found online, with no success.
My first URL is:…

Rory Watts
- 1
- 1
-3
votes
1 answer
How to access a part of an element from a list?
import cv2
import os
import glob
import pandas as pd
from pylibdmtx import pylibdmtx
import xlsxwriter
# code for scanning
img_dir = "C:\\images" # Enter Directory of all images
data_path = os.path.join(img_dir,'*g')
files =…

Kritesh Lunawat
- 47
- 7
-3
votes
1 answer
How can i scrape the google search result from page 1 to 2
I would like to scrape Google search results up to page 2, but my website ends up showing a blank page or timing out.
for($j=0; $j<$acount; $j++){
sleep(60);
for($sp = 0; $sp <= 10; $sp+=10){
$url =…

Marc Justin Rait
- 23
- 5
-3
votes
2 answers
How do i edit this invalid json and parse load it in python
jsonCallback1530150433250_46028 &&…

Han Jinn
- 41
- 8
-3
votes
1 answer
Multi find beautiful soup
I have a list of items like this (the number of item lists can vary):
My title
http://myurl.com textMy title
http://myurl.com
uskap
- 71
- 1
- 8
-3
votes
2 answers
How to scrape website with login required - example.com
I am trying to screen scrape a website using .NET (WebClient, WebRequest, response, etc.).
I've tried many methods but nothing seems to work.
I always get the "Please login to see this content!" page instead of the
full auction info:…

Kath
- 15
-3
votes
2 answers
python scrape links keyword
I'm new to Python and I need help scraping all links with a certain keyword. The problem is that I'm getting the following error:
if "air-max" in link["href"]:
^
IndentationError: expected an indented block.
Here is my…

Phil
- 46
- 7
-3
votes
1 answer
Get All Play store search results with PHP scraper (not just top 20)
Summary:
I want to fetch all Play Store search results. The problem is that apps that appear only after scrolling are not returned by PHP file_get_contents().
Detail:
I'm trying to make a PHP-based Play Store scraper.
I checked all Stack Overflow answers and…

asad app
- 321
- 1
- 2
- 14
-3
votes
2 answers
Scrape links from HTML
I have always used preg_match to scrape URLs from HTML files, but I wanted to extract only URLs that have .mp3 as their extension. I was told to try DOM and I have been trying to fix some code, but it doesn't work. I get a blank page whatever I do.…

andrew
- 31
- 1
- 9
-3
votes
2 answers
Delay Between WebScraping
I am scraping data from a certain website using the Simple PHP DOM Parser class.
I am facing a few problems.
First, two websites are returning an HTTP 403 Forbidden error.
Secondly, as per the code below, I am scraping 9…

Abdul Basit
- 493
- 11
- 34
-3
votes
1 answer
how to get specific data from the sentence
public class newsidlink{
public static void main (String[] args) throws Exception{
String msg="ABCD NEWSBULLETIN01 Issued at HHSSIST NT=2147IST 28Oct2012 Name";
//to do code
}
}
Hi, I am a beginner in Java. Please help me…

Karthik Kolla
- 29
- 2