Questions tagged [scrape]

DO NOT USE THIS TAG. It is under an active cleanup: https://meta.stackoverflow.com/q/305314 Use [web-scraping] if your question is about scraping information from web resources (there is also [screen-scraping]) or use [pdf-scraping] if your question is about scraping information from pdf files. Use [data-extraction] if you need to extract data from other resources.

1204 questions
-2
votes
1 answer

Facebook like is broken, adds a trailing # to the final url

I'm running my site on Cargo Collective and trying to have likes per page. I cannot modify the code in the head tag only within the body tag. When i debug a page i get the following; Response Code 206 Fetched URL…
iamneuron
  • 11
  • 3
-2
votes
1 answer

screen scrape and ajax

I want three tables from a page that reloads itself every two minutes. I have screen scraped the three tables using php now I want to put these three tables in a div on my page what is the best ajaxy way to do this? should i save the three tables as…
pouya
  • 3,400
  • 6
  • 38
  • 53
-3
votes
1 answer

Scraping HTML file in R to extract specific lines

I am trying to develop an R script that can extract specific lines of downloaded HTML files. Here is a file example: ARMS Email System
-3
votes
1 answer

Get Data from Javascript codes with Python Selenium

I am trying to get javascript data from website with Selenium. However, I could not achieve what I wanted. Below is a screenshot of the javascript code. I want to extract Barcode numbers and variation values (such as 36 numbers, 37 numbers) among…
-3
votes
1 answer

Loop through List of urls and pulling off stock available

I have 15,000 web pages all in the same format listed in column A, I need to pull of the available stock for each product and paste it into Column H. I have tried multiple solutions that I have found online with no success. My first URL is :…
-3
votes
1 answer

How to access a part of an element from a list?

import cv2 import os import glob import pandas as pd from pylibdmtx import pylibdmtx import xlsxwriter # co de for scanning img_dir = "C:\\images" # Enter Directory of all images data_path = os.path.join(img_dir,'*g') files =…
-3
votes
1 answer

How can i scrape the google search result from page 1 to 2

I would like to scrape the google search result up to page 2 but i'm having trouble on the result of blank page of my website or timeout. for($j=0; $j<$acount; $j++){ sleep(60); for($sp = 0; $sp <= 10; $sp+=10){ $url =…
-3
votes
2 answers

How do i edit this invalid json and parse load it in python

jsonCallback1530150433250_46028 &&…
Han Jinn
  • 41
  • 8
-3
votes
1 answer

Multi find beautiful soup

I have a list of item like this: (the number of item lists can vary)

My title

http://myurl.com text

My title

http://myurl.com
uskap
  • 71
  • 1
  • 8
-3
votes
2 answers

How to scrape website with login required - example.com

I try to screen scrape a website using .net (Webclient, webrequest,response etc.) I've tried many methods but nothing seems to work. I always get "Please login to see this content!" site instead of full auction info :…
Kath
  • 15
-3
votes
2 answers

python scrape links keyword

I'm new to python and I need help for scraping all links with a certain keyword. The problem is that I'm getting the following error: if "air-max" in link["href"]: ^ IndentationError: expected an indented block. Here is my…
Phil
  • 46
  • 7
-3
votes
1 answer

Get All Play store search results with PHP scraper (not just top 20)

Summary : I want to fetch all Play store search result, problems is that, Apps that show after scroll that are not show in PHP file_get_content(). Detail: I'm trying to make a php based play store scraper. I checked all stackoverflow answers and…
asad app
  • 321
  • 1
  • 2
  • 14
-3
votes
2 answers

Scrape links from HTML

I have been always using preg_match to scrape URLs from HTML files but I wanted to extract only URLs that have .mp3 as their extension. I was told to try DOM and I have been trying to fix a code but it doesn't work. I get a blank page whatever I do.…
andrew
  • 31
  • 1
  • 9
-3
votes
2 answers

Delay Between WebScraping

I am using webscraping to get data from a certain website using SIMPLE PHP DOM PARSER CLASS There are few problems i am facing. There are two websites which are returning an error HTTP 403 forbidden Secondly As per the below code i am scraping 9…
Abdul Basit
  • 493
  • 11
  • 34
-3
votes
1 answer

how to get specific data from the sentence

public class newsidlink{ public static void main (String[] args) throws Exception{ String msg="ABCD NEWSBULLETIN01 Issued at HHSSIST NT=2147IST 28Oct2012 Name"; //to do code } } Hi I am a beginner in java.... please help me…
1 2 3
80
81