Questions tagged [import.io]

Transform any website into a table of data or a Structured API in minutes without even writing any code.

https://import.io/

198 questions
13
votes
2 answers

Python requests call with URL using parameters

I am trying to make a call to the import.io API. This call needs to have the following…
johan855
  • 1,578
  • 4
  • 26
  • 51
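
As a rough illustration of the question above, here is a minimal sketch of how a dictionary of parameters is percent-encoded into a URL's query string. It uses only the standard library's `urllib.parse`, not `requests`; with `requests`, passing the same dictionary as `params=` to `requests.get` performs a comparable encoding. The endpoint and the parameter names are hypothetical placeholders, not import.io's documented API.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Hypothetical endpoint and parameter names, for illustration only.
BASE_URL = "https://api.example.com/query"


def build_url(base_url, params):
    """Return base_url with params appended as a percent-encoded query string."""
    return f"{base_url}?{urlencode(params)}"


# A parameter value that is itself a URL must be escaped so that its
# ':' and '/' characters and spaces do not break the outer query string.
url = build_url(
    BASE_URL,
    {"input": "webpage/url:http://example.com/a b", "_apikey": "KEY"},
)
print(url)

# Decoding the query string recovers the original values.
decoded = parse_qs(urlsplit(url).query)
print(decoded["input"][0])
```

Building the query string from a dictionary, instead of concatenating strings by hand, avoids double-encoding and missed escapes when a parameter value contains reserved characters.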
10
votes
2 answers

XPath to select 1 element if one of two exists

I want to select one element when either of two elements exists, using the same expression on two pages. 1st page (price with discount): $2,990.00
Amr Ali
  • 449
  • 2
  • 11
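
One way to think about the "one of two" selection asked about above is as an ordered fallback: try the preferred path first, then the alternative. The sketch below shows that idea with Python's standard `xml.etree.ElementTree`, which supports only a subset of XPath. The class names and the sample markup are made up for illustration, and this is not how import.io evaluates XPath.

```python
import xml.etree.ElementTree as ET

# Hypothetical paths: prefer the discounted price, fall back to the regular one.
PATHS = [
    ".//span[@class='discount-price']",
    ".//span[@class='regular-price']",
]


def first_match(root, paths):
    """Return the text of the first element found by trying paths in order."""
    for path in paths:
        node = root.find(path)
        if node is not None:
            return node.text
    return None


# Made-up sample pages: one with a discount, one without.
page_with_discount = ET.fromstring(
    "<div><span class='regular-price'>$3,500.00</span>"
    "<span class='discount-price'>$2,990.00</span></div>"
)
page_without_discount = ET.fromstring(
    "<div><span class='regular-price'>$3,500.00</span></div>"
)

print(first_match(page_with_discount, PATHS))
print(first_match(page_without_discount, PATHS))
```

Trying the paths in preference order means the discounted price wins whenever it is present, regardless of where it appears in the document.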
8
votes
2 answers

How can I extract data behind a login page using import.io

I need to crawl some data that sits behind a login page. To be able to scrape it, I need a tool that can log in and then crawl the pages behind the login. Is it possible to do this with import.io?
hopper
  • 4,230
  • 8
  • 36
  • 49
7
votes
3 answers

Running the crawler doesn't get the same data it does when training

When training my crawler to scrape a Yelp page, it gets all the information without me doing anything, but when I run the crawler, the address is not recognized and is not recorded.
Ian Woodfill
  • 164
  • 1
  • 1
  • 9
7
votes
1 answer

XPath following-sibling for crawling not returning sibling

I am trying to create a crawler to extract some attribute data from supplier websites that I can audit against our internal attribute database and am new to import.io. I watched a bunch of videos, but though my syntax seems to be right, my manual…
Elizabeth VO
  • 111
  • 1
  • 6
6
votes
1 answer

import.io URL returns

I'm using Import.io to catalogue assets on a client's blog. I've got it all working nicely and I'm seeing some great results, but when I do a Bulk Extract I'm getting a handful fewer results than I put in, around 145 versus the 150 or so I…
CMteam
  • 71
  • 3
6
votes
0 answers

How do I get the data_type of each column that I extract from a page using the New Extractor in import.io?

I am working with import.io and I use the New Extractor to fetch data from a site page. I create columns of different types and insert content into them. After that, I publish the data. I get the data in JSON format. But that JSON data does not give me…
Suraj Dalvi
  • 988
  • 1
  • 20
  • 34
6
votes
1 answer

import.io stuck at Test your connector

I have created a connector using the import.io Windows application. I am able to successfully test my connector using example queries. I want to extract the data returned from this connector into a dataset. I am stuck at the "Test your connector" option. Here is…
FunkYou
  • 63
  • 3
6
votes
1 answer

What is the XPath or regex to extract this meta tag?

What is the XPath or regex to extract the content of this meta tag?
Cesar Maiorino
  • 563
  • 5
  • 16
5
votes
1 answer

Can I use extractor for local html files

I'm using this article here to load a local html file. I can use extractor to get the data, but I can't publish the API. I'd like to run the API extractor on multiple pages. Is this possible?
Craig.Pearce
  • 746
  • 7
  • 25
5
votes
1 answer

Calling import.io dataset created with "Chain API" via REST

I created a dataset using 2 extractors: a "many rows" extractor which is then called by a "just one row" extractor via the "URLs from another API / Chain APIs" option. This has given me the data that I need and I have saved it as a dataset. Although…
A.J
  • 51
  • 1
5
votes
1 answer

How to scrape multiple pages with Import.io

I am trying to scrape a list of events from a site, http://www.cityoflondon.gov.uk/events/. But when scraping it with import.io I am able to extract just the first page. How could I extract all pages at once?
Huander
  • 71
  • 2
  • 11
5
votes
2 answers

What are guids?

Hi there, can you please tell me what the connector-guid, user-guid and API key are in the code given below, and how to get them for any website?
Ali Hamza
  • 97
  • 2
  • 9
5
votes
1 answer

Setting up import.io crawler with xpath or regexp

I am currently trying to set up a web crawler to extract data from real estate web sites. It is a trend with these sites that certain info is not in the same place for each page, so I must figure out how to have it extract text elements that contain…
user2480757
  • 123
  • 2
  • 6
5
votes
2 answers

How to use substring() with Import.io?

I'm having some issues with XPath and import.io and I hope you'll be able to help me. :) The HTML code: For the moment, I manage to extract the content of the href ( page.php?var=12345 ) with…
Pierre
  • 114
  • 1
  • 13
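
XPath 1.0 defines a `substring-after()` string function, which is one plausible way to isolate the value in an href such as the `page.php?var=12345` quoted above. The sketch below mirrors that function's behavior in plain Python so the result can be checked outside any tool. Whether import.io's XPath field accepts this function is an assumption, not something confirmed here, and `substring_after` is a helper name introduced for this example.

```python
def substring_after(text, marker):
    """Mimic XPath 1.0 substring-after().

    Returns the part of text that follows the first occurrence of marker,
    or an empty string when marker does not occur in text.
    """
    if marker == "":
        # XPath returns the whole string for an empty marker.
        return text
    _, sep, tail = text.partition(marker)
    return tail if sep else ""


href = "page.php?var=12345"  # the href value quoted in the question
value = substring_after(href, "var=")
print(value)
```

The equivalent XPath expression would be along the lines of `substring-after(@href, 'var=')`, evaluated on the link element.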