Questions tagged [mechanize-ruby]

The Ruby library for automating interaction with websites.

The Mechanize library automates interaction with websites. It stores and sends cookies automatically, follows redirects and links, and populates and submits forms. It also keeps a history of the sites you have visited.

193 questions

0 answers

How to log in to a website using Mechanize

I'm trying to log in to this website and I keep getting this error: Mechanize::ResponseCodeError (404 => Net::HTTPNotFound for ... I followed the documentation and changed the user agent but still have this problem: require 'rubygems' require…

asked Jun 02 '20 at 18:14

Knowlege_Collector

2 answers

How to set a "base URL" for Webrat, Mechanize

I would like to specify a base URL so I don't have to always specify absolute URLs. How can I specify a base URL for Mechanize to use?

ruby cucumber mechanize webrat mechanize-ruby

asked Mar 21 '11 at 19:40

Andrew
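
One way to approach the base-URL question above, sketched with Ruby's standard `uri` library only: resolve each relative path against a fixed base before handing the result to the agent. The class name `BaseUrlResolver` and the `example.com` URLs are hypothetical, chosen for illustration; this is not a built-in Mechanize or Webrat setting.

```ruby
require 'uri'

# Hypothetical helper (not part of Mechanize): turns relative paths into
# absolute URLs by resolving them against a fixed base URL.
class BaseUrlResolver
  def initialize(base_url)
    @base_url = base_url
  end

  # Returns an absolute URL string for the given relative path.
  def resolve(path)
    URI.join(@base_url, path).to_s
  end
end

resolver = BaseUrlResolver.new('http://example.com/app/')

# A path without a leading slash is resolved relative to the base directory.
puts resolver.resolve('login')
# A path with a leading slash replaces the whole path of the base URL.
puts resolver.resolve('/users/1')
```

The resolved string could then be passed to the agent, for example `agent.get(resolver.resolve('login'))`, assuming `agent` is a `Mechanize` instance.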

1 answer

How to parse an Invalid XML

I have a project I'm working on where I request an XML document from a server and parse it to import the data into my system. I'm using Ruby 2.4.3. My issue is that the XML comes in with element tags that have names starting with numbers. …

ruby xml mechanize mechanize-ruby

asked Dec 23 '17 at 20:41

user1977840

2 answers

How to avoid getting blocked by websites when using Ruby Mechanize for web crawling

I am successfully scraping building data from a website (www.propertyshark.com) using a single address, but it looks like I get blocked once I use a loop to scrape multiple addresses. Is there a way around this? FYI, the information I'm trying to access…

ruby web-crawler mechanize-ruby

asked Sep 06 '17 at 17:54

Josh

2 answers

Get all tags following a certain with Mechanize? (Ruby)

How can I get all elements following once, like :

foo

bla bla

bar1
bar2
bar3

baz

…

ruby xpath css-selectors nokogiri mechanize-ruby

asked Jul 12 '17 at 10:51

Matrix

1 answer

Mechanize returns `connect_nonblock': SSL_connect returned=1 errno=0 state=SSLv3

I am trying to scrape a Crunchbase page but I got this error: ryzal~/Desktop/Sites/scraper$ ruby scraper.rb /Users/Ryzal/.rbenv/versions/2.3.1/lib/ruby/2.3.0/net/http.rb:933:in `connect_nonblock': SSL_connect returned=1 errno=0 state=SSLv3 read…

ruby ssl web-scraping mechanize mechanize-ruby

asked Jan 10 '17 at 00:46

Ryzal Yusoff

1 answer

Using page.at with CSS selector in Mechanize

I am trying to scrape a webpage with Mechanize, with the following structure:

Mechanize in Module, NameError 'agent'

Looking for advice on how to fix this error and refactor this code to improve it. require 'mechanize' require 'pry' require 'pp' module Mymodule class WebBot agent = Mechanize.new { |agent| agent.user_agent_alias = 'Windows…

ruby-on-rails ruby ruby-on-rails-4 mechanize mechanize-ruby

asked Jan 20 '16 at 19:39

user2012677

1 answer

Are Mechanize and its dependencies incompatible with multithreading in JRuby or am I doing something wrong on my end?

I'm trying to scrape a group of pages with Mechanize and JRuby. I'm using JRuby to have multithreading, since the program is a little slow on MRI. However, I've been running into some problems with what seems to be non-threadsafe data types in…

ruby multithreading web-scraping jruby mechanize-ruby

asked Nov 26 '15 at 17:07

GDP2

0 answers

Disable javascript validation using mechanize in rails

I'm scraping a website using the Mechanize gem. The website has a form that uses some JavaScript code for validation. How do I bypass that? On form submission, the website redirects to the same form page.

javascript ruby-on-rails ruby-on-rails-4 mechanize mechanize-ruby

asked Sep 30 '15 at 08:59

Talha Shoaib

1 answer

Only one image getting uploaded multiple times

I have been using the mechanize gem to scrape data from craigslist. I have a piece of code that uploads multiple images to craigslist, and all the file paths are correct, but only a single image gets uploaded multiple times. What's the reason? unless…

ruby-on-rails-4.2 ruby-2.1 mechanize-ruby

asked Jul 20 '15 at 12:49

codemilan

1 answer

Hooks to always be run after request - also on error

I know that mechanize has post_connect_hooks that will be run after the page is retrieved. However, if an exception happens, e.g. if you request an unknown URL like "http://dsjkhbgdfb.comsfg", then it runs pre_connect_hooks but not post_connect_hooks.…

mechanize mechanize-ruby

asked Jul 01 '15 at 13:34

Niels Kristian

1 answer

Using the Ruby Mechanize "links_with" to grab text but getting extra content

When I grab a group of links using the Mechanize links_with method I only want the text showing the link but I'm getting a series of extra characters: links = @some_page.links_with(text: /V\s.*(BENCH|EARCX)|(BENCH|EARCX).*V/) links.each do…

ruby nokogiri mechanize-ruby

asked Jun 03 '15 at 01:37

bkunzi01

0 answers

Sending a form with Mechanize (Ruby) returns an empty page?

I want to scrape the list of offers of a given product from amazon.com with the quantity in stock for each offer. To find this last piece of information (quantity) I need to add that offer to the cart, then edit the cart with the quantity 999, and then get the…

ruby forms mechanize-ruby

asked May 14 '15 at 17:18

Nafaa Boutefer

2 answers

How do I ignore the nil values in the loop with parsed values from Mechanize?

My text file contains a list of URLs. Using Mechanize, I'm using that list to parse out the title and meta description. However, some of those URL pages don't have a meta description, which stops my script with a nil error: undefined method `[]' for…

ruby null mechanize-ruby

asked May 08 '15 at 16:06

mr. greybox tester
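
A minimal sketch of one way to handle the nil case described in the question above, using Ruby's safe navigation operator (`&.`, available since Ruby 2.3). The `pages` array is hypothetical stand-in data in plain Ruby: each `:meta` value mimics a lookup that returns either a node-like object or `nil`, so no real Mechanize or Nokogiri call is shown.

```ruby
# Stand-in data (an assumption for illustration): :meta is either a
# hash playing the role of a found element, or nil when nothing matched.
pages = [
  { title: 'Home',  meta: { 'content' => 'Welcome page' } },
  { title: 'About', meta: nil }
]

descriptions = pages.map do |page|
  node = page[:meta]
  # &. skips the [] call when node is nil instead of raising NoMethodError;
  # the || supplies a fallback value for pages without a description.
  node&.[]('content') || '(no description)'
end

puts descriptions
```

The same pattern applies to any lookup that may return `nil`; an explicit `next if node.nil?` inside the loop is an alternative when such entries should be skipped entirely.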
