Questions tagged [mechanize]

Mechanize is a library for automated web browsing. It was originally developed for Perl; there are now also Python and Ruby implementations.

Mechanize is a Ruby library for automating interaction with websites. It automatically stores and sends cookies, follows redirects, follows links, and fills in and submits forms. It also keeps a history of the pages you have visited. It is adapted from the Perl WWW::Mechanize module. There is also a Python port, mechanize-python.

2512 questions

2 answers

BeautifulSoup HTML table parsing

I am trying to parse information (html tables) from this site: http://www.511virginia.org/RoadConditions.aspx?j=All&r=1 Currently I am using BeautifulSoup and the code I have looks like this from mechanize import Browser from BeautifulSoup import…

python beautifulsoup html-table html-parsing mechanize

asked Jan 13 '10 at 18:50

Stephen Tanner

7 answers

WebBrowsing in C# - Libraries, Tools etc. - Anything like Mechanize in Perl?

Looking for something similar to Mechanize for .NET... If you don't know what Mechanize is.. http://search.cpan.org/dist/WWW-Mechanize/ I will maintain a list of suggestions here. Anything for browsing/posting/screen scraping (Other than WebRequest…

c# authentication screen-scraping mechanize

asked Jan 27 '10 at 20:00

Jason

2 answers

Detect redirect with ruby mechanize

I am using the mechanize/nokogiri gems to parse some random pages. I am having problems with 301/302 redirects. Here is a snippet of the code: agent = Mechanize.new page = agent.get('http://example.com/page1') The test server on mydomain.com will…

ruby http redirect mechanize

asked Jul 06 '13 at 12:29

user337620

11 answers

Emulating a browser to download a file?

There's an FLV file on the web that can be downloaded directly in Chrome. The file is a television program, published by CCTV (China Central Television). CCTV is a non-profit, state-owned broadcaster, financed by the Chinese taxpayer, which allows…

python shell mechanize wget

asked Feb 13 '13 at 02:31

showkey

1 answer

mechanize how to get current url

I have this code require 'mechanize' @agent = Mechanize.new page = @agent.get('http://something.com/?page=1') next_page = page.link_with(:href=>/^?page=2/).click As you can see this code should go to the next page. The next_page should have url…

ruby mechanize

asked Apr 05 '12 at 16:36

megas

2 answers

Need more mechanize documentation (python)

I'm having a really hard time finding a good comprehensive source for Mechanize's documentation. Even the main documentation on mechanize's site isn't really that great: it only seems to list examples. Is there a more formal place for documentation…

python mechanize

asked Feb 15 '12 at 06:08

varatis

3 answers

Clicking a button with Ruby Mechanize

I have a particularly difficult form whose search button I am trying to click, and I can't seem to do it. Here is the code for the form from the page source:

ruby mechanize

asked Aug 31 '11 at 21:11

Sean

2 answers

Maintaining cookies between Mechanize requests

I'm trying to use the Ruby version of Mechanize to extract my employer's tickets from a ticket management system that we're moving away from that does not supply an API. Problem is, it seems Mechanize isn't keeping the cookies between the post call…

ruby screen-scraping mechanize

asked Aug 12 '11 at 21:31

adamjford

3 answers

Click on a javascript link within python?

I am navigating a site using Python's mechanize module and having trouble clicking on a JavaScript link for the next page. I did a bit of reading and people suggested I need python-spidermonkey and DOMforms. I managed to get them installed but I am not…

javascript python screen-scraping mechanize spidermonkey

asked Mar 06 '11 at 01:06

Lostsoul

1 answer

Using Python and Mechanize to submit form data and authenticate

I want to submit login to the website Reddit.com, navigate to a particular area of the page, and submit a comment. I don't see what's wrong with this code, but it is not working in that no change is reflected on the Reddit site. import…

python networking screen-scraping mechanize

asked Jan 18 '11 at 04:24

Parseltongue

3 answers

How do I set a timeout value for Python's mechanize?

How do I set a timeout value for Python's mechanize?

python timeout mechanize

asked Aug 24 '10 at 01:22

Joe Schmoe

2 answers

Web Crawler - Ignore Robots.txt file?

Some servers have a robots.txt file in order to stop web crawlers from crawling through their websites. Is there a way to make a web crawler ignore the robots.txt file? I am using Mechanize for python.

python web-crawler mechanize robots.txt

asked Dec 05 '11 at 14:05

Craig Locke

1 answer

How can I add a cookie to an existing cookielib CookieJar instance in Python?

I have a CookieJar that's being used with Mechanize that I want to add a cookie to. How can I go about doing this? make_cookie() and set_cookie() weren't clear enough for me. br = mechanize.Browser() cj =…

python cookies mechanize cookiejar cookielib

asked Jan 30 '10 at 20:12

Paul
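
For the CookieJar question above, here is a minimal sketch of one way to add a cookie by hand. It assumes Python 3, where the `cookielib` module is named `http.cookiejar`. The helper `add_cookie` and the cookie name, value, and domain are made up for illustration and do not come from the question. The excerpt is truncated, so this does not show how the asker's jar is attached to the mechanize browser.

```python
from http.cookiejar import Cookie, CookieJar


def add_cookie(jar, name, value, domain, path="/"):
    """Build a cookie by hand and store it in an existing jar (hypothetical helper)."""
    cookie = Cookie(
        version=0,
        name=name,
        value=value,
        port=None,
        port_specified=False,
        domain=domain,
        domain_specified=True,
        domain_initial_dot=domain.startswith("."),
        path=path,
        path_specified=True,
        secure=False,
        expires=None,      # no expiry time: behaves as a session cookie
        discard=True,
        comment=None,
        comment_url=None,
        rest={},
    )
    # set_cookie() stores the cookie unconditionally, without policy checks.
    jar.set_cookie(cookie)
    return cookie


# Illustrative values only.
jar = CookieJar()
add_cookie(jar, "sessionid", "abc123", "example.com")
print([(c.name, c.value, c.domain) for c in jar])
```

`Cookie` takes every field explicitly, which is why the constructor call is long. Wrapping it in a small helper keeps the call site readable.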

8 answers

Programmatic Python Browser with JavaScript

I want to screen-scrape a web-site that uses JavaScript. There is mechanize, the programmatic web browser for Python. However, it (understandably) doesn't interpret javascript. Is there any programmatic browser for Python which does? If not, is…

javascript python browser screen-scraping mechanize

asked Dec 16 '09 at 18:37

Claudiu

1 answer

Ruby Mechanize https error

I'm trying to do the following: page = Mechanize.new.get "https://sis-app.sph.harvard.edu:9030/prod/bwckschd.p_disp_dyn_sched" But I only get this exception: OpenSSL::SSL::SSLError: SSL_connect returned=1 errno=0 state=SSLv2/v3 read server hello A:…

ruby ssl https mechanize

asked Aug 07 '12 at 23:00

wrongusername
