I'm writing a small crawler that fetches a URL multiple times, and I want all of the fetches to run at the same time (simultaneously).
I've written a little piece of code that should do that.
import thread
import time
from urllib2 import Request, urlopen, URLError, HTTPError

def getPAGE(FetchAddress):
    attempts = 0
    while attempts < 2:
        req = Request(FetchAddress, None)
        try:
            response = urlopen(req, timeout=8)  # fetching the url
            print "fetched url %s" % FetchAddress
        except HTTPError, e:
            print 'The server couldn\'t fulfill the request.'
            print 'Error code: ', str(e.code) + " address: " + FetchAddress
            time.sleep(4)
            attempts += 1
        except URLError, e:
            print 'Failed to reach the server.'
            print 'Reason: ', str(e.reason) + " address: " + FetchAddress
            time.sleep(4)
            attempts += 1
        except Exception, e:
            print 'Something bad happened in getPAGE.'
            print 'Reason: ', str(e) + " address: " + FetchAddress
            time.sleep(4)
            attempts += 1
        else:
            try:
                return response.read()
            except:
                print "there was an error with response.read()"
                return None
    return None

url = ("http://www.domain.com",)
for i in range(1, 50):
    thread.start_new_thread(getPAGE, url)
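As a side note, one way to check whether blocking waits overlap across threads at all is a timing sketch like this (my own experiment, not the crawler itself; `worker` and the 0.5 s sleep are stand-ins for the real fetch):

```python
import threading
import time

def worker(i):
    # stand-in for a blocking network call; sleep releases the GIL
    time.sleep(0.5)

start = time.time()
threads = [threading.Thread(target=worker, args=(i,)) for i in range(10)]
for t in threads:
    t.start()
for t in threads:
    t.join()
elapsed = time.time() - start
print(elapsed)  # roughly 0.5s, not 5s, because the waits overlap
```

If the total time is close to one sleep interval rather than ten, the threads really are waiting concurrently.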
From the Apache logs it doesn't seem like the threads are running simultaneously; there's a small gap between requests. It's almost undetectable, but I can see that the threads are not really parallel.
I've read about the GIL. Is there a way to bypass it without calling C/C++ code? I can't really understand how threading is even possible with the GIL, does Python basically run the next thread as soon as it finishes with the previous one?
Thanks.