Question 1

I would like to read a website asynchronously, which isn't possible with urllib as far as I know. I tried reading with plain sockets, but HTTP is giving me hell. I run into all kinds of funky encodings, for example Transfer-Encoding: chunked, and have to parse all that stuff manually. I feel like I'm coding C, not Python, at the moment.

Isn't there a nicer way, like urllib but asynchronous? I don't really feel like re-implementing the whole HTTP specification when it's all been done before.

Twisted isn't an option currently.

Greetings,

Tom

Answer

You can implement an asynchronous call yourself. For each call, start a new thread (or try to get one from a pool) and use a callback to process it.

You can do this very nicely with a decorator:

    # Python 2 code: uses urllib.urlopen and the print statement.
    import threading
    import urllib

    def threaded(callback=lambda *args, **kwargs: None, daemonic=False):
        """Decorate a function to run in its own thread and report the result
        by calling callback with it."""
        def innerDecorator(func):
            def inner(*args, **kwargs):
                # Run func in the new thread, then hand its result to callback.
                target = lambda: callback(func(*args, **kwargs))
                t = threading.Thread(target=target)
                t.setDaemon(daemonic)
                t.start()
            return inner
        return innerDecorator

    @threaded()
    def get_webpage(url):
        data = urllib.urlopen(url).read()
        print data
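The decorator above starts a fresh thread for every call. The "get one from a pool" variant mentioned earlier could look roughly like the sketch below. It assumes Python 3 (unlike the Python 2 snippet above), and the names `fetch`, `run_async` and `_pool`, the pool size, and the timeout are made up for illustration only.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.request import urlopen

# A small shared pool; the size of 4 is an arbitrary choice for this sketch.
_pool = ThreadPoolExecutor(max_workers=4)


def fetch(url, timeout=10):
    """Blocking download; meant to run inside a worker thread."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def run_async(func, callback, *args, **kwargs):
    """Run func(*args, **kwargs) on the pool and pass its result to callback."""
    future = _pool.submit(func, *args, **kwargs)
    # add_done_callback passes the finished Future, so unwrap the result here.
    # If func raised, f.result() re-raises inside this wrapper, and
    # concurrent.futures logs the exception instead of propagating it.
    future.add_done_callback(lambda f: callback(f.result()))
    return future
```

A call such as `run_async(fetch, handle_page, "http://example.com/")` returns immediately, and `handle_page` (a hypothetical function taking the page body) is called once the download finishes, normally on a worker thread. The HTTP details, including chunked transfer encoding, are left to `urlopen`.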

Reading a website with asyncore
