Question 1

Here is the code I ran:

import timeitprint timeit.Timer('''a = sorted(x)''', '''x = [(2, 'bla'), (4, 'boo'), (3, 4), (1, 2) , (0, 1), (4, 3), (2, 1) , (0, 0)]''').timeit(number = 1000)
print timeit.Timer('''a=x[:];a.sort()''', '''x = [(2, 'bla'), (4, 'boo'), (3, 4), (1, 2) , (0, 1), (4, 3), (2, 1) , (0, 0)]''').timeit(number = 1000)

and here are the results:

0.00259663215837
0.00207390190177

I would like to know why using .sort() is consistently faster than sorted() even though both are copying lists?

Note: I am running Python 2.7 on an 2.53Ghz i5 with Win7

Question 2

The difference you are looking at is miniscule, and completely goes away for longer lists. Simply adding * 1000 to the definition of x gives the following results on my machine:

2.74775004387
2.7489669323

My best guess for the reason that sorted() was slightly slower for you is that sorted() needs to use some generic code that can copy any iterable to a list, while copying the list directly can make the assumption that the source is also a list. The sorting code used by CPython is actually the same for list.sort() and sorted(), so that's not what is causing the difference.

Edit: The source code of the current development version of sorted() does the moral equivalent of

a = list(x)
a.sort()

and indeed, using this code instead of your second version eliminates any significant speed differences for any list sizes.

Why is Pythons sorted() slower than copy, then .sort()

Related Q&A

How to efficiently unroll a matrix by value with numpy?

Anaconda Python 3.6 -- pythonw and python supposed to be equivalent?

Good way of handling NoneType objects when printing in Python

problems with easy_install pycrypto

What is the most efficient way to do a sorted reduce in PySpark?

Interactive figure with OO Matplotlib

nose2 vs py.test with isolated processes

ValueError: Attempt to reuse RNNCell with a different variable scope than its first use

Convex Hull and SciPy

Flask Confirm Action