Cost of len() function

Question

What is the cost of the len() function for Python built-ins (list, tuple, string, dictionary)?

score 487 · Accepted Answer · edited Feb 27 '17 at 15:22

It's O(1) (constant time, independent of the actual length of the object, so very fast) for every type you've mentioned, plus set and others such as array.array.

edited Feb 27 '17 at 15:22 by kcpr

answered Jul 12 '09 at 04:40 by Alex Martelli

Thanks for the helpful answer! Are there any native types for which this is not the case? – mvanveen Mar 16 '12 at 03:41
Interesting that the "get length" runtime is only mentioned for list here: https://wiki.python.org/moin/TimeComplexity (it is not mentioned for other types) – Chaitanya Bapat May 17 '21 at 00:07
But why is it `O(1)`? – Freddy Mcloughlan May 18 '22 at 12:52
len() is a very frequent operation, and making it O(1) is extremely easy from the viewpoint of implementation -- Python just keeps each collection's "number of items" (length) stored and updated as part of the collection data structure. – Alex Martelli May 18 '22 at 17:02
I assume it's only O(1) because it was already calculated at time of creation, and getting len(x) is just accessing that stored value – Kevin Jun 19 '22 at 02:32
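The bookkeeping described in the comments above can be sketched in pure Python. This is only an illustration, not CPython's code: `CountedLinkedList`, `push`, `pop`, and `walk_count` are hypothetical names. A linked list is used because counting its nodes would otherwise take O(n); the stored counter is updated on every mutation (not just at creation), so `__len__` only has to return it.

```python
class CountedLinkedList:
    """Singly linked list that caches its item count (illustrative only)."""

    class _Node:
        __slots__ = ("value", "next")

        def __init__(self, value, next_node=None):
            self.value = value
            self.next = next_node

    def __init__(self, items=()):
        self._head = None
        self._count = 0  # stored length, updated on every mutation
        for item in items:
            self.push(item)

    def push(self, value):
        """Insert at the front and increment the stored count."""
        self._head = self._Node(value, self._head)
        self._count += 1

    def pop(self):
        """Remove from the front and decrement the stored count."""
        if self._head is None:
            raise IndexError("pop from empty list")
        node = self._head
        self._head = node.next
        self._count -= 1
        return node.value

    def __len__(self):
        # O(1): return the cached counter instead of walking the nodes.
        return self._count

    def walk_count(self):
        # O(n) alternative, shown only for contrast.
        n, node = 0, self._head
        while node is not None:
            n += 1
            node = node.next
        return n


lst = CountedLinkedList(range(5))
lst.pop()
print(len(lst), lst.walk_count())  # both report the same count
```

The cost of keeping the counter current is one integer update per insertion or removal, which is why storing it is cheap compared with recounting on every len() call.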

score 163 · Answer 2 · edited Apr 26 '22 at 01:08

Calling len() on those data types is O(1) in CPython, the official and most common implementation of the Python language. Here's a link to a table that provides the algorithmic complexity of many different functions in CPython:

TimeComplexity Python Wiki Page: https://wiki.python.org/moin/TimeComplexity

answered Jul 12 '09 at 04:59 by James Thompson

score 122 · Answer 3 · edited Jul 22 '15 at 10:15

All those objects keep track of their own length. The time to extract the length is small (O(1) in big-O notation) and mostly consists of the following [rough description, written in Python terms, not C terms]: look up "len" in a dictionary and dispatch to the built-in len function, which looks up the object's __len__ method and calls it. All that method has to do is return self.length.
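The dispatch described above can be observed from Python itself. In this minimal sketch, `Box`, `add`, and `Bad` are hypothetical names, and `self.length` mirrors the answer's illustrative attribute; the built-in types keep their count in C, so no such Python attribute exists on them.

```python
class Box:
    """Hypothetical container that tracks its own length."""

    def __init__(self, items):
        self._items = list(items)
        self.length = len(self._items)  # maintained by the class itself

    def add(self, item):
        self._items.append(item)
        self.length += 1

    def __len__(self):
        # All len() needs from the object: return the stored value.
        return self.length


b = Box("abc")
b.add("d")

# len(obj) looks up __len__ on the object's type and calls it.
assert len(b) == type(b).__len__(b) == 4
assert len([1, 2, 3]) == list.__len__([1, 2, 3]) == 3

# Built-in lists expose __len__, but no Python-level "length" attribute.
assert "__len__" in dir(list)
assert "length" not in dir(list)


class Bad:
    def __len__(self):
        return -1  # len() validates the result and rejects negatives


try:
    len(Bad())
except ValueError as exc:
    print("rejected:", exc)
```

The last part shows that len() is a thin wrapper: it calls `__len__` and checks the result, without counting anything itself.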

edited Jul 22 '15 at 10:15 by Wolf

answered Jul 12 '09 at 06:17 by John Machin

Why doesn't `length` show up in the output of `dir(list)`? – ViFI Apr 26 '20 at 02:38
@ViFI Because it is just an example. The illustrated `list.length` variable is implemented in C, not Python. – Ekrem Dinçel Jun 16 '20 at 15:51

mechanical_meat · Answer 4 · 2013-01-21T17:20:35.970

The below measurements provide evidence that len() is O(1) for oft-used data structures.

A note regarding timeit: when the -s flag is used and two strings are passed to timeit, the first string (the setup) is executed only once and is not timed.

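List (these commands are from Python 2, where range() returns a list; in Python 3 use list(range(...)) instead):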

$ python -m timeit -s "l = range(10);" "len(l)"
10000000 loops, best of 3: 0.0677 usec per loop

$ python -m timeit -s "l = range(1000000);" "len(l)"
10000000 loops, best of 3: 0.0688 usec per loop

Tuple:

$ python -m timeit -s "t = (1,)*10;" "len(t)"
10000000 loops, best of 3: 0.0712 usec per loop

$ python -m timeit -s "t = (1,)*1000000;" "len(t)"
10000000 loops, best of 3: 0.0699 usec per loop

String:

$ python -m timeit -s "s = '1'*10;" "len(s)"
10000000 loops, best of 3: 0.0713 usec per loop

$ python -m timeit -s "s = '1'*1000000;" "len(s)"
10000000 loops, best of 3: 0.0686 usec per loop

Dictionary (dictionary-comprehension available in 2.7+):

$ python -mtimeit -s"d = {i:j for i,j in enumerate(range(10))};" "len(d)"
10000000 loops, best of 3: 0.0711 usec per loop

$ python -mtimeit -s"d = {i:j for i,j in enumerate(range(1000000))};" "len(d)"
10000000 loops, best of 3: 0.0727 usec per loop

Array:

$ python -mtimeit -s"import array;a=array.array('i',range(10));" "len(a)"
10000000 loops, best of 3: 0.0682 usec per loop

$ python -mtimeit -s"import array;a=array.array('i',range(1000000));" "len(a)"
10000000 loops, best of 3: 0.0753 usec per loop

Set (set-comprehension available in 2.7+):

$ python -mtimeit -s"s = {i for i in range(10)};" "len(s)"
10000000 loops, best of 3: 0.0754 usec per loop

$ python -mtimeit -s"s = {i for i in range(1000000)};" "len(s)"
10000000 loops, best of 3: 0.0713 usec per loop

Deque:

$ python -mtimeit -s"from collections import deque;d=deque(range(10));" "len(d)"
100000000 loops, best of 3: 0.0163 usec per loop

$ python -mtimeit -s"from collections import deque;d=deque(range(1000000));" "len(d)"
100000000 loops, best of 3: 0.0163 usec per loop
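The same comparison can be scripted with the `timeit` module instead of the command line. This is a sketch for Python 3, not the script behind the numbers above: the helper names `make_containers` and `time_len` are made up here, and absolute timings will vary by machine and interpreter version. The point is only to compare the small and large column for each type.

```python
import timeit
from array import array
from collections import deque


def make_containers(n):
    """Build one container of each kind holding n items."""
    return {
        "list": list(range(n)),
        "tuple": (1,) * n,
        "str": "1" * n,
        "dict": {i: i for i in range(n)},
        "array": array("i", range(n)),
        "set": set(range(n)),
        "deque": deque(range(n)),
    }


def time_len(container, number=100_000, repeat=3):
    """Best observed per-call time of len(container), in seconds."""
    timer = timeit.Timer("len(c)", globals={"c": container})
    return min(timer.repeat(repeat=repeat, number=number)) / number


results = {}
for n in (10, 1_000_000):
    # Construction happens here, outside the timed statement.
    for name, container in make_containers(n).items():
        results[(name, n)] = time_len(container)

for name in make_containers(0):
    small = results[(name, 10)] * 1e9
    large = results[(name, 1_000_000)] * 1e9
    print(f"{name:6s} n=10: {small:7.1f} ns   n=1000000: {large:7.1f} ns")
```

Building the containers outside the timed statement plays the same role as the -s flag: only the `len(c)` call is measured.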

This is not a very good benchmark, even though it shows what we already know, because range(10) and range(1000000) are not supposed to be O(1). – Unknown Jul 12 '09 at 05:45
This is by far the best answer. You should just add a conclusion in case someone doesn't realize it shows constant time. – santiagobasulto Jan 21 '13 at 13:14
Thanks for the comment. I added a note about the O(1) complexity of `len()`, and also fixed the measurements to properly use the `-s` flag. – mechanical_meat Jan 21 '13 at 17:21
It is important to note that saving the length into a variable could save a significant amount of computational time: `python -m timeit -s "l = range(10000);" "len(l); len(l); len(l)"` gives 223 nsec per loop, while `python -m timeit -s "l = range(100);" "len(l)"` gives 66.2 nsec per loop. – Radostin Stoyanov Jan 04 '20 at 19:27

score 45 · Answer 5 · answered Sep 02 '18 at 07:02

len() is O(1) because lists are stored in RAM as tables (a series of contiguous addresses). To know where the table ends, the computer needs two things: the start point and the length. That is why len() is O(1): the computer stores the length, so it just needs to look it up.
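The "start point plus length" idea can be sketched as a toy array-backed list. Everything here is hypothetical (`TinyArrayList`, the initial capacity of 4, and the doubling growth policy are assumptions for illustration, not CPython's actual allocation strategy). A preallocated Python list stands in for the contiguous block, and the stored size is what marks where the used part ends.

```python
class TinyArrayList:
    """Toy array-backed list: a preallocated buffer plus a stored size."""

    def __init__(self):
        self._capacity = 4                      # assumed starting capacity
        self._buffer = [None] * self._capacity  # stands in for a contiguous block
        self._size = 0                          # number of slots actually in use

    def append(self, value):
        if self._size == self._capacity:
            self._grow()
        self._buffer[self._size] = value
        self._size += 1

    def _grow(self):
        # Doubling is an assumption made for this sketch.
        self._capacity *= 2
        new_buffer = [None] * self._capacity
        new_buffer[: self._size] = self._buffer[: self._size]
        self._buffer = new_buffer

    def __getitem__(self, index):
        # The stored size is what tells us where the used region stops.
        if not 0 <= index < self._size:
            raise IndexError("index out of range")
        return self._buffer[index]

    def __len__(self):
        # O(1): the size is already stored alongside the buffer.
        return self._size


arr = TinyArrayList()
for i in range(10):
    arr.append(i * i)
print(len(arr), arr[9], arr._capacity)
```

Note that the buffer can be larger than the number of items it holds, so the length cannot be inferred from the allocation alone; it has to be stored, and once it is stored, reading it is constant time.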

answered Sep 02 '18 at 07:02 by RAHUL KUMAR

I don't think this is true for python lists. They're linked lists, not arrays, so contiguous addresses are not guaranteed – bluppfisk Jul 05 '23 at 06:34
@bluppfisk You are totally wrong. Here are the python docs https://docs.python.org/3/faq/design.html#how-are-lists-implemented-in-cpython – airled Aug 05 '23 at 09:52