
I wrote the following simple function that checks whether str1 is a permutation of str2:

def is_perm(str1, str2):
    return True if sorted(str1)==sorted(str2) else False
    

Assuming that sorted(str) has a time complexity of O(n*logn), we can expect a time complexity of O(2*n*logn)=O(n*logn). The following function is an attempt to achieve a better time complexity:


def is_perm2(str1, str2):
    dict1 = {}
    dict2 = {}
    
    for char in str1:
        if char in dict1:
            dict1[char] += 1 
        else:
            dict1[char] = 1
    for char in str2:
        if char in dict2:
            dict2[char] += 1         
        else:
            dict2[char] = 1
        
    if dict1==dict2:
        return True
    else:
        return False
    

Each for-loop iterates n times. Assuming that dictionary lookups and both dictionary updates have constant time complexity, I expect an overall complexity of O(2n)=O(n). However, timeit measurements show the following contradictory results. Why is is_perm2 slower than is_perm over 10,000,000 executions even though its time complexity looks better? Are my assumptions wrong?

import timeit

print(timeit.timeit('is_perm("helloworld","worldhello")', 'from __main__ import is_perm', number=10000000))
print(timeit.timeit('is_perm2("helloworld","worldhello")', 'from __main__ import is_perm2', number=10000000))

# output of first print-call: 12.4199592999993934 seconds
# output of second print-call: 37.13826630001131 seconds

Moritz Wolff
  • You won't see the effects of the difference in complexity until you **increase the length of the strings substantially**. Note that the 10000000 repetitions are unrelated to the complexity. The time should always increase linearly with the number of repetitions, regardless of the algorithm being measured. To put it another way, the repetition count is not in any way related to `n`. In your code, `n` is the length of the string. – user3386109 Aug 20 '21 at 04:36
  • @user3386109 of course, how stupid of me. This seems to be the right answer. – Moritz Wolff Aug 20 '21 at 07:54

2 Answers


There is no guarantee that an algorithm with a time complexity of O(nlogn) will be slower than one with a time complexity of O(n) for a given input. The O(n) algorithm could, for instance, have a large constant overhead, making it slower for input sizes below some threshold (say, 100000).

In your test the input size is 10 ("helloworld"), which doesn't tell us much. Repeating that test doesn't make a difference, even if repeated 10000000 times. The repetition only gives a more precise estimate of the average time spent on that particular input.

You would need to feed the algorithm with increasingly large inputs. If memory allows, that would eventually bring us to an input size for which the O(nlogn) algorithm takes more time than the O(n) algorithm.

In this case, I found that the input size had to be really large in comparison with available memory, and I only barely managed to find a case where the difference showed:

import random
import string
import timeit

def shuffled_string(s):
    # return a random permutation of s (avoid shadowing the built-in str)
    lst = list(s)
    random.shuffle(lst)
    return "".join(lst)

def random_string(size):
    return "".join(random.choices(string.printable, k=size))

str1 = random_string(10000000)
str2 = shuffled_string(str1)
print("start")
print(timeit.timeit(lambda: is_perm(str1, str2), number=5))
print(timeit.timeit(lambda: is_perm2(str1, str2), number=5))

After the initial set up of the strings (which each have a size of 10 million characters), the output on repl.it was:

54.72847577700304
51.07616817899543

The reason why the input has to be so large to see this happen, is that sorted is doing all the hard work in lower-level, compiled code (often C), while the second solution does all the looping and character reading in Python code (often interpreted). It is clear that the overhead of the second solution is huge in comparison with the first.
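
To get a feel for that constant-factor gap at a small input size, here is a rough illustrative sketch (the string size and repetition count are arbitrary choices, not taken from the measurements above): it times sorted against a plain Python counting loop over the same string.

import timeit

s = "helloworld" * 1000  # 10,000 characters

def count_chars(s):
    # the same kind of per-character loop as in is_perm2, executed by the interpreter
    counts = {}
    for ch in s:
        counts[ch] = counts.get(ch, 0) + 1
    return counts

print(timeit.timeit(lambda: sorted(s), number=1000))       # sorting runs in C
print(timeit.timeit(lambda: count_chars(s), number=1000))  # counting runs in Python bytecode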

Improving the second solution

Although not your question, we could improve the implementation of the second algorithm by relying on Counter from the standard library's collections module:

from collections import Counter

def is_perm3(str1, str2):
    return Counter(str1) == Counter(str2)

With the same test set up as above, the timing for this implementation on repl.it is:

24.917681352002546
trincot

> Assuming that dictionary lookup and both dictionary updates have constant time complexity,

A Python dictionary is a hash map, so a dictionary lookup or update costs O(n) in the worst case. The average-case time complexity of is_perm2 is O(n), but its worst-case time complexity is O(n^2).

If you want a guaranteed O(n) time complexity, use a list (not a dictionary) to store the character frequencies. You can convert each character to its ASCII code and use it as an index into a Python list of counts.
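
A minimal sketch of that idea, assuming the input strings are ASCII-only (the table size of 128 and the names char_freqs and is_perm4 are illustrative choices, not from the answer):

def char_freqs(s):
    # one counter per ASCII code point; assumes ASCII-only input
    counts = [0] * 128
    for ch in s:
        counts[ord(ch)] += 1
    return counts

def is_perm4(str1, str2):
    if len(str1) != len(str2):
        return False
    return char_freqs(str1) == char_freqs(str2)

print(is_perm4("helloworld", "worldhello"))  # True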

Code Plus
  • That seems completely the wrong way round. Time complexity of dictionary is O(1), and of List is O(n). – see sharper Aug 20 '21 at 02:42
  • I mean python arrays. – Code Plus Aug 20 '21 at 02:49
  • Please visit [here](https://www.geeksforgeeks.org/check-if-two-strings-are-permutation-of-each-other) and see method 2 or 3 for complete solution. – Code Plus Aug 20 '21 at 02:55
  • I think I see what you're suggesting - an array with items for each ASCII code point. Yes? – see sharper Aug 20 '21 at 03:05
  • Yes, Exactly an array of frequences for each ascii code. – Code Plus Aug 20 '21 at 03:13
  • @CodePlus I considered the worst case to be of no practical significance. As the second comment to my original question shows, the reason for the longer execution time is not related to the algorithm per se, but to the fact that n is the length of the input strings and not the number of repetitions. – Moritz Wolff Aug 20 '21 at 08:00