Kadane's algorithm to find subarray with the maximum sum

Question

I have the following implementation of Kadane's algorithm to solve the problem of the maximum subarray of an array:

public static decimal FindBestSubsequence
    (this IEnumerable<decimal> source, out int startIndex, out int endIndex)
{
    decimal result = decimal.MinValue;
    decimal sum = 0;
    int tempStart = 0;

    List<decimal> tempList = new List<decimal>(source);

    startIndex = 0;
    endIndex = 0;

    for (int index = 0; index < tempList.Count; index++)
    {
        sum += tempList[index];
        if ((sum > result) || 
            (sum == result && (endIndex - startIndex) < (index - tempStart)))
        {
            result = sum;
            startIndex = tempStart;
            endIndex = index;
        }
        else if (sum < 0)
        {
            sum = 0;
            tempStart = index + 1;
        }
    }

    return result;
}

It fails when I use a sequence that starts with a negative number like -1, 2, 3 giving a result of 4, [0,2] instead of 5, [1,2].

For the life of me that I cannot find where the error is. Maybe its a defect on the algorithm design?

Thanks in advance.

Gene Belitski · Accepted Answer · 2012-03-28T04:18:59.053

Your initial implementation suffered from unnecessarily complicated and partially wrong checks within the main scan cycle. These checks are two:

if greater intermediate sum is found, store it constituents as a temporary result;
independently, if sum got negative, reset it to 0 and prepare to build a new sequence from next scan position.

Refactored FindBestSubsequence method implementation is listed below:

public static decimal FindBestSubsequence (this IEnumerable<decimal> source, out int startIndex, out int endIndex)
{
    decimal result = decimal.MinValue;
    decimal sum = 0;
    int tempStart = 0;

    List<decimal> tempList = new List<decimal>(source);

    startIndex = 0;
    endIndex = 0;

    for (int index = 0; index < tempList.Count; index++)
    {
        sum += tempList[index];
        if (sum > result)
        {
            result = sum;
            startIndex = tempStart;
            endIndex = index;
        }
        if (sum < 0)
        {
            sum = 0;
            tempStart = index + 1;
        }
    }

    return result;
}

Now not only for -1,2,3 the code above produces correct answer 5,[1,2] but also it correctly processes arrays of all negative numbers without any extra code: entering -10,-2,-3 will return -2,[1,1].

Perfect. I just took an already existent implementation in C that seemed standard and ported it to C#. Yours pass all my unit tests so I think its the best option. Thanks! — Ignacio Soler Garcia, Mar 28 '12 at 08:13
Additionally, if you are refactoring it, I would iterate the IEnumerable directly, there is no need to create a copy of the list. And passing multiple 'out' arguments is usually a bad practice, a custom return type would be better. — vgru, Mar 28 '12 at 16:41
Agree with the copy of the list. Don't agree with creating a new return type as in this case seems pretty obvious the usage of start index and end index. — Ignacio Soler Garcia, Mar 28 '12 at 19:44

score 3 · Answer 2 · edited Apr 03 '14 at 03:22

3

In your example you always have sum > result even if sum<0 in the first iteration of The loop because 0 > decimal.MinValue.

So you never go to your second case.-

You need to change the first if by adding a condition sum > 0:

if ((sum >0 ) & ((sum > result) || 
    (sum == result && (endIndex - startIndex) < (index - tempStart))))
{
    ...
}
else if (sum < 0)
{
    ...
}

Update:

As explained in my comment you can just change the initialisation of result to 0 :

decimal result = 0;

From wikipedia :

This subarray is either empty (in which case its sum is zero) or consists of one more element than the maximum subarray ending at the previous position

Therefore if the array contains only negative numbers the solution is an empty subarray with sum 0.

edited Apr 03 '14 at 03:22

abatishchev

98,240
88
296
433

answered Mar 27 '12 at 13:22

Ricky Bobby

7,490
7
46
63

If I do this change then the algorithm fails with sequences with all the values negative. – Ignacio Soler Garcia Mar 27 '12 at 13:25
1

You can add a case for this situation, and return 0 with an empty list, or if you don't want to return 0 return the max of the list. – Ricky Bobby Mar 27 '12 at 13:26
I agree, but that means that the Kadane's algorithm is faulty? – Ignacio Soler Garcia Mar 27 '12 at 13:32
No, and I think that initializing result to 0 will give you the same output as adding the condition on the first if (my answer), and it would work. – Ricky Bobby Mar 27 '12 at 13:35
Well, no implementation I've found of the algorithm includes this conditions (nor the pseudocode of Kadane's). – Ignacio Soler Garcia Mar 27 '12 at 13:38
1

@SoMoS: Yeah, Ricky is right, unmodified Kadane's algoritm is simply not suitable for negative numbers, since it starts from zero each time. – vgru Mar 27 '12 at 14:00

score 1 · Answer 3 · answered Mar 27 '12 at 13:36

1

Change this line:

decimal result = decimal.MinValue;

to

decimal result = 0;

answered Mar 27 '12 at 13:36

vgru

49,838
16
120
201

Thanks makes the algorith return 0 when all the values are negative. With an input of -1, -2, -3 the best subarray is -1. – Ignacio Soler Garcia Mar 27 '12 at 13:39
@SoMoS: that's right, I just compared your code to the Wikipedia article you posted. This also means that their python example suffers from the same problem. – vgru Mar 27 '12 at 13:42
1

(wikipedia) Kadane's algorithm consists of a scan through the array values, computing at each position the maximum subarray ending at that position. This subarray is either empty (in which case its sum is zero) or consists of one more element than the maximum subarray ending at the previous position. – Ricky Bobby Mar 27 '12 at 13:44

score 0 · Answer 4 · edited May 23 '17 at 12:24

Built upon Gene Belitski's answer and comments:

    public static void Main()
    {
        var seq = new[] { -10M, -2M, -3M };
        var stuff = seq.FindBestSubsequence();

        Console.WriteLine(stuff.Item1 + " " + stuff.Item2 + " " + stuff.Item3);
        Console.ReadLine();
    }

    public static Tuple<decimal, long, long> FindBestSubsequence(this IEnumerable<decimal> source)
    {
        var result = new Tuple<decimal, long, long>(decimal.MinValue, -1L, -1L);

        if (source == null)
        {
            return result;
        }

        var sum = 0M;
        var tempStart = 0L;
        var index = 0L;

        foreach (var item in source)
        {
            sum += item;
            if (sum > result.Item1)
            {
                result = new Tuple<decimal, long, long>(sum, tempStart, index);
            }

            if (sum < 0)
            {
                sum = 0;
                tempStart = index + 1;
            }

            index++;
        }

        return result;
    }

score 0 · Answer 5 · answered Mar 27 '12 at 13:37

For each position you should take the maximum of the value there (from the original sequence) and your sum as you have written it. If the original number is bigger, then it's better to start summing 'from beginning', i.e. sum = max(sum+tempList[index],tempList[index]); Then you won't need the case for sum < 0 at all.

score 0 · Answer 6 · answered Mar 27 '12 at 16:12

At the end this is how I corrected the algorithm to handle all the scenarios, just in case it helps to someone:

    public static decimal FindBestSubsequence (this IEnumerable<decimal> source, out int startIndex, out int endIndex)
    {
        decimal result = decimal.MinValue;
        decimal sum = 0;
        int tempStart = 0;

        List<decimal> tempList = new List<decimal>(source);

        if (tempList.TrueForAll(v => v <= 0))
        {
            result = tempList.Max();
            startIndex = endIndex = tempList.IndexOf(result);
        }
        else
        {
            startIndex = 0;
            endIndex = 0;

            for (int index = 0; index < tempList.Count; index++)
            {
                sum += tempList[index];

                if (sum > 0 && sum > result || (sum == result && (endIndex - startIndex) < (index - tempStart)))
                {
                    result = sum;
                    startIndex = tempStart;
                    endIndex = index;
                }
                else if (sum < 0)
                {
                    sum = 0;
                    tempStart = index + 1;
                }
            }
        }

        return result;
    }

Thanks to Ricky Bobby and Groot to point me in the right direction. — Ignacio Soler Garcia, Mar 27 '12 at 16:12
The code above still allows for few important improvements, such as removal of unnecessary special case processing arrays of all negatives. You may check my implementation for refactored `FindBestSequence`. — Gene Belitski, Mar 28 '12 at 04:27

Kadane's algorithm to find subarray with the maximum sum

6 Answers6

Linked