If we look at source code of extension method Any
we will se, that it always uses enumerator:
public static bool Any<TSource>(this IEnumerable<TSource> source)
{
if (source == null)
throw Error.ArgumentNull(nameof (source));
using (IEnumerator<TSource> enumerator = source.GetEnumerator())
{
if (enumerator.MoveNext())
return true;
}
return false;
}
I think, is't it better (for perfomace) to check Count
property if collection is IList
like in SingleOrDefault
method for example:
public static TSource SingleOrDefault<TSource>(this IEnumerable<TSource> source)
{
if (source == null)
throw Error.ArgumentNull(nameof(source));
IList<TSource> sourceList = source as IList<TSource>;
if (sourceList != null)
{
switch (sourceList.Count)
{
case 0:
return default(TSource);
case 1:
return sourceList[0];
}
}
else
{
//...
}
throw Error.MoreThanOneElement();
}
I say, it can looks like this:
private static bool Any<TSource>(IEnumerable<TSource> source)
{
if (source == null)
throw new ArgumentNullException(nameof(source));
IList<TSource> sourceList = source as IList<TSource>;
if (sourceList != null)
{
return sourceList.Count != 0;
}
using (IEnumerator<TSource> enumerator = source.GetEnumerator())
{
if (enumerator.MoveNext())
return true;
}
return false;
}
I wrote a benchmark to test it:
namespace AnyTests
{
class Program
{
static void Main(string[] args)
{
BenchmarkRunner.Run<Test>();
}
}
public class Test
{
private readonly List<int> list1 = new List<int>(new[] { 1, 2, 3, 4, 5 });
private readonly IEnumerable<int> list2 = GetCollection();
private static IEnumerable<int> GetCollection()
{
yield return 1;
}
[Benchmark]
public void TestLinqAnyList()
{
Enumerable.Any(list1);
}
[Benchmark]
public void TestNewAnyList()
{
NewAny(list1);
}
[Benchmark]
public void TestLinqAnyEnumerable()
{
Enumerable.Any(list2);
}
[Benchmark]
public void TestNewAnyEnumerable()
{
NewAny(list2);
}
private static bool NewAny<TSource>(IEnumerable<TSource> source)
{
if (source == null)
throw new ArgumentNullException(nameof(source));
IList<TSource> sourceList = source as IList<TSource>;
if (sourceList != null)
{
return sourceList.Count != 0;
}
using (IEnumerator<TSource> enumerator = source.GetEnumerator())
{
if (enumerator.MoveNext())
return true;
}
return false;
}
}
}
Results show that it's about two times better:
// * Summary *
BenchmarkDotNet=v0.10.13, OS=Windows 10 Redstone 3 [1709, Fall Creators Update] (10.0.16299.192)
Intel Core i7-7700 CPU 3.60GHz (Kaby Lake), 1 CPU, 8 logical cores and 4 physical cores
Frequency=3515624 Hz, Resolution=284.4445 ns, Timer=TSC
[Host] : .NET Framework 4.7.1 (CLR 4.0.30319.42000), 32bit LegacyJIT-v4.7.2600.0
DefaultJob : .NET Framework 4.7.1 (CLR 4.0.30319.42000), 32bit LegacyJIT-v4.7.2600.0
Method | Mean | Error | StdDev |
---------------------- |---------:|----------:|----------:|
TestLinqAnyList | 26.80 ns | 0.1382 ns | 0.1154 ns |
TestNewAnyList | 12.75 ns | 0.0480 ns | 0.0426 ns |
TestLinqAnyEnumerable | 18.03 ns | 0.0947 ns | 0.0886 ns |
TestNewAnyEnumerable | 23.51 ns | 0.0913 ns | 0.0762 ns |
For IList
it's about twice better, for IEnumerable
it's about 20% worse.
So, the question: what the reason to use optimization in SingleOrDefault
method and do not use it in Any
one?