According to Wikipedia (http://en.wikipedia.org/wiki/Loop_tiling) and many other sources, loop tiling is a loop optimization technique that helps take advantage of the cache (locality of reference). The iteration space is divided into a number of blocks so that the data touched within one block stays in the cache while it is being reused.
Can somebody explain what difference tiling makes in the 1D case (the Overview section of the link above)? To my mind, the same number of cache misses will happen in both the original loop and the tiled one.
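To make the question concrete, here is a minimal C sketch of how I read the 1D case. The function names `sum_plain` and `sum_tiled`, the summing loop body, and the block size parameter `b` are my own assumptions for illustration; they are not taken from the article.

```c
#include <assert.h>
#include <stddef.h>

/* Plain 1D traversal: reads a[0], a[1], ..., a[n-1] once each. */
long sum_plain(const int *a, size_t n) {
    long s = 0;
    for (size_t i = 0; i < n; ++i)
        s += a[i];
    return s;
}

/* The same traversal split into blocks of b elements (hypothetical
   block size). The outer loop steps from block to block, the inner
   loop walks the elements inside the current block. */
long sum_tiled(const int *a, size_t n, size_t b) {
    assert(b > 0);  /* a zero block size would never advance j */
    long s = 0;
    for (size_t j = 0; j < n; j += b) {
        size_t end = (j + b < n) ? j + b : n;  /* clamp the last block */
        for (size_t i = j; i < end; ++i)
            s += a[i];
    }
    return s;
}
```

As far as I can tell, both functions read the elements of `a` in exactly the same order, and each element is read only once, which is why I would expect the same cache behaviour from both.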