in class, we were assigned to parse through an access log and record all the successful attempts. In researching the Apache web server access log I ran into several instances, roughly 3/100,000 logs that the format was incorrect. The log would appear as, for example, 96.45.3.2 - - [14/Mar/2011:00:12:33 -0400] "GET webpage.html HTTP/1.1" 400 236 - -
I am not asking on how to parse through the file, I'm just curious as to what happens if the format isn't finished? Did the user's browser fail? Power outage? ect.
Also, when parsing through the file, I noticed that in one specific instance the index [8] (which is supposed to be the 200,300,400,500 codes) there was a mistake where the index[8] would be 1.1".
Any ideas?