[AWFFULL] no results some months
Steve McInerney
steve at stedee.id.au
Tue May 18 10:18:54 EST 2010
On Mon, 2010-05-17 at 14:43 -0400, Zack Galbreath wrote:
> The following log causes all of my April 2010 data to disappear. It's
> only three lines long.
>
> http://public.kitware.com/stats/awffull/test/bad.log
>
>
> I can't reproduce this bug if I apply these three lines one at a time,
> so I think you're right that it has something to do with the order
> that awffull is processing the log entries.
Oh excellent work! And yes, my guess was right. The problem is that log
entries aren't written in the order the requests arrive, but in the order
they finish. A few slower pages are enough for time sorting to go out the
window.
> Any suggestions on what I should do to overcome this issue?
In *this* case you have two easy options, plus a third.
1. as the bad entry comes from an IPv6 address, we can simply apply an
external prefilter (eg egrep) to strip all entries with IPv6 addresses.
2. do a time sort on the logs in the first place. This is easy to do;
but *can* be painfully slow.
http://www.mrunix.net/webalizer/faq.html has a great example:
$ sort -t ' ' -k 4.9,4.12n -k 4.5,4.7M -k 4.2,4.3n -k 4.14,4.15n \
    -k 4.17,4.18n -k 4.20,4.21n
as a full sort string.
3. upgrade the version of awffull you use. I'm quite confident that bug
was eliminated in later versions. A classic nasty edge case, unfortunately.
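As a rough sketch of options 1 and 2 together: this assumes the log is in
the usual Apache common/combined format, with the client address as the
first field and the timestamp as the fourth. The function names and file
names are only illustrative, and the colon test is a crude stand-in for a
real IPv6 match.

```shell
# Sketch only. Assumes field 1 is the client address and field 4 is the
# timestamp, e.g. [17/May/2010:14:43:00

# Option 1: drop every entry whose first field contains a colon.
# IPv4 addresses and hostnames have no colon, IPv6 addresses always do.
strip_ipv6() {
    grep -v '^[^ ]*:' "$@"
}

# Option 2: sort by year, month, day, hour, minute, second, using the
# key string from the webalizer FAQ. LC_ALL=C keeps the month-name
# comparison (M) on the English abbreviations used in the log.
sort_by_time() {
    LC_ALL=C sort -t ' ' -k 4.9,4.12n -k 4.5,4.7M -k 4.2,4.3n \
        -k 4.14,4.15n -k 4.17,4.18n -k 4.20,4.21n "$@"
}
```

Typical use would be something like
$ strip_ipv6 access.log | sort_by_time > access.sorted.log
and then feeding access.sorted.log to awffull. On a large log the sort is
the slow part, so filter first.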
Cheers!
- Steve