Waxy.org
Waxy.org is the sandbox of Andy Baio, a writer and tech entrepreneur in Portland, OR. I work with Expert Labs, helped build Kickstarter, founded Upcoming, made an album, and other stuff too.

Contact Me: Email, AOL IM, or follow me on Twitter.

Sawmill Log Analyzer

Posted Jan 31, 2003

For my job, I recently researched and reviewed almost every major web log analyzer on the market. Almost every package provided the same basic level of detail, summaries of daily/monthly/weekly usage and aggregate statistics.

But we needed more detail. Much more detail. We wanted to track the paths of any user (authenticated or not) through the site, to see which pages they looked at, in which order, and for how long. We wanted to drill down to any time period, seeing who visited the site during that period and what they were looking for. Here's what I found...

Nearly all the commercial packages I reviewed had the same basic information as their popular open-source counterparts -- AWStats, Analog and Webalizer. The biggest differences were mostly skin-deep: ultra-fancy (but no more useful) 3-D graphs and pretty menus.

Take a look at the sample reports for WebTrends, Urchin (user/pass: english/english), HitBox Pro, 123LogAnalyzer, NetTracker, Wusage, Summary, and LiveStats.

As you can see, all of the above are fundamentally the same: static summary reports. Only two packages were able to provide the individual clickthroughs we need. The first, Funnel Web Analyzer, provides the bare minimum of clickthrough detail, doesn't run on Unix servers, and costs $1000.

The second choice was the best, by far: Flowerfire's Sawmill (see a sample report). It does everything we need, is extremely customizable, and also has the single-easiest Unix installation and configuration process I've ever seen. Custom log filters, dynamic filtering of data, static pre-generation of reports, and on and on. It's more powerful than any competitor by a factor of 10. It runs on Mac, Windows and Unix, and requires no additional software to be installed (including a web server or database).

Plus, it's cheap! Or free, if you help them test it for a couple hours. If you've written or know about a better log analyzer, I want to hear about it.

(Thanks to Leonard for originally mentioning it to me. Honorary mention goes to Clicktracks, which sports the most unique interface I've seen: a browser-like interface with statistics overlaid on the web page, perfect for usability testing.)

7 Comments (Add Yours)

Feb 2, 2003
6:53 AM  
Aaron wrote:

Have you ever heard of TeaLeaf?

It's not really a log analyzer, and it costs a lot more than Sawmill or probably any of the others that you looked at, but I was completely impressed when our company had it demoed.

It captures all the information going from the webserver to the client, and allows you to replay sessions and view exactly what the user saw. It indexes everything on a seperate server (more expense), and has some extensive searching/filtering capabilities.

If you are building large-scale web applications/e-commerce sites. I would think it to be an invaluable debugging tool.


Feb 2, 2003
1:10 PM  
Konstantinos wrote:

And while we're at it; the 'Sawmill Pricing' page says it costs "$99 for an individual, $399 for a small organization". A similar pricing scheme goes for many other apps too (e.g. WS-FTP).

I've always had that question: how does the creator know the customer doesn't lie to him?

For example, a representative of a small organization may appear and claim he's bying Sawmill for himself. That way the company is saving $300.

I'm sure I'm missing something here..

PS: "Students can purchase Sawmill at a 75% discount (i.e. for 25% of the organization price, or US$99.75)" while it costs "$99 for an individual". Isn't that a bit weird? (A student paying more than an individual? I guess all students will try to hide their identity and claim they're just individuals...)


Feb 2, 2003
2:21 PM  
Cameron wrote:

I downloaded and played with Clicktracks, and it really was pretty cool. But $500?!? I was thinking about buying a copy until I saw the price. Why are all log analyzing programs so effing expensive? I think I'll just stick with Sawmill.

By the way, great post Andy. Thanks for the research.


Feb 2, 2003
8:48 PM  
Andy wrote:

Konstantinos: I think the idea is that students can get a multi-user license for the price of a single-user license, if they want it. And to answer your other question, it's just the honor system. An ethical company will register the appropriate license, rather than risk getting caught by a disgruntled employee or an SPA audit.


Feb 3, 2003
10:51 AM  
leonard wrote:

Sawmill is great, the biggest weakness (besides the interface / need to rebuild db's over and over while setting up) is that it's single threaded. Which is a mighty pain in the butt when dealing with big logs. Where are the MPI/PVM traffic analyzers is what I want to know.

[If you need hard core analysis and you can afford to pay a magnitude or two more and you don't mind shipping your data out for processing, Omniture's SiteCatalyst has some very slick features and a polished interface.]


Feb 12, 2003
5:10 AM  
stavrosthewonderchicken wrote:

(I'm guessing) because I'm in Korea, the clicktacks site redirected me without asking to their Japanese site, with no way that I could find to get to English.

That was annoying. I'm just sayin'.


May 15, 2004
3:00 AM  
brenda wrote:

Ha, I'm a little late in replying, but I haven't found one yet that suits my needs, so my husband made a program that shows who's on my site at the moment and what all they've viewed, complete with IP and precisely what time in Portugal. He made another that show what are the most viewed archives. We recently switched from our own server to one in the states that comes with an archaic version of AW and it severly sucks.


 

Leave a comment





Waxy Links
Ads via The Deck
February 10, 2012
Chimp displays stunning visual memory skills — try it yourself, little monkey (via)
MG Siegler on VEVO employees pirating a football game — "Why would VEVO pirate content? Because it was easier than getting it legally." (via)
GQ on Terry Thompson and last year's exotic animal massacre in Ohio — Ohio's lax laws on personal zoos leads to tragic results (via)
Game developers react to Double Fine's $1M fundraising success — "did you hear? the death-rattle of a million middle men"
DataEast's Movie Opinion Meter — using the awesome dataset from Information Is Beautiful's Hollywood budgets design challenge
Kickstarter's craziest 24 hours — a minute-by-minute breakdown of the most insane day in Kickstarter history
February 9, 2012
Horse_ebookmarklet — turn any site into Horse_ebooks
The Puzzlejuice Emails — in-depth look at the evolution of the visual design of one of my favorite iOS games
February 8, 2012
Double Fine's Kickstarter project to make a new point-and-click adventure — best project video ever; I backed it so hard
Interactive ASCII fluid dynamics animation — based on this JS simulation (via)
What Popular iPhone/Android Apps Know/Transmit About You — ignore the awful visualization and skip to the table; Angry Birds sends your contacts to third parties!?
Path apologizes, deletes user address books — they never should've done it in the first place, but this is the right way to handle it
BBC tracks down an Internet troll — as the Daily Dot points out, he's more of a racist asshole than a troll (via)
February 7, 2012
PressPausePlay — stylish documentary on the digital media revolution of the last decade
February 6, 2012
Restored Disneyland footage from 1957 — only open for two years in this video
Robot readable world — found footage from machine-vision tests
February 3, 2012
Avería, the average font — preview them all (via)
February 2, 2012
How and why Mark Jaquith became an atheist — gripping personal story of the life-affirming shift from faith to evidence (via)
Where's the Pixel? — find and click on the black pixel; you may need to clean your screen first (via)
ARTINFO on the chilling effect of the Prince v. Cariou copyright ruling — the journalist mentions me and Kind of Bloop
Darkness — a brilliant 24-hour comic by French cartoonist Boulet (via)
January 31, 2012
Nano quadrotors flying in formation — don't miss the figure 8 pattern at the end (via)
Bootstrap 2 released — here's the announcement
Jeff Atwood on the risks of unmoderated communities — left to their own devices, popular online communities get taken over by cheap, easy gags (via)
How and why J.D. Roth sold Get Rich Slowly — interesting tale of a founder selling his site, but unable to share the details for years
Yahoo lays off in-house Flickr support team — from what I hear, it was done with 10 minutes' notice to Flickr management
Mapstalgia — videogame maps drawn from memory
January 30, 2012
Shit Programmers Say — strikingly similar to Shit Rocks Say
Impressions of Corporate Logos by a 5-Year-Old — "a cheetah, a cheetah, a cheetah"
Bellbot — web app that beeps when you get new signups or sales

Andy Baio lives here. Some rights reserved, for your pleasure.