Waxy.org
Waxy.org is the sandbox of Andy Baio, a journalist/programmer living in Portland, Oregon. I'm the CTO of Kickstarter, created Upcoming.org, and some other stuff too.

Contact Me: log@waxy.org or waxpancake on AIM

Boing Boing Statistics

Posted Jan 21, 2005

Today is the fifth anniversary of Boing Boing's relaunch, the day they switched from a traditional webzine to uber-blog.

To commemorate the birthday, the gang released a complete dump of every Boing Boing entry for free download. I'm hosting the torrent on my tracker, and I pulled together some statistics. (Is anyone surprised?)

Try my new Boing Boing Statistics. Most notably, use the keyword tracker to search the popularity of keywords over time, broken down by author. This is outstanding for looking at trends, or the uniquely quirky obsessions of each author.

Let me know if you have any suggestions, or have found other uses for the data dump.

January 22, 2005: By request, here's a direct download of the 5-year archive.

13 Comments (Add Yours)

Jan 21, 2005
3:01 PM  
Jonathan wrote:

is there any way to use it to get Cory to blog more often? Xeni's taking over that place, and not for the better...


Jan 21, 2005
3:54 PM  
Philipp Lenssen wrote:

The "Zeitgeist" Flash is very interesting and implemented very smoothly. I saw "Terror" shows a peak at around 9/11, as one could expect. "Podcasting" must be a new trend as it's only been picked up this year.

My use of this data dump, so far -- for nostalgia's sake, I extracted interesting quotes on Google:
http://blog.outer-court.com/archive/2005-01-21-n42.html


Jan 21, 2005
3:58 PM  
John Dowdell wrote:

I was housecleaning last month and found the first eight issues of the paper "Boing Boing" fanzine Mark did in the 80s. Many of these were handmade, with glued-on artwork. I'm not sure what to do with this, though...?


Jan 21, 2005
4:02 PM  
Andy Baio wrote:

If it were me, I'd wrap them in plastic and pray over them nightly. (Like I do with my complete print run of Might.)


Jan 22, 2005
10:04 AM  
Bud Landry wrote:

I'm probably ignorant, but I ignore Bit Torrent feeds because I'm on dial-up, and I think I will be the weakest link in any such torrent, plus I think that it will both take longer, and it will hog my limited bandwith beyond the typical download time from a single server, due to my apparent promise to pass it on. It is all just too viral for me...

Anyone have a regular vanilla download?


Jan 22, 2005
11:36 AM  
Andy Baio wrote:

Bud: Here's a direct download of the file.


Jan 22, 2005
1:54 PM  
Angie wrote:

Thanks for sharing this! :)


Jan 24, 2005
1:53 PM  
dave bug wrote:

It's interesting, in a completely meaningless way, to see the strange mid-life spike in Cory posts that used the word "boing."


Jan 24, 2005
3:03 PM  
Adam wrote:

Interesting in a similarly meaningless way:


gender and posts using 'sex'
gender and posts using 'porn'


(Pointed out by a sociology-centric friend)


Jan 24, 2005
4:23 PM  
Steve wrote:

Oh man. Waxy, if you ever decide to get into copyright violation in a big way, that Might archive would be awfully tasty; I only ever picked up a few issues (and "Shiny Adidas Tracksuits").


Jan 26, 2005
5:11 PM  
Eric Rodenbeck wrote:

I'd love to see some exposure as to what some of the most frequent terms are - BBC is a good one, but as it stands you pretty much have to know what to search for.


Feb 4, 2005
2:32 AM  
Dee wrote:

Does the stats page run off the MT export format alone? If so - could you open source the page source and the stats generation code?
This could be fun to do for other MT based blogs as well.


Feb 4, 2005
7:02 AM  
Andy Baio wrote:

No, I actually imported the exported entries into a new MT blog. Then I simply queried the MovableType database in MySQL directly. There isn't much to it.


 

Leave a comment





Waxy Links
Ads via The Deck
November 20, 2009
Regretsy gets a book deal — the anonymous author turned out to be April Winchell, collector of audio oddities
Google Chrome OS Demo — a world without a local filesystem and apps; also, the Chrome UI concept video (via)
Patrick Moberg's Internet Vices — funny, Tumblr feels more like beer than wine to me
Charlotte Gainsbourg and Beck's "Heaven Can Wait" — Keith Schofield's surreal video and insane treatment were inspired by FFFFOUND and Reddit, but maybe too explicitly (via)
November 19, 2009
YouTube adds machine-translated automatic captions — starting with some partner channels, but auto-timing is available to everyone today
Microsoft tries to patent Edward Tufte's sparklines — they were recently added to Excel
Leonard Lin's Retweet Avatars for Greasemonkey — a subtle change, but a big improvement
Web-ops god John Allspaw leaves Flickr to join Etsy — he's the last of the original Ludicorp team to go (via)
November 18, 2009
Laptop Steering Wheel Desk — don't miss the product photos
Interview with Ralph Eggleston, Pixar's production designer on WALL-E — from last February, but new to me; I didn't know the Axiom had three passenger classes
NSFW: Animated pixel-art video for Flair's "Trucker's Delight" — warning: very offensive and sexist, but the attention to 16-bit detail by director Jérémie Perin is incredible
NY Observer on Anil Dash's new government 2.0 incubator project — Expert Labs debuted at Web 2.0 today, funded with a $500k grant from the MacArthur Foundation
November 17, 2009
Google's Dan Morrill explains how the Droid autofocus breaks every 24.5 days — this gets second-place for quirkiest Android bug (via)
Conan O'Brien and Andy Richter on Zach Galifianakis' Between Two Ferns — his style of comedy usually makes me uncomfortable, but this made me laugh
The Pirate Bay shuts down their tracker for good — they're switching to DHT instead
November 16, 2009
How Darren at Link Machine Go found Belle de Jour's identity five years ago — Brooke was part of the early UK blog scene
ICU64, real-time visualization of Commodore 64 memory — the developer also posted videos of Paradroid and Boulder Dash (via)
Russell Davies on pretending and "barely games" — his SAP prototype looks like great ambient fun (via)
NYT Magazine on the indie gaming movement — nothing new here, but good overview with a wonderful closing anecdote from Cactus
Tim O'Reilly on the pending War for the Web — "more than that, it's a war against the web as an interoperable platform"
November 14, 2009
Jason Scott rounds up Geocities' top 10 most popular MIDI files — along with a torrent with 51,000 MIDIs rescued by Archive Team
Matt Haughey on the discovery of his brain tumor, treatment, and the Internet's response — there were about 1,000 #mathowielove tweets in 24 hours
Belle de Jour reveals herself after six year of anonymity — only six people in the world knew, she only told her parents yesterday (via)
Paul F. Tompkins debates comedy ethics with Improv Everywhere's Charlie Todd — great discussion, and it's hard not to see where both are coming from (via)
November 13, 2009
Rogue Amoeba stops iPhone app development after App Store idiocy — I'm with Marco, the only fix is allowing external apps, but it's unlikely (via)
Numb3rs on IRC — "Luckily, I speak l33t."
Prank War 8: The Skydiving Prank — hard to say if life-threatening situations are funnier than public humiliation
301 Works, Internet Archive works to preserve URL shortener data — the shorteners will provide regular backups and hand over data on closure, though TinyURL's conspicuously missing
November 12, 2009
Quizipedia — simple game with trivia scraped from Wikipedia entries
Kill Screen, funding a new art magazine about videogames — sounds like the English analogue of Amusement I was hoping for

Andy Baio lives here. Some rights reserved, for your pleasure.