Waxy.org
Waxy.org is the sandbox of Andy Baio, an independent journalist and programmer living in Portland, Oregon. I created Upcoming.org and some other stuff too.

Contact Me: log@waxy.org or waxpancake on AIM
« April 2008 | May 2008 Archives | June 2008 »

BBC Two's "The Net," Episodes 2 and 5 from 1994

Posted May 30, 2008 (Updated Jun 3, 2008)

Back in March, I posted the first episode of a BBC Two series called The Net from 1994. It's a great time capsule of how the media portrayed cyberculture in the early 1990s, very much like a TV version of early Wired Magazine.

Thanks again to Martin Brewer, who also contributed the Horizon show, here are two more episodes from the first season of The Net from 1994.

Continue reading (492 more words)...
8 comments

Star Wars Kid: The Data Dump

Posted May 21, 2008 (Updated Jun 14, 2008)

This Friday, I'll be speaking at the Webvisions conference in Portland about Internet memes, how they spread, and how their distribution's changed over time.

As part of that research, I've been digging into my original server logs from the Star Wars Kid debacle, five years after I played a major role in what some say is the biggest viral video of all-time.

Be warned, this is more detail than you'll ever want about the origins of the Star Wars Kid meme and how it spread. You don't care about this level of detail, but I'm writing this all down so that I never have to think about it again.

In addition, I've decided to release the first six months of server logs from the meme's spread into the public domain — with dates, times, IP addresses, user agents, and referer information. (Download it below.)

Early Origins

Like I mentioned in my original entry, the video was first released by Ghyslain's schoolmates to Kazaa on April 19, 2003 with the original filename "ghyslain_razaa.wmv." Within three days, it was being passed around in the offices of Raven Software in Madison, Wisconsin, where a game developer named Bryan Dube posted it on his personal website on April 22. Two days later, he created the first Star Wars Kid remix, adding lightsabers and sound effects in a new video titled "TheLastHope.avi."

On April 27, a mostly-NSFW online community called Sensible Erection linked to the video on Bryan's website. Later that evening, an SE user cross-posted it to a private file-sharing community I belong to with the new filename "star_wars_guy.wmv." It quickly became the most popular file on the site, which is where I found it the following day, April 28 at 7:52pm.

On April 29, I renamed it Star_Wars_Kid.wmv and posted it to my site at 4:49pm &mdash inadvertently giving the meme its permanent name. (Yes, I coined the term "Star Wars Kid." It's strange to think it would've been "Star Wars Guy" if I was any lazier.) An hour later, Scott Gowell becomes the first person to link to the video.

From there, for the first week, it spread quickly through news site, blogs and message boards, mostly oriented around technology, gaming, and movies. Throughout the life of the meme, most of the referers are blank, suggesting people were primarily sending the links by email or instant message.

The chart below shows the distinct top-level domains that appeared in the referral logs grouped by day.

It's worth noting that the majority of sites sent less than 10 referers in that first month, and 21% of domains referred only one user. (Note: The chart below is on a logarithmic scale for both axes.)

Mainstream News Coverage

Here's some of the highlights from the mainstream media coverage. The New York Times was the first major paper to report on it, almost a week after I tracked Ghyslain down, Jish and I interviewed him for the first time, and we started the fundraiser.

May 19, New York Times
May 19, Wired News
May 20, Public Radio International's "The World" (radio program)
May 20, Globe and Mail
May 20, National Post
May 23, The Mirror UK

Jun 6, LA Times

Jul 4, The Independent UK
Jul 12, The Age
Jul 23, Wired News
Jul 25, BBC News
Jul 31, NPR w/Tavis Smiley (radio interview)

Aug 21, USA Today, syndicated Associated Press article
Aug 25, NBC's Today Show (TV program)
Aug 26, MSNBC's Countdown (TV program)
Aug 28, USA Today
Aug 30, Seattle P-I

Sep 8, SF Chronicle
Sep 15, Variety
Sep 16, Globe and Mail

Nov 18, CBS Evening News

Statistics

Here's what the Star Wars Kid meme did to my overall traffic. At its peak, I received almost a million pageviews in a single day.

That includes all pageviews on my weblog entries. Isolating only the video downloads from my site, or later redirected to one of the mirrors, gives the following chart.

Download the Data

This file is a subset of the Apache server logs from April 10 to November 26, 2003. It contains every request for my homepage, the original video, the remix video, the mirror redirector script, the donations spreadsheet, and the seven blog entries I made related to Star Wars Kid. I included a couple weeks of activity before I posted the videos so you can determine the baseline traffic I normally received to my homepage.

The file is 158 megabytes &mdash 1.6GB uncompressed — so I'm distributing it with BitTorrent. The data is public domain. If you use it for anything, please drop me a note!

Download: star_wars_kid_logs.zip.torrent

16 comments

Garry Kasparov Griefed by Flying Penis

Posted May 19, 2008

In a bizarre example of Second Life leaking into the real world, a political assembly on Saturday led by chess grandmaster Garry Kasparov was disrupted by a flying penis.

Kasparov is a leader of the Other Russia movement, a loose coalition of activists opposing Vladamir Putin and the current Russian government. Over 700 people showed up for the event in central Moscow, but Kasparov's speech was interrupted when a large phallus-shaped helicopter started buzzing around the room. The Moscow Times attributed the prank to "a couple of pro-Kremlin Young Russia activists."

Warning: Mildly NSFW images and video follow.

Continue reading (183 more words)...
21 comments

The Whitburn Project: One-Hit Wonders and Pop Longevity

Posted May 16, 2008 (Updated Jul 14, 2008)

How has the record industry changed in the last 50 years? Using the Whitburn Project spreadsheet I talked about yesterday, I've been trying to dig into some of the underlying trends. Today, I'll be tackling the longevity and diversity of pop songs, and a look at which decades had one-hit wonders.

Longevity of a Pop Song

One of the trickier questions I've been trying to visualize is how long pop songs are staying on the charts relative to the past. Are they staying on the charts longer than in the past?

In the chart below, I plotted the total number of weeks charted for all 23,924 songs that appeared on the Billboard Hot 100 from 1957 to earlier this year. (In other words, a little dot on the "60" line means there was a song released that week that stayed on the Hot 100 chart for 60 weeks.)

See the heavy dropoff on the 20th week starting in 1991? In an attempt to increase diversity and promote newer artists and songs, Billboard changed their methodology, removing tracks that had been on the Hot 100 for twenty consecutive weeks and slipped below the 50th position. These songs, called "recurrents," were then moved to their own chart in 1991, the Hot 100 Recurrent.

Unfortunately, this shift makes it much harder to compare the last 15 years to the decades before it. In the chart below, I've isolated the effect by only showing songs that reached the top 50.

A couple interesting observations... Looking at the very bottom of the chart, you can see that in the last couple years, it's become very common for a single to appear in the Top 50 and fall out of the Hot 100 within four weeks. Prior to the mid-1990s, this almost never happened.

Also, songs are staying in the Top 50 for far longer than they used to. Unfortunately, I don't have any actual sales numbers to compare to, so it's hard to say if these 30-70 week singles are massive megahits eclipsing the #1 singles of the past, or if it's because the record industry is producing fewer hits than before.

Diversity

Did Billboard's methodology changes in 1991 make the charts more diverse, like they hoped? By looking at the total number of unique songs that have charted yearly, it's clear their changes did nothing to slow the decline.

According to Billboard, the late 1960s were the peak of musical diversity in popular music, with 743 different songs appearing on the 1966 Billboard Top 100. It's fallen consistently since, hitting an all-time low in 2002 with only 295 songs. Since then, it's improved only slightly, with 351 unique songs appearing on last year's Top 100.

One Hit Wonders

I've always thought the 1970s were the decade of the one-hit wonder, but now I have the data to see for sure.

In raw numbers, the 1960s had more one-hit wonders than any other decade, followed closely by the 1950s. But that's not entirely fair since, as we saw earlier, there were simply more unique songs on the 1960s charts. To find out the true numbers, we need to look at the number of one-hit wonders as a percentage of all songs in the Top 100.

This tells a totally different story. The 1960s, 1970s, and 1980s all had about the same ratio of one-hit wonders to hits by more established artists. The big surprise for me is that 1950s, 1990s, and 2000s really seem to be the eras where one-hit wonders dominated the charts.

Joshua Porter was wondering about the longest-charting one-hit wonders of all time. The longest-charting one-hit wonder to hit the #1 spot is Daniel Powter's "Bad Day" from 2006, which stayed on the charts for 32 weeks. The one-hit wonder that stayed at the #1 longest is Anton Karas' "The Third Man Theme" from 1950, which stayed in the #1 position for 11 weeks. Finally, the longest-charting one-hit wonder to appear anywhere in the Top 100 is Duncan Sheik's "Barely Breathing" from 1997, which peaked at #16 but stayed in the top 100 for 55 weeks.

Have any other questions about the data, or done any analysis yourself? I'd love to hear about it.

May 20: Don't miss Mike Frumin's chart of pop longevity,from 1998-2002.

May 21: Using the Whitburn data, Tom Whitwell generated a tag cloud showing the top 100 commonly-used words in song names. Dianne Warren should write a #1 hit called "Love my Baby Blue Heart: A Girl's Night Song."

July 14: Pedro did some additional analysis, including artists with multiple hits in the same week and one-week wonders.

29 comments

The Whitburn Project: 120 Years of Music Chart History

Posted May 15, 2008 (Updated May 17, 2008)

For the last ten years, obsessive record collectors in Usenet have been working on the Whitburn Project — a huge undertaking to preserve and share high-quality recordings of every popular song since the 1890s. To assist their efforts, they've created a spreadsheet of 37,000 songs and 112 columns of raw data, including each song's duration, beats-per-minute, songwriters, label, and week-by-week chart position. It's 25 megs of OCD, and it's awesome.

As far as I know, this is the first time the project and its data have ever been discussed outside of Usenet. Despite its illegality, they've created a wonderful resource and you can do some fun things with the data. For the next three days, I'm going to publish some analysis and insights gleaned from their work. Update: I published an entry about one-hit wonders and pop longevity.

Continue reading (731 more words)...
53 comments
« April 2008 | May 2008 Archives | June 2008 »
Waxy Links
Ads via The Deck
January 6, 2009
The Perils of Zero-Gravity Videography — Matt Harding discovers hard drive-based camcorder don't work in zero-gravity (via)
Screenshot: 4chan hacks MacRumorsLive during Apple keynote — the 4chan thread shows how they found the admin interface, password hashes, and finally cracked a user's password
January 5, 2009
xkcd's Guide to Converting to Metric — even Liberia and Myanmar are mostly metric, compared to the U.S.
Crowdsourcing an Ethical Dilemma — Dolores Labs uses Mechanical Turk to answer the Trolley Problem
January 3, 2009
Stamen's Mike Migurski on extreme programming vs. interaction design — the linked interview is great
January 2, 2009
Jason Scott on the closure of AOL's online communities — like physical evictions, there need to be laws protecting community data in the event of closure
JPG Magazine to stop publishing, turn off website — with only three days notice; here's the response from Derek and the JPG community
December 31, 2008
Wikipedia over DNS — loony hack serves summaries of Wikipedia articles; also available as JSON and JS
Leap year bug caused every 30GB Zune to crash at 2am this morning — as strange as the Android bug that ran every keystroke as root
Metafilter's exhaustive tour of the early origins of Adult Swim — the Cartoon Network breathed new life into old cartoons, while constantly trying to find the next big thing
December 30, 2008
Infochimps' massive scrape of Twitter's friend network — Twitter gave their blessing on sharing the 56-million records, which includes 10M tweets and 220k hashtags
The Lonely Island's We Like Sportz — the sequel to Just 2 Guyz
Niall Kennedy documents the undocumented Google Reader API — whoops, this was three years ago; here's an updated version
Sakurako Shimizu's Waveform Jewelry — the "I Do" wedding band and Atari chip ring are cute, too
Fimoculous' 30 Most Notable Blogs of 2008 — an incredibly well-researched list, with related recommendations for every entry
December 29, 2008
DJ Earworm's United State of Pop 2008 — mashing up the top 25 singles of the year into a single song and video
Twit 4 Dead, four Twitter bots fight zombies in real-time — watch their collected activity here
Facebook sentiment mining predicts presidential polls — like StateStats, Facebook Lexicon is tons of fun
Giganews reports Usenet upload growth since 2001 — note this doesn't reflect Usenet popularity, but most likely the rise of huge Blu-Ray and HD rips
December 28, 2008
List of Starbucks employee jargon — culled from the Starbucks Gossip blog
December 27, 2008
Rocketboom covers the history of the Lip Dub — the Know Your Meme series is consistently well-researched and fun to watch
Jennifer 8. Lee on the history of General Tso's Chicken — different cultures each localized their own versions of Chinese food around the world (via)
Top 20 freeware games released by Cactus, this year — is Jonatan the most prolific game developer alive?
December 26, 2008
Paul and Storm finish their 25 Days of Randy Newman — hosted on Bandcamp, and now with the solo piano track used in each song
AutoPager, infinite scrolling for Firefox — love the idea, but too clunky for everyday use (via)
December 24, 2008
Net Cafe archives, dot-com nostalgia TV show from 1996-2002 — Sergei Brin in 2000 at the newly-opened Metreon, Mondo 2000 and Boing Boing, awkward Webby broadcasts, and hundreds of dead dot-coms (via)
The Offworld's best indie and overlooked games of 2008 — also: Gamasutra's top 5 indie games
Left 4k Dead — lo-fi zombie shooter in 4k of Java (via)
NORAD's Santa Tracker on Twitter — they just passed through Kazakhstan; also tracking on Google Maps and in 3D on Google Earth
December 22, 2008
ScummVM adds support for 7th Guest — I didn't realize they expanded into non-Scumm engines last year, including the Sierra AGI games

Andy Baio lives here. Some rights reserved, for your pleasure.