Skip to content
Waxy.org
About
Mastodon
Contact

Dirty, Fast, and Free Audio Transcription with YouTube

Posted January 24, 2014 by Andy Baio

Five years ago, I wrote about how I transcribe audio with Amazon’s Mechanical Turk, splitting interviews into small segments and distributing the work among dozens of anonymous people. It ended up as one of my most popular posts ever, continuing to draw traffic and comments every day.

Lately, I’ve been toying with a free, fast way to generate machine transcriptions: repurposing YouTube’s automatic captions feature.

How It Works

Every time you upload a video, YouTube tries to generate a caption file. If there’s audible text, you can grab a subtitle file within a few minutes of uploading the video.

But how’s the quality? Pretty mediocre! It’s about as good as you’d expect from a free machine-generated transcript. The caption files have no punctuation between sentences, speakers aren’t broken out separately, and errors are very common.

But if you’re transcribing interviews, it’s often easier to edit a flawed transcript than starting from scratch. And YouTube provides a solid interface for editing your transcript audio and getting the results in plaintext.

I used TunesToTube, a free service for uploading MP3s to YouTube, to upload the first 15 minutes of our New Disruptors interview, with permission from Glenn Fleishman.

It took about 30 seconds for TunesToTube to generate the 15-minute-long video, three seconds to upload it, and about a minute for the video to be viewable on my account.

It takes a bit more time for YouTube to generate the audio transcriptions. Testing in the middle of a weekday, it took about six minutes to transcribe a two-minute video, and around 30 minutes for the 15-minute video. Fortunately, there’s nothing you need to do while it processes. Just upload and wait.

I ran a number of familiar film monologues through the YouTube’s transcription engine, and the results vary from solid to laughably bad. I’ve posted the videos below with the automatic transcription and their actual text.

As you’d expect, it works best with clear enunciation and spoken word. Soft words over background music, like in the Breakfast Club clip, falls apart pretty quick. But some, like Independence Day, aren’t terrible.

Continue reading “Dirty, Fast, and Free Audio Transcription with YouTube” →

39 Comments

Ellen DeGeneres' "Walter Mitty" Screener Leaks Online

Posted January 9, 2014 by Andy Baio

It’s Oscar piracy season, that time of the year where screeners of newly-released critical darlings are leaked online as DVD and Blu-Ray screeners are sent out around the world to Academy voters and secretly loaned to their friends and relatives.

Yesterday was a busy day with screener copies of Frozen, Her, and The Wolf of Wall Street all appearing online.

Today, I got a tip that there was a very unusual watermark in the screener of The Secret Life of Walter Mitty that leaked online today. I dug it up, and sure enough, a very familiar name pops up in the first scene of the screener. I made a GIF of it:

Oh, Ellen! If the watermark’s accurate, this screener belonged to Ellen DeGeneres. But was it actually an Oscar screener? Probably not.

The watermark shows that the screener was created on November 26, 2013. According to Ken Rudolph’s Academy screener list, he received the Walter Mitty DVD screener via UPS on December 19.

That’s a pretty huge gap, indicating that Ellen’s screener wasn’t for Oscar consideration, but instead given to her for review in advance of Ben Stiller’s December 4 appearance on her show.

Of course, there’s a chance, albeit small, that this watermark was added by someone besides 20th Century Fox — by someone trying to hide the identity of the actual source, maybe.

More likely, the watermark is accurate and Ellen’s screener simply ended up in the wrong hands. A postal worker, one of her employees, friend, family member, or countless others in the production and distribution chain could be responsible for ripping the DVD and putting it online.

It’s very common for screeners to leak, but rare for a celebrity’s name to be identified as the source. In 2011, a screener copy of Super 8 leaked online with Howard Stern’s name clearly watermarked on it. Stern vehemently denied leaking the film on air.

Curious to see if Ellen responds the same way.

As usual, I’ll update my spreadsheet of Oscar screener piracy statistics as soon as the nominees are announced on the morning of January 16.

7 Comments

GoldieBlox and the Three MCs

Posted November 26, 2013 by Andy Baio

Update: GoldieBlox removed the original video and posted a public apology. See below for updates.

Everyone thinks they know how copyright works, and everyone’s usually wrong. Who can blame them? It’s often counterintuitive, inconsistent, and riddled with grey areas and edge cases.

And no area of copyright law is more confusing than fair use, deliberately designed to be judged in court on a case-by-case basis without any “bright line” tests to guide the way.

The test for fair use is a balancing act of four factors, but how they’re weighed is often subjective, determined by a judge. Different judges rule differently on similar fair use cases, and circuit courts commonly reverse fair use rulings from district courts on appeal.

If even judges can’t agree on fair use, what chance do the rest of us have of understanding it?

In fair use, there’s no silver bullet and exceptions are the norm. Some parodies are fair use, others aren’t. Commercial use can weigh against a fair use ruling, but there are many notable commercial exceptions. Using a substantial amount of the original artwork can hurt your case, other times it doesn’t matter. Damaging the market value of an original artwork can hurt your claim or, as with parodies, it may not matter at all.

So, how does that play out in GoldieBlox v. Beastie Boys?

It’s entirely possible that the GoldieBlox video is simultaneously:

  • A parody
  • An advertisement
  • A derivative of the Beastie Boys’ copyrighted work
  • A violation of MCA’s dying wishes
  • And, yet, perfectly legal under the fair use doctrine.

Only a judge can decide whether GoldieBlox’s parody is fair use. And, until they do and all the appeals are closed, none of us will know.

In the meantime, let’s bust some myths!

Disclaimer: Hey, I’m not a lawyer either. But I’ve been writing about copyright here for over ten years and dealt with several copyright disputes myself, including my tangle with fair use from Kind of Bloop. I’m going to try to avoid any conjecture here, and stick to actual case law. If I miss something, please let me know.

Myth: The Beastie Boys sued GoldieBlox.

The Beastie Boys were quick to debunk this one themselves in their open letter. “When we tried to simply ask how and why our song ‘Girls’ had been used in your ad without our permission,” they wrote, “YOU sued US.”

But GoldieBlox filed a very particular type of lawsuit, a declaratory judgement. Unlike typical lawsuits, GoldieBlox isn’t seeking damages. They’re asking the court to issue an opinion without ordering Beastie Boys to do anything in particular or pay damages, beyond possibly their own legal expenses.

This appears confusingly aggressive, but it’s a common tactic when threatened with a copyright lawsuit. If it works, the court’s clarification can save the time and money spent fighting an expensive trial. You may remember Robin Thicke reluctantly suing Marvin Gaye’s family, when they threatened to take him to court over “Blurred Lines.” Same deal.

Update: Yesterday, on December 10, the Beastie Boys filed a countersuit. So now they actually are suing GoldieBlox.

Myth: It’s an advertisement, so it’s not fair use.

More than any other, I’ve seen this myth repeated everywhere. Can a company parody a famous artist’s work and use it, against their will, to advertise an unrelated product? Actually, yes, as long as the use is transformative enough.

The most famous case is the Naked Gun advertisement below, a parody of photographer Annie Leibovitz’s famous portrait of Demi Moore for Vanity Fair.

If you care about this sort of thing, the District Court’s decision is a fantastic, and surprisingly readable, breakdown of the history of parody and fair use.

In her decision, Judge Preska noted that the landmark 2 Live Crew case, settled by the Supreme Court only two years earlier, set a new precedent for deciding fair use cases.

In that case, the Supreme Court ruled that commercial use does not preclude a finding of fair use, so long as the work is “transformative” — does it add value to the original material and use it for a different purpose, such as criticism or parody?

Delivering the opinion of the Supreme Court, Justice Souter wrote, “The goal of copyright, to promote science and the arts, is generally furthered by the creation of transformative works… The more transformative the new work, the less will be the significance of other factors, like commercialism, that may weigh against a finding of fair use.”

Later in the ruling, Justice Souter specifically addressed parodies in advertising. He wrote, “The use, for example, of a copyrighted work to advertise a product, even in a parody, will be entitled to less indulgence under the first factor of the fair use enquiry, than the sale of a parody for its own sake.”

In the Naked Gun case, armed with this new precedent, the District Court decided in Paramount Pictures’ favor:

“I can only reconcile these disparate elements by returning to the core purpose of copyright: to foster the creation and dissemination of the greatest number of creative works. The end result of the Nielsen ad parodying the Moore photograph is that the public now has before it two works, vastly different in appeal and nature, where before there was only one.”

Annie Leibovitz appealed, but the 2nd Circuit Court affirmed the decision, saying, “On balance, the strong parodic nature of the ad tips the first factor significantly toward fair use, even after making some discount for the fact that it promotes a commercial product.”

So, in the GoldieBlox case, the court will decide whether the parody’s criticism of “Girls” sexist lyrics outweigh its commercial nature. The EFF believes they will, and given the existing precedent, they may be right.

Myth: GoldieBlox stole from the Beastie Boys.

First off, infringement is not theft. These are two completely different terms with different meanings. If GoldieBlox stole something, the Beastie Boys wouldn’t have it anymore.

Second, it’s worth noting that GoldieBlox didn’t sample from the original song. (If they had, this would be a very different lawsuit.) Their parody was recorded with new instrumentation, vocals, and lyrics.

GoldieBlox used the composition to create a derivative work. Because it was unlicensed and created without permission, that new work may infringe the Beastie Boys’ copyright. This lawsuit will determine whether it’s infringement or fair use.

But however you look at it, it’s not stealing.

Myth: The Beastie Boys always have a right to decide how their music is used.

Usually, but not always! The Copyright Act grants broad exclusive rights to musicians to control the reproduction, performance, and distribution of their work for an absurdly long time—70 years after their death.

But there are a number of exceptions. Musicians can’t, for example, stop the secondhand sale of their albums or stop people from covering their songs.

Similarly, fair use is an exception to those exclusive rights. If someone can defend their use of a song in court, and the court rules it a fair use, then that use is legal and outside the artist’s control.

Myth: Adam Yauch’s will forbids using his songs in advertising, so it’s illegal.

In his last will, MCA stated that “in no event may my image or name or any music or any artistic property created by me be used for advertising purposes.”

By ignoring the last wishes of one of hip-hop’s greatest musicians, less than two years after his death, there’s a strong argument to be made that what GoldieBlox is doing is unethical. To me, it feels crass and insensitive.

But is it illegal? Not if the court finds the parody to be fair use.

This isn’t a moral judgement, and this isn’t copyright activism. This is the law, as it exists right now.

Myth: If this is legal, then any company can parody songs in ads for free.

The crux of this case is whether the GoldieBlox parody is transformative. The parody video’s new lyrics criticize the misogynistic lyrics of the original Beastie Boys song. If it didn’t, there wouldn’t be a case.

Any other parody in advertising that doesn’t transform the original will still need permission and pay licensing fees. Snuggie will still have to pay for their version of the Macarena because it doesn’t comment or criticize the original in any way.

It’s worth noting that this isn’t the first time GoldieBlox used a song in an ad. This earlier ad from July rewrote some of the chorus to Queen’s “We Are the Champions”, but left most of it intact. I’d wager they never licensed this music either, and wouldn’t really have a defense if EMI came knocking.

Undetermined: The Beastie Boys were just asking questions and GoldieBlox sued them.

Neither party has released the initial complaint letter from the Beastie Boys, so we don’t know who sent the letters, the tone of the questions or what, if anything, they were demanding.

We do know that GoldieBlox claims in their lawsuit that they were contacted by “lawyers for the Beastie Boys” and the letter claimed that the video is “a copyright infringement, is not a fair use, and that GoldieBlox’s unauthorized use of the Beastie Boys intellectual property is a ‘big problem’ that has a ‘very significant impact.'”

It’s possible that GoldieBlox’s legal team is lying in a court filing, but it seems unlikely. More likely, the truth is somewhere in the middle. The law firm representing the Beastie Boys contacted GoldieBlox, asking for details and pushing them to delete the video. GoldieBlox felt they were in the right, and filed the request for declaratory judgment to find out.

I hope either party releases the original correspondence, it should be interesting.

Undetermined: This is all a publicity stunt.

It could be. GoldieBlox founder and CEO Debra Sterling, despite her Stanford engineering background, spent seven years as a brand strategist and marketing director before starting GoldieBlox. She definitely knows how to get publicity for her projects.

But there are certainly more affordable, less risky ways to gain publicity than filing a lawsuit. If they felt it wasn’t a serious threat, they could have simply gone public with the legal threat, posting the correspondence and writing a blog post.

But there’s no question this lawsuit has raised the profile of GoldieBlox, for better or worse.

The More You Know

So, who knows? This could go either way, and should be a fascinating case to watch. I’m in favor of more case law in either direction, helping draw the lines for what artists can or can’t do. It can be agonizing to make something that skirts the grey areas of copyright law without knowing whether you’re going to end up bankrupt.

Want to learn more?

Both the 2 Live Crew and Annie Leibovitz rulings are surprisingly readable explanations of how copyright and fair use are interpreted by the courts.

On the Media’s PJ Vogt published a great interview with Julie Ahrens, the director of Copyright & Fair Use at Stanford’s Center for Internet & Society. The EFF’s legal analysis is interesting, but I think they downplay the advertising issue too much. Rachel Sklar does her own fair use analysis.

On the other side of the spectrum, Felix Salmon blames Silicon Valley’s cult of disruption for GoldieBlox’s behavior. And, hey, are the toys actually any good?

Updates

Update: Last night, on November 26, GoldieBlox marked the original video private and uploaded a new version with modified music and all Beastie Boys references removed. This morning, founder and CEO Debbie Sterling posted this public letter to the Beastie Boys.

December 11: Yesterday, the Beastie Boys filed a countersuit for copyright and trademark infringement. We may see a ruling after all.

March 18, 2014: GoldieBlox settled out of court with the Beastie Boys, agreeing to publicly apologize and pay a percentage of proceeds to STEM education for girls.

51 Comments

Listen to MEEE

Posted November 15, 2013 by Andy Baio

Ever wanted to hear me blather on about different subjects for hours? You’re in luck! I’ve been invited on a number of podcasts recently, and for some reason, never mentioned it here.

This is just as much for my own records, but hey, here are all the podcasts I’ve spoken on lately. I just fixed your boring commute for a week!

The Crapshoot, November 14, 2013

Jesse Holden and Metafilter’s Josh Millard invited me to come shoot the crap in his basement. On the ride there, I tried to pick up some Four Loko, but found something so much better—Joose, 22 ounces of 12% ABV sickly-sweet malt beverage that tastes like Jolly Rogers soaked in turpentine. We ended up chatting for nearly two hours about Kind of Bloop, XOXO, my crazy Sunset Strip childhood, and weird experiments I’ve run on my son. A good time was had by all.

Let’s Make Mistakes, October 23, 2013

After Mule Radio’s Let’s Make Mistakes spent three consecutive episodes discussing this year’s XOXO, cofounder Mike Monteiro asked me to come on the show. I talked about what we learned from mistakes made during our registration process, the challenges of running an inclusive event, and the value of criticism.

The New Disruptors, June 19, 2013

Leading up to XOXO, Glenn Fleishman asked Andy McMillan and me to come on the podcast that we inspired him to start the year before. We talked about the previous year and what we were planning to follow it up.

Quit!, June 19, 2013

Andy McMillan and I went on Dan Benjamin’s show about quitting your day job and working on things you love, and had a great conversation about XOXO and our respective projects.

If you got this far, you probably have a miserable commute or just really, really love podcasts. So, you’re probably already listening to 99% Invisible, This American Life, Bullseye, and Radiolab.

Here’s three more that I’ve enjoyed lately:

The New Disruptors is basically XOXO: The Podcast, a weekly interview by Glenn Fleishman with indie artists and makers using the network to make a living.

Welcome to Night Vale is like NPR in the Twilight Zone, the community radio station for a fictional desert town. I’m kind of surprised how many people I talk to that still have never heard of this.

How to Do Everything, in which listeners send Mike Danforth and Ian Chillag their questions about life and they find experts to answer them.

Fix Your Boring Slides

Posted November 14, 2013 by Andy Baio

After running XOXO for two years, and doing a bunch of talks myself at various conferences, I’ve seen my share of presentation slides from non-designers.

Way too often, they look something like this:

Inevitably, they use one of the two basic Keynote templates — black on white, or white on black.

I get it! You’re not a designer. You’re a writer, filmmaker, musician, whatever, and even learning how to make these slides was kind of a big deal for you. So you stick with the default templates.

The problem with any defaults is that they’re boring and overused, which distracts from your message. They suck any life out of your story, and fixing it is so easy.

So I made a little guide with six simple tips for XOXO speakers to improve their slides in a couple minutes. I thought you might find it useful.

1. Use the right resolution.

At XOXO, we project slides on a widescreen movie-like display (a 16:9 aspect ratio, if you want to get geeky about it). But many projectors only support the standard 4:3 ratio, the default for most presentation software like Keynote and Powerpoint and your old TV.

Ask your event organizer if slides will be projected in widescreen. If so, be sure to use the 1280×720 resolution when you start your presentation. Otherwise, you’ll end up with black bars on both sides of your slides.

If it’s not widescreen, stick with the default 1024×768 resolution.

2. Fill the screen.

One of the most agonizing, fixable things is seeing a photo that someone placed unresized in the middle of a slide, surrounded by a sea of whitespace. Take that photo and scale it to fill every edge of the slide, even if it cuts off the edges. It’ll look better, and easier to see from the back rows.

3. Move it up.

Don’t put anything important in the bottom third area of your slide. Depending on your venue, there’s a good chance the people in the back won’t be able to see it, blocked by the heads of everyone in front of them. But this shouldn’t be an issue, as long as you…

4. Make it big.

Never have more than four or five words on a single slide. Any more than that, and people will start reading them instead of listening to you. If it’s a longer passage or quote, you’re going to have to read it out loud anyway for the back rows, so you might as well leave it off the screen.

Oh, and make them huge. I don’t think I’ve used anything smaller than 96 point type in years. Best slides I’ve ever seen? Cabel Sasser’s at XOXO.

The full-bleed photos and big, bold centered type on colorful backgrounds frame him throughout the talk, and create a wash of color and imagery for him to talk over that you can see from the back of the room. And it photographs beautifully. Follow his lead.

Photo by Ian Linkletter

5. Add some color.

The Keynote default is white text on black, or black text on white. As a deliberate design choice, this can sometimes work. But most of the time, it’s pretty generic.

Consider using a wash of color behind your giant type. Not sure where to start? Go to Colourlovers and pick out a scheme you like, and stick to it.

Or put the type over a relevant, full-screen photo, as long as it’s not distracting from readability. Even subtle full-screen video can work, if done well. All you’re trying to do is provide some texture to your talk. The focus should be on you.

6. Change the type.

Gill Sans is a great typeface, but because it’s the Keynote default, it shows up everywhere and feels deadly boring. Avenir, Seravek, and Helvetica Neue Condensed Bold are safe bets and ship with current versions of OS X. If you know what you’re doing, drop some money on a good commercial typeface.

If you’re on Windows, there are free alternatives that work well. You could certainly do worse than Open Sans, Bebas Neue, Chunkfive, and so on.

Hope that helps!

10 Comments
⇠ Older Posts
Newer Posts ⇢
Waxy.org | About