Streaming Media

Streaming Media on Facebook Streaming Media on Twitter Streaming Media on LinkedIn
 

Tutorial: Sound Removal, Auto-Speech Alignment, and More in Adobe Audition CC

Exploring 3 new key features in Audition CC, the newest version of Adobe's professional audio editing application: Sound Removal for eliminating hums and other variable-frequency unwanted noises, the Loudness Radar Meter for matching and adhering to broadcast volume standards, and Automatic Speech Alignment for ADR.
Bookmark/Share
Email
Print
Digg

Working With Automatic Speech Alignment

If you work with scripted projects that need to be delivered with no mistakes whatsoever in the dialogue, ADR, or Additional Dialogue Recording (or Replacement), is something that you're probably familiar with. Adobe has a tool that they introduced in the last version of the Creative Suite that carries over into Audition Creative Cloud called Automatic Speech Alignment.

Let’s say you have a set of clips, and one of them was badly recorded or has some unwanted noise, or one includes dialogue from an actor who appears in multiple clips, but his voice sounds substantially different in one of them To clean up the project, you need to match up the bad recording to a studio version, recorded later, that you want lip-synced to it. Auto Speech Alignment can help.

In the example shown in Figure 16 (below), created from files provided by Adobe for this tutorial, I've got two tracks. On top is the original audio with the video, and you can tell it's varied wildly. There are three different scenes--one outside of the Adobe offices in Seattle, one in a construction zone, and a third that looks like it's outdoors, perhaps near a fountain or a public area, with a lot of traffic and other noises in the background. And you can just look at the waveform in Figure 16 and see the differences from scene to scene. The one outside of the office building is pretty clean. There's a little bit of noise at the end. The one in the construction area has a lot of ambient sound. And then the last one has kind of an in-between range, particularly in the first half of the clip. There's some noise, but it's not quite as bad as the middle one.

Figure 16. Our original audio (top track) and studio re-record (below).

On the contrary, on the track below, we have their studio version, where they brought Durin (the speaker) in to re-record his dialogue. And he recorded a clean version in their nice quite studio so that they'd have a nice clean version of what he needed to say. Listen to the original, just so you get an idea of what we're dealing with, at the 9:00 mark of the clip below.

 

So there's a lot to deal with here. Fortunately, with Automatic Speech Alignment, all you have to do is select one and two that match. And you can see they recorded just the audio needed for each one of these clips individually. So all you've got to do is get these two clips selected and you've got the same words in each. Then right-click on either one of them, and go down to Automatic Speech Alignment (Figure 17, below).

Figure 17. Choosing Automatic Speech Alignment.

The Automatic Speech Alignment dialog opens (Figure 18, below). There, you're going to choose the reference clip, which is the clip it's going to look at to find out where the words and syllables should go. In this case, it's going to be the top one, the MVI file. And you're also going to choose a channel. Stereo could be used in a variety of ways. But in this case, it looks like they had an on-camera stereo mic. So it really doesn't matter which channel we choose. The channels in the original look about the same. You'd normally want to try to choose the cleaner one so it has an easier time of reading it. But we'll just leave it off. Their studio recording, of course, is in mono. So there's no selection for the channel there. And there are three different settings that you can choose for how it aligns. Each one of them may have different results, depending on what quality your audio is. For now, I'm just going to choose kind of the middle of the road safe one, Balanced Alignment and Stretching, and leave it at that.

Figure 18. Choosing settings in the Automatic Speech Alignment dialog.

When you’re done choosing settings, click OK. Audition is going to analyze it and create a new track with just that synchronized audio. And we're going to solo that. So we only hear the new audio. At the 10:46 mark in the clip below, you can hear how close we got it.

 

As you can hear, it started off with good intentions and kind of crashed and burned at the end. But let's give it another chance. Let's choose the next two clips (the construction site clip and the corresponding studio clip). Right-click, choose Automatic Speech Alignment. Then choose our source. Again, the channel doesn't really matter because they're both pretty awful. And this one's so bad I'm going to choose reference clip as noisy (Figure 19, below), which I assume will help it filter that out all that noise a little bit better.

Figure 19. Choosing settings for matching the second clip.

Durin is really hard to hear in this clip. So let's just hear what it does. Again, it's going to align it and create a new track. In Figure 20 (below), you can already see that it's improved because the waveforms are now in about the same place. And it appears to be a little off at the end, just visually looking at it.

Figure 20. The second clip, well aligned now.

But let's hear what it sounds like by listening to the clip below at the 11:53 mark in the clip below.

 

That one was just about spot on. We’ll give it one more shot on this last pair. You know the drill: Right-click. Auto Speech Alignment. It's already choosing the right track. Again, left and right are about the same so we'll leave that. Let's leave this reference clip as noisy, because I heard a diesel idling in the background. Solo the new track, back it up and hear what it sounds like at the 12:41 mark in the clip just above this paragraph.

Wow. I have no idea what happened at the end there. It was right on until the very end. But anyway, you get the idea. Your mileage may vary on this. You have to try various settings. As I showed you, there are three different ways to allow it to automatically align. So one of these might have worked better for one of these pairs of clips. It's all going to vary, depending on your source audio, the quality of it, what kind of noises are in the background, and how consistent it is.

A lot of the renders I did in the video tutorial that accompanies this article were done in real time. Some of them I skipped just for sake of time. But for the most part, they were all done right in front of you on the screen. And I was also in the background recording, in a separate application, my audio and recording the screen capture in Camtasia, which was generously provided by TechSmith. So all of that was eating up some of the RAM, which is why I skipped ahead a few times and cut the video.

But you'll notice right away that the processing of all the effects in real time is extremely fast and vastly improved over previous versions. So that's an overview of some of the newest and improved features in Adobe Audition Creative Cloud.

Related Articles
Two types of noises degrade the audio you shoot with your video: random noises like microphone clicks and pops, and consistent noises like white noise or air conditioning hum. This tutorial demonstrates how to remove both of them in Adobe Audition CS6.
In this tutorial you'll learn how to create and edit industry-standard closed captions for video using the new closed-captioning capabilities in the just-released Adobe Premiere Pro CC.
Here's a quick tutorial for Premiere Pro CC users on how to migrate encoding presets that you created to customize and streamline encodes in Adobe Media Encoder CS5/6 into Adobe Media Encoder CC so you can pick up right where you left off.
If you ever find yourself having to render multiple sequences from Premiere Pro, there's an easier way to do it than by using the Premiere Pro Export control. This tutorial will explain the easier and more efficient way using Adobe Media Encoder.
Using Adobe Lightroom 5 to color-grade and apply metadata to DSLR video files is simple and efficient and can benefit live-switchers doing minimal edits, editing novices, and pro editors exploring non-traditional workflows.
New SpeedLooks in SpeedGrade, Mask Tracker in After Effects, Sync Settings in Adobe Media Encoder, improved multicam in Premiere Pro, expanded UltraHD/4k support, and more
In this first installment of a two-part series, Jan Ozer compares the declipping and crackle and pop-removal features in iZotope's new RX 3 pro audio editor to the parallel features in Adobe Audition CC.
In this final round between audio editing champs iZotope RX 3 and Adobe Audition CC, we compare the two audio editors in noise reduction and reverb/echo reduction.
Recent upgrades to Adobe CC make it easy to apply graded looks in Adobe Premiere Pro CC and match shots with different color temperatures via seamless roundtripping between Premiere Pro CC and SpeedGrade CC.
This tutorial demonstrates how to apply an effect to a portion of a video image while leaving the rest of the clip untouched, and how to track that portion of the image throughout the duration of the clip, using the Track Matte effect in Adobe Premiere Pro CC.
This video tutorial demonstrates how to use and leverage 3 key new features in Adobe Media Encoder CC: Lumetri Looks support, and image, text, and timecode overlay.
This video tutorial demonstrates how to restore muffled sections of spoken audio using the Multiband Compressor in Adobe Premiere Pro CC or Adobe Audition.
Here are three quick tips that will streamline your titling workflow in Premiere Pro when you create styles you like and want to use them consistently without reinventing them each time.
In this tutorial, we'll look at how to create screencams with TechSmith Camtasia, and then import them into Adobe Premiere Pro to incorporate them into your existing Premiere Pro projects to create professional-quality instructional videos that seamlessly combine screencams and HD footage.
iShowU, a Mac-based screencam app from ShinyWhiteBox.com, is quick, easy-to-use, and inexpensive; here's a look at how to use it to produce pro screencams that you can import into Adobe Premiere Pro CC to integrate into your video projects.
While After Effects may be daunting for some Premiere Pro editors, here is an easy-to-follow workflow that can enliven your text and titles with pre-built animations found in Adobe Bridge, applied in a few simple steps in After Effects, and imported directly into your Premiere Pro timeline.
Today Adobe announced updates to all Creative Cloud video apps that will debut at NAB. Here are details on the updates, plus a video tutorial on four key new features in Premiere Pro CC: Master Clips, Live Text, Masking and Tracking, and new 4K format support.