Skip to main content

Six AI Audio Cleaning and Transcription Resources for Video and Animation Content Creators

AI Audio Cleaning Robot. Image by TET and Leonardo.ai
While AI applications may have been receiving a lot of bad press lately in the visual arts, there are definitely times when AI is a game changer, in a good way, for more mundane applications like audio cleaning and transcribing.

Maybe there's some militant audio engineers or transcribers out there who just love what they do but the ability to give an audio file to an AI, and have it automatically improve the quality of sound or transcribe an hour or more of speech in seconds, is pure magic.

Sure the AI doesn't always get it right, particularly with transcribing, but it's pretty good. Plus, correcting a few short falls is certainly better than doing all the work yourself.

I recently recorded a thirty minute video with audio that was borderline awful. It was clear enough to understand but I hadn't been able to filter out the static noise of the microphone, and it would distort on the louder sections, just enough to notice, even though the levels were well under the clipping threshold.

Nothing I did in post would fix it so I decided to see what AI audio cleaning services were out there (and of course find out if there were any free services).

Clean Voice AI

Sound Wave Before and After Cleaning.
I'm leading with Clean Voice AI because it was the service I used to fix my audio. Your first 30 minutes is completely free (which was all I needed).

Clean Voice over delivers, not just fixing terrible audio but also removing dead air, 'ums' and 'ahs' (as well as other mouth sounds), and more.

Although I didn't try Clean Voice's audio transcribing, its inclusion  makes the site a great, one stop service, for many of your audio needs. It's particularly targeted at podcasters including a number of free tools for them like their Podcast Episode Title Generator.

Clean Voice AI is browser based. Subscription pricing seems quite reasonable to me but what I most liked is that you can pay as you go as well.

Audo Studio

If all you need is just a straight up audio cleaner that can remove almost any unwanted background noise then Audo Studio is a browser based application that may be what you're looking for.

Some examples of the kind of noises they can fix include background restaurant noise, bird squawks, dog barking and more. Audo Studio can also auto adjust volume levels so your voice can be heard.

Another nice feature is that you can upload video files for audio cleaning. No need to separate your audio just to use the service. Which may be useful if you're wanting to fix audio on older completed videos.

Audo Studio is subscription based but they do have a free plan that gives you 20 minutes of audio cleaning per month.


Deciphr

Transcript Sample from Deciphr
Another browser based AI service targeting podcasters (but also can process video files) Deciphr is a one stop shop for turning audio into all kinds of text. 

Not limited to transcribing, it can also generate show notes, show summaries, pull out quote highlights, create captions for social media posts, list keywords for SEO and, on a paid plan includes the creation of audiograms and video reels (highlighted audio and video for social media). 

All output is organized on a sharable page with a nice headline or you can download everything as a Word Document.

Deciphr has a flexi-free plan that gives you 40 minutes of audio/video upload to get started then it's a pay as you go plan. Unfortunately you do need a credit card as they only accept payments through Stripe. Which for me is disappointing because I would definitely subscribe to a plan if I could use PayPal.

Riverside

Riverside is actually a complete browser based studio for professional podcast and video recording which you can try for free. Alongside that Riverside has a host of free and paid tools including their transcription service (which is free).

You don't need an account for their transcription service, just drag'n'drop an audio or video file onto the browser window and you're away. Text can be downloaded as a transcript or caption text file. 

Note that if you're using a browser other than Chrome or Edge you may find this doesn't work. I've been trialing Opera's Browser and the transcript tool wouldn't go past the choose file section. Chrome worked just fine though.

Well worth checking out some of their other free tools which include things like a YouTube Channel Name generator.

Descript

The Descript Editor
The Descript Editor can edit video
direct from your script.
Descript
is also an all in one video creation tool that is browser based but also has a desktop version. Their studio uses a fairly unique concept of editing video based on your text script.

Some of Descript's features include cleaning your audio, removing filler words, and you can clone your own voice and have it speak new dialogue. There's also natural speaking AI voices you can utilize.

Since Descript's video editor relies on a text based script it goes without saying that it can also transcribe your audio and video. Not only that, if you already have a transcription, they can sync it to your media word for word.

Descript is well worth a look since there is a free plan that gives you most features with the ability to create up to an hour of video per month.

* Note: Links to Descript are affiliate links that support this site if you sign up for a Descript paid plan.

AI-coustics

AI-coustics is a fairly basic, browser based, AI audio cleaner if you just need to knock out some background noise from your recordings. Clean up to an hour of audio per month on the free account. Test the service out before you sign up in their Playground area.

The Levelator (Bonus Non-AI Free Software)

Not really an audio cleaning tool but more of an audio enhancing tool for podcasters and video creators too. The Levelator is free software for Mac or Windows that simply takes your voice audio and adjusts the speaking level of all voices to one consistent level.

Pretty much does the job of a compressor filter but the authors say it does more than that, evening out all the voices so none sound too quiet and hard to hear.

The software is quite old now but still does the job. Great if you just need something simple to even out your audio and don't really understand the technicalities of using a compressor filter in your video/audio editing software.
 

Comments

Popular posts from this blog

Eight 2D Animation Apps For Your Phone or Tablet Mobile Device

M obile productivity apps have become so capable that they can be great alternatives to their PC/MAC equivalents or serve as great tools in their own right when you're away from your desk. While some apps simply mimic their desktop counterparts, others offer well thought out, touch-friendly interfaces that are easier and more fun to use. Every so often I check out what's available for 2D animation for Android devices, since that's what I use, that can complement my workflow with Reallusion's Cartoon Animator 5. Some may be available for Apple devices as well. Below I've listed six free (F) apps (with optional paid (P) upgrades) on the Google Play Store that you might want to explore. Some are just fun apps on their own while others may be useful as part of your workflow on bigger animation projects. Not all are exclusively animation apps and could be used on any production. JotterPad (F/P) The name JotterPad makes this sound like a notepad application but it's ...

Inochi2D - Free Open Source 2D VTuber Avatar Rigging and Puppeteering Software (Part 2 - Inochi2D Session)

In part one of my deep dive into the free VTuber software, Inochi2D , I focused mainly on Inochi2D Creator, which is used for rigging your character avatar in the correct file format for use with Inochi2D Session, the puppeteering part of the software. The two sides of the software are still very much in development and the documentation, particularly for Session, is very thin on the ground. To the point where I don't think I could even do a comprehensive tutorial because I'm not sure I'm even doing things right, and the software could change significantly in a single update. As a result, in this part of my Inochi2D deep dive I'm changing tact from presenting my finished Cartoon Animator TET Avatar, and will be summarizing my experience of getting Session up and running using OpenSeeFace as the recommended webcam motion capture software. To do this I will be using  the TET avatar I created in my review of Mannequin , since that can be exported as a full, ready to go r...

Review: Animaker - 10X Better than other Online Animation Video Making software (#DIY)... or is it?

Animaker's bold claim, right on its homepage is that it's  10X Better than other Online Animation Video Making software (#DIY). Also featured on their homepage is a cool promotional video that's dynamic, full of charming lip synced characters, with high quality animation that matches perfectly to the story being told. If I could make anything even half as good with their studio, I'll at least buy that they're better than most of their competitors. Let's see if they live up to their tagline 'Animated Videos, Done Right!' Animaker is a flash based, cloud animation studio application that gives you access to an entire library of thousands of characters, props, backgrounds, sounds and more, to create almost any kind of 2D animated video. In fact they make the bold claim that theirs is the largest animated library in the world of any similar online application (it's not... or if it actually is, it's not as versatile as other comparable librari...

The Family Guy Method - Animating Talking Hand Gestures in Cartoon Animator

Once you start getting into character animation you learn pretty quickly that people don't just speak with their mouths. Hand gestures and movements play a pretty important part of how people communicate too. The problem is, animating hand gestures and movements is extremely time consuming... and who knows what gestures and movements should be used and when? In Reallusion's Cartoon Animator I use pre-animated talking character motions that I chop and move gestures around so the arm and hand movements 'feel' right based on my own understanding of body language (and I also act out dialogue to get a sense of what arm and hand movements I might make with what's being spoken). Recently I came across a video by the creator of Culpamland Extra , an online animated series, in which they briefly outlined how they animate talking using the Family Guy Method. I'd never heard of this, and if you try to search for it online you'll be hard pressed to find anything. So I...

Review: Headshot Plugin for Reallusion's Character Creator 3

Headshot for CC3. Quite possibly the best 3D Avatar I've made of myself in any 3D application. Creating a realistic 3D human avatar is a whole lot easier with Reallusion's new Headshot Plugin for Character Creator 3. The plugin is an AI powered extension that can generate 3D digital humans from one photo. Which sounds like an amazing proposition but, in practice, if you're trying to achieve a specific likeness to an actual person, Headshot will give you an excellent base to work from. Headshot has two modes, Auto and Pro. Auto Mode Auto is well worth a try if you have an ideal photo of a front facing person that is properly lit and posed to Headshot's optimum requirements. It's also the only mode that will take a crack at generating a hair model. I grabbed an image of Harrison Ford, dragged it into Headshot without changing any of the default settings (other than specifying 'male' and selecting an 'old male' setting) and this is what I...

Moho 14 Released - Still the Best 2D Animation Software for Indy Animators on a Budget

Moho 14 Released. Regular readers know I am a Reallusion, Cartoon Animator advocate through and through. Hands down I would recommend Cartoon Animator 5 first over Lost Marble's Moho 14 to anyone who is just starting in 2D animation, is a team of one, or just needs to animate as quickly as possible. However, feature for feature, Moho is, arguably, the best 2D animation software for the rest of us who can't justify a Toon Boom Harmony , or Adobe Creative Cloud subscription (and even with their applications Moho is very competitive on features). You can get started with Moho Debut for just USD$59.99 which is a cut down version of Moho Pro but it still has the most essential features needed for 2D animation. While Moho Pro is a whopping USD$399.99 (Cartoon Animator, which only has one version, is just USD$149.00) upgrades to new version numbers come down to a quarter of the price at USD$99.00. Even though Reallusion just released features like Motion Pilot Puppet Animation and...

KIT Scenarist - Free, Open Source, Screenwriting Software that Helps Research Your Ideas Too

KIT Scenarist Script Writing Software's Mascot, Alexander Cat. While you can write a script in any word processing app, if you're writing stories (screenplays) that feature characters and dialogue, a dedicated script writing app can save a lot of time formatting, letting you focus more on the actual story. Script writing apps are also very useful if you plan to send your screenplays out to production companies, or if you're collaborating with actors and other production people, who are used to scripts being in a particular standard format.  [Note: In case you're wondering there are reasons scripts follow a standard format and are always written in Courier (typewriter) font, including but not limited to; being easy to read by actors, plenty of space for notes, and the general rule that one page of a script (in this format) equals approximately one minute of screen time.] KIT Scenarist , in my opinion, is one of the best script writing apps out there for ease of use, simp...