
Revisiting Replica Studios - The Most Natural A.I. Text To Speech Based on Real Human Voices

Replica A.I. Voices are great for in-game character voices.

I first trialed Replica Studios' A.I. Voice Actor Text To Speech (TTS) voices shortly after they launched about a year ago and was suitably impressed by how natural they sounded. Some voices were better than others, but most sounded very close to human, with the right inflections at the appropriate moments. Since then they have been working on improvements and adding more voices to their voice cast library.

I have used Replica Studios on projects in the past, most notably on my 2021 Reallusion Lip Sync Contest entry featuring my character, Bat Storm.

Mia and the Tourist and their R2-D2 replica droid.

This time I've decided to use Replica to cast the other two voices in my upcoming Mia and the Tourist animated short. I've been blogging the process of creating it by demonstrating the apps I'm using to make it.

So far I've covered scriptwriting, storyboarding, background creation, and character rigging.

When I say the other two voices, I'm referring to Mia and the Car Rental Manager (see my post on scriptwriting if you want more details of this animated short). The Tourist is loosely based on myself, so I typically use my normal speaking voice for this character.

Generating Replica Voices

Replica itself is a desktop app that connects to their server. Once installed and logged in, you can begin creating voices right away. If you're using iClone, Unreal Engine, or Unity, there are integrations available that let you use Replica directly from those apps.

Initially you start with 30 minutes of free credit and you can buy more from the app as needed. You can also earn credit by referring friends to the app with your unique link (as I've done with my link at the beginning of this article).

There are two ways to proceed in the app: the Sandbox, where you can generate individual lines of dialogue and download them as needed, or Projects, where you can enter an entire script's worth of dialogue and download it all at once. Unlike the Sandbox, Projects lets you save multiple projects too.

The Sandbox

Replica Studios' Sandbox interface. Change attributes like Speech Rate, Pitch, and the Style.

The Sandbox is about as simple as it gets. Select a voice, type your dialogue, and click Play to preview. Select any part of the text to make further adjustments like adding silence, adjusting the volume, or changing the speech rate, pitch, or style (when options are available).

Saving a take will count towards your time credit so it's best to save only those takes you intend to download.

The sandbox will let you save takes with different character voices and dialogue so it is possible to generate all the voices you might need for a short project here.

When you have all your takes, just download each one individually with its download button.

Projects

Replica Projects Section helps keep your script organised and easy to enter.

The Project section is a new addition to the app since I first reviewed it. At the individual line level it works the same as the sandbox but above that it allows you to pick your character voices and assign them to characters in your script. You can then enter your script broken up into its scenes.

The interface for entering dialogue is more of a list so you can work quickly, see an overview of key settings and make changes to text and character voices without having to get down to individual line level (you only need to go that deep if you want to play around with volume, speech rate, and pitch on individual words and phrases).

Just be aware that if you preview the same line of dialogue multiple times, it does start using up your credit. So keep an eye on this if your credit is running low.
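If you want a rough idea of how far your credit will stretch before you start previewing, a quick back-of-the-envelope calculation can help. The Python sketch below is only an estimate under my own assumptions: that every preview or saved take costs roughly the length of the audio it produces, that speech runs at about 150 words per minute, and that you preview each line twice. None of these figures come from Replica, and the function name is hypothetical.

```python
def estimate_credit_seconds(lines, words_per_minute=150, previews_per_line=2):
    """Estimate the seconds of credit a list of dialogue lines might use.

    Assumption (not Replica's published rule): each preview or saved take
    is charged at roughly the duration of the audio it generates.
    """
    total = 0.0
    for text in lines:
        words = len(text.split())
        # Convert the word count to an approximate spoken duration in seconds.
        seconds = words / words_per_minute * 60
        total += seconds * previews_per_line
    return total


if __name__ == "__main__":
    script = [
        "Welcome to the car rental desk.",
        "Do you have a booking with us today?",
    ]
    used = estimate_credit_seconds(script)
    free_credit = 30 * 60  # the 30 minutes of free credit, in seconds
    print(f"Estimated use: {used:.1f}s of {free_credit}s")
```

Adjust the speaking rate and preview count to match how you actually work. The real figure in the app is the one to trust.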

Other useful features include the ability to move lines of dialogue around, and you can preview an entire scene with the play button at the bottom of the scene window.

Once you have all your dialogue entered, you can download each line individually (which is more useful if you want to change a line of dialogue later) or download an entire scene at a time as a zip file in WAV, MP3, OGG, or FLAC format. Each audio file is named after the line number in the scene.
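Because the files are named by line number, it's worth unpacking a scene zip so the clips stay in script order when you import them elsewhere. Here's a minimal Python sketch of one way to do that. I'm assuming file names along the lines of 1.wav, 2.wav, 10.wav, which is my guess at the pattern and not something I've confirmed, and the function names are hypothetical.

```python
import re
import zipfile
from pathlib import Path

# The four formats Replica offers for scene downloads.
AUDIO_EXTENSIONS = {".wav", ".mp3", ".ogg", ".flac"}


def line_number(path: Path) -> int:
    """Return the first integer in the file name, or -1 if there is none."""
    match = re.search(r"\d+", path.stem)
    return int(match.group()) if match else -1


def extract_scene(zip_path: str, out_dir: str) -> list[Path]:
    """Unpack a scene zip and return its audio files in script order."""
    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target)
    audio = [p for p in target.rglob("*") if p.suffix.lower() in AUDIO_EXTENSIONS]
    # Sort numerically so "10.wav" comes after "2.wav", not before it.
    return sorted(audio, key=line_number)
```

Calling extract_scene("scene1.zip", "audio/scene1") would give you a list of paths you can loop over in order. The numeric sort matters because a plain alphabetical sort would put line 10 ahead of line 2.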

Replica's Voice Library

New voices are being added to Replica all the time. You can browse the library and check out each voice with the pre-recorded samples just by going to the voice library section of the app. However it's actually easier to browse from the sandbox where, when you go to select a voice, you can filter the available voices with search options like Gender, Age, and Accent etc.

Like any Text To Speech system there will always be some voices better than others. Personally I think Replica still has some way to go before their voices will pass as human in every situation but many of them are good enough to use in situations that don't involve too many emotional extremes.

Adding the Voices to my Storyboard Animatic

As mentioned earlier I used Replica to voice two of my characters. The third is being voiced by me.

I recorded my lines using Audacity. I won't be covering that as part of this series, but if you do want some insight into how I batch export my dialogue from Audacity into individual audio files for each line of dialogue, check out my article, A Better Way to Export Clips From a Single Audio Dialogue Track in Audacity.

For the purpose of letting you hear my generated Replica voices I imported my audio into Storyboarder and exported an animatic of my rough sketches (below). I wouldn't usually do this. Normally I'd make an animatic using my characters and sets in Cartoon Animator but now, having seen this, I'm glad I did.

As you can see (and hear), my Replica voices aren't as successful as I'd like. Mia's voice is kind of okay but has some issues with volume and pitch. The Car Rental Manager is all over the place. You can easily tell his voice is text to speech. I may have to try something else for him.

My own voice is fine, I guess, given the Tourist only has three very short lines.

Overall I think Replica's voices are coming along. While there is a good selection of voices not all of them work well as character voices. I really struggled to find a voice for Mia that I liked. The one you hear in the animatic is as close as I could get to what I imagined but it still sounds too youthful.

The next step for my animation is to start putting it all together in Cartoon Animator.
