Skip to main content

Creating an AI Digital Avatar and Voice Clone of Myself with Free and Low Cost AI Tools

AI Image of TET, wearing clothes he doesn't own, in an office that doesn't exist, generated from an AI model trained on photos of TET.

Over the years I've looked at various ways of creating and animating a digital avatar, from simply creating a character for Cartoon Animator and voicing and animating it myself, to creating a live motion capture ready Vtuber avatar puppeted and voiced by me in real-time.

In the last year or so, making photographic images talk, using AI and AI voice cloning has really progressed. To the point where I wondered if I could create a photographic AI avatar of myself, complete with my cloned voice, that I might use on some of my videos?


Creating My Avatar: Artflow

To create my avatar image, as far as I'm concerned, Artflow.ai is the best value AI site for creating consistent digital characters (or 'actors' as Artflow calls them). That's pretty much their entire focus.

Artflow's avatar model training user interface.
Artflow's actor model training user interface.

You get your first actor for free, 100 free credits per month (which equates to 100 still images per month - get an additional 50 credits if you sign up with my link), 4 minutes of video from their Story Studio (put your characters into video stories), and 2 minutes of video from their Video Studio (kind of like Story Studio but you can add dialogue and lip sync your characters to it).

As well you can buy additional actors for a small, one time fee, and put multiple actors into the same image using their actor director mode. The only negative for me is you need a credit or debit card.

Training a model takes a minimum of five images but I would recommend using as many as the site will let you if you can.

Once trained, Artflow will show you a selection of sample images with your actor in different situations. Text prompt any situation or use one of Artflow's templates to get you started. It's pretty addictive, especially if your first actor is of yourself or someone you know. It's just fun imagining different scenes to see what you get.


Voice Cloning: ElevenLabs/Speechify

I looked around for a good, free voice cloner and couldn't find any that actually delivered a voice that sounded anything close to mine. Most sounded like bad text to speech (TTS) voices from 5-8 years ago.

If you really don't want to spend any money then Speechify is the only site I came across that would let you train the AI on your own voice for free and create a quality model. They give you 1000 characters of TTS to try out your trained model. Use them wisely because once they're gone, there's no topping up next month, they're gone!

ElevenLabs Voice Training UI.
ElevenLabs Voice Training UI.
It really is this simple.

Unfortunately Speechify is very pricy to pay by the month, but almost a third of the monthly price if you're prepared to pay annually.

ElevenLabs on the other hand, while you can't train a model on your voice for free, is extremely cheap for a single month of their Starter plan (again credit/debit card or bank transfer only). Currently they're considered the Gold Standard in AI voices.

I decided to take the gamble and paid for a month on the Starter plan - totally worth it! I uploaded less than two minutes of sample audio and got a near perfect copy of my voice in seconds, and now I have 40,000 characters of TTS to burn through.


Animating My Avatar: Hedra

Artflow's Video Studio actually does have the ability to make your images talk but not in a completely convincing way. You'll get some head movement but mostly just mouth movement. On the plus side you can make more than one character speak in a single image.

Hedra, which is currently free to use, by comparison, is specifically designed to animate an avatar head in a more natural way. It's not perfect but in short bursts, it almost passes for actual video footage of the person. It's able to add not just lip syncing but head and upper body movement too.

Unfortunately Hedra doesn't really give you any control over the output. You get what you get. But the result is generally pretty good. It does limit you to thirty seconds of video each generation because by about that point the magic is starting to fall apart just a little.

Hedra's UI.
Hedra is a three step process. 1. Upload your audio.
2. Upload your image (or prompt for a character). 3. Click Generate video.

Hedra also tends to crop relatively close to your image's face. If you wanted the full image, there's no way to tell Hedra. Probably because Hedra's animation is centered mostly around the head and shoulders.


The Finished Clone Video

You can see my finished demonstration video of my cloned avatar below. 

I didn't want my finished video to be all one camera shot so I made an attempt to stitch Hedra's cropped video image back into the full image of me in an office that doesn't exist outside of a computer (as far as I know).

You can see the wide shots don't exactly work if you look too closely. I'm sure I could make it seamless  with a little more work. However if you look at the close up shot, where I've stitched the sides of the image back into the shot so it fills the frame, that section is pretty convincing that it could be the real me.

For the most part I think it's almost good enough to fool people into thinking it really is me. Though I will say, I don't move my head around nearly that much in my videos. It would be nice if Hedra would let you tone down the head movement a little.

What do you think? It may or may not be convincing now but just remember the AI mantra "This is as bad as AI will ever be. It's only going to get better from here," and I think this is pretty good.


o---o--- ---o--- o---

Did you find this article useful?
Subscribe to my newsletter and get the
latest articles delivered to your inbox.

Comments

Post a Comment

This blog is monitored by a real human. Generic or unrelated spam comments with links to sites of dubious relativity may be DELETED.

I welcome, read, and respond to genuine comments relating to each post. If your comment isn't that save me some time by not posting it.

Popular posts from this blog

Eight 2D Animation Apps For Your Phone or Tablet Mobile Device

M obile productivity apps have become so capable that they can be great alternatives to their PC/MAC equivalents or serve as great tools in their own right when you're away from your desk. While some apps simply mimic their desktop counterparts, others offer well thought out, touch-friendly interfaces that are easier and more fun to use. Every so often I check out what's available for 2D animation for Android devices, since that's what I use, that can complement my workflow with Reallusion's Cartoon Animator 5. Some may be available for Apple devices as well. Below I've listed six free (F) apps (with optional paid (P) upgrades) on the Google Play Store that you might want to explore. Some are just fun apps on their own while others may be useful as part of your workflow on bigger animation projects. Not all are exclusively animation apps and could be used on any production. JotterPad (F/P) The name JotterPad makes this sound like a notepad application but it's ...

Inochi2D - Free Open Source 2D VTuber Avatar Rigging and Puppeteering Software (Part 1)

Inochi2D Creator - Free Open Source VTuber Software. If you've been looking for a way to live perform as a 2D cartoon avatar on camera, whether it be for a live stream or for pre-recorded content like educational videos, then VTuber software is a low cost (or even no cost) option worth looking into. In my previous post, How to Become a VTuber - 2D and 3D Software for Creating and Controlling Your Avatar , I took a brief look at the relatively new but completely free and open source Inochi2D  which I thought showed great potential for my own needs of creating a live performance character rig for my own TET Avatar that I use for all my promotional materials. While it is possible to live perform my character using Cartoon Animator itself, Reallusion's MotionLive2D capture system isn't great - with lip sync in particular. More importantly though, I can't exactly teach people how to use Cartoon Animator if I'm using Cartoon Animator to control my Avatar. What is Inochi2D...

LTX Studio (Beta): AI-Powered Visual Storytelling, From Script to Screen in One App.

LTX Studio can generate consistent characters across storyboard panels - even if one character is a dragon! W hile text to image, and text to video (and image to video) AI tend to be getting a lot of the press, the real exciting aspect of generative AI implementation is how it can be used to speed up creator workflow. Being able to realize your creative vision in a shorter length of time can lead to more ambitious projects. Particularly if you're a team of one, with a very limited budget, but you one day dream of creating your own epic animated feature film. LTX Studio (beta), a new 'all-in-one' AI film making tool, is not going to let you realize that dream from a single text prompt but, by bringing a bunch of generative AI technologies together, the developers have created a one platform workflow that can help anyone rapidly visualize and deliver a story from initial idea to finished film in days rather than weeks (depending upon how ambitious the project is). Even bette...

XP-Pen Artist 12 (2nd Gen) Pen Display Drawing Tablet Review - Portable and Robust Quality Ideal for Sketching on the Go!

XP-Pen's Artist 12 (2nd Gen) Pen Display Tablet. I've been looking for a more portable drawing solution for a while to work with my Samsung Galaxy Tab A, 8 inch, Android tablet, which is why, when XP-Pen invited me to collaborate on an animation project, I asked about trying their Artist 12 (2nd Gen) Pen Display Drawing Tablet . Does It Really Work With Android Devices? Having heard many of XP-Pen's mobile drawing displays could be connected to Android devices I, incorrectly, assumed wide compatibility. Unfortunately this isn't the case. There is a list of specific Android devices that work with XP-Pen's Pen Display tablets and none of them are any of the three Samsung devices I own. XP-Pen could definitely improve the compatibility of their displays with more Android devices if they're going to promote that as a feature (or make it more clear to check their device compatibility list before you buy). Also note the additional USB-C to USB-C video cable, needed to...

Tokkingheads - Make Anyone's Head Shot Talk with Artificial Intelligence

I'm increasingly fascinated by how artificial intelligence systems are being incorporated into more creative applications like visual effects, A.I. generated art, and particularly the development of human sounding voices that can interpret dialogue with more human intonation. Tokkingheads , by Rosebud AI is an interesting application available as a mobile app in both Apple and Play stores, as well as a browser based desktop version.  The simple premise is to upload a headshot image of any person (or use one of theirs), record yourself speaking anything, and then the A.I. will work out how to animate your image saying those words. There's the additional option of filming yourself speaking those words (or you can use one of their videos) and the A.I. will add the movement of your head and face into the mix to 'puppet' your image. The final animation is kind of like a budget light, deep fake video, except this was created in seconds and is relatively impressive with the ...

Review: Animaker - 10X Better than other Online Animation Video Making software (#DIY)... or is it?

Animaker's bold claim, right on its homepage is that it's  10X Better than other Online Animation Video Making software (#DIY). Also featured on their homepage is a cool promotional video that's dynamic, full of charming lip synced characters, with high quality animation that matches perfectly to the story being told. If I could make anything even half as good with their studio, I'll at least buy that they're better than most of their competitors. Let's see if they live up to their tagline 'Animated Videos, Done Right!' Animaker is a flash based, cloud animation studio application that gives you access to an entire library of thousands of characters, props, backgrounds, sounds and more, to create almost any kind of 2D animated video. In fact they make the bold claim that theirs is the largest animated library in the world of any similar online application (it's not... or if it actually is, it's not as versatile as other comparable librari...

The Ultimate Independent Animator's App and Resource List - Animation and Video Life

Image created with Cartoon Animator 4. Being an independent animator is not like a studio animation job. There's so much more to do that is indirectly related to the actual task of animating. Over the years I've sought out many apps, tools, and services that can help me achieve that one single task, expressing myself through animation. Below is my Ultimate Independent Animator's Resource List for 2024 (last updated Oct 2024). It started out as a list of free or low cost apps that could help you in every stage of producing either 2D or 3D animation, and then just kind of grew from there. You may not have been looking for a Time Management App as much as you needed something to get you started in 3D animation but when those commissioned projects start coming in you'll have a head start on maximizing your time. All the apps and services on this list had to meet two main criteria: They had to be useful and relevant to an Indy Animator/artist. The base app/se...