Skip to main content

AI 2D Animation Challenges: Character Consistency and Multiple Characters Speaking in the Same Shot

Two Space Cargo Pilots in a Space Port Bar.
An unlikely pair of Space Cargo Pilots.


A
nimation, specifically 2D animation using generative AI has really started to evolve into a more useful tool for creators however, character consistency across camera shots, within the same scene, is still an issue.

As well, another issue I've bumped up against, as someone who makes very dialogue heavy animations, is having more than one character talk within the same camera shot. 

There are a few all-in-one generative AI services that include effective tools to overcome these two issues, such as OpenArt.ai, but I wanted to see what I could achieve with completely free tools.  

For image generation I went with Google Gemini because you get limited free access to one of the best image generation models, Nano Banana. 

For video generation I went with Grok Imagine because it's very good with 2D animated characters and includes audio and lip sync. The output on the free plan is only  480p and tastefully watermarked in the lower right corner but it's good enough for experimenting. (Note that as of March 19, 2026 Grok has changed Imagine to a three day free trial and doesn't seem to let you generate anything unless you sign up to the trial).

Character Consistency

Google's Nano Banana is actually quite good with character consistency, even if you only use a single image of your character as a reference. That said creating a character reference sheet showing different angles definitely will help keep things more consistent.
 
AI Samson is a YouTuber I'm subscribed to, who helps to keep me informed about all AI developments in the creative arts, created a video on character consistency that works across different AIs. In it he demonstrates a prompt he developed (and gives away for free) that generates a character sheet of any character you can use as a reference for any AI that let's you upload images.
 

Using AI Samson's prompt, I modified it to generate a character sheet for my character, a space cargo pilot, that I generated with Gemini moments earlier in the same chat thread.  As you can see my pilot (who clearly doesn't get the well paid cargo runs) looks exactly like the same person across all images, including his outfit.
 
Space Cargo Pilot Character sheet generated in Gemini Nano Banana using AI Samson's prompt.
Space Cargo Pilot Character sheet generated in 
Gemini Nano Banana using AI Samson's prompt.

 

My idea was to have my space cargo pilot be sitting in a run down space port bar talking to some kind of tentacle headed alien pilot. Rather than create an alien character first, I simply prompted Gemini for the whole scene, describing what I thought the alien should look like in that prompt. This is what I got...

My Space Cargo Pilot sitting in a bar talking to a tentacle headed alien pilot.
My space cargo pilot, sitting in a space port bar, talking to a tentacle 
headed alien pilot. This was the very first output to my prompt for Gemini.

The image was pretty much what I wanted except, in my mind, the alien might have been more tentacle headed with less body but I didn't want spend hours on getting the character exactly right for a demo.

I was interested to see that, now I had my alien, could I generate a character sheet for him, to help keep him consistent in the future. I asked Gemini to make one using the same format as the one I used for my pilot - I didn't even have to re-enter my character sheet prompt. Here is what I got...

Alien Space Cargo Pilot character turnaround sheet.
Gemini generated this complete character turnaround of the alien character 
just based on the character it generated in the image of my scene.

My prompt didn't ask for any text so it's interesting that Gemini felt the need to not only label certain features but also give the species of alien a name.

The one thing I didn't want to compromise on was the composition of my scene. I wanted both characters sitting next to each other at the bar as if you were viewing them from the point of view of a bartender. It took me a few prompts to get the composition right but Gemini got there in the end...

The two space cargo pilots sitting at the bar facing the camera.
This was the final image I used for the animation, except I did crop it so 
you could no longer see each character's legs.

All of these images were generated in the same chat thread so I wasn't really using those reference sheets directly in my prompts to maintain character consistency. I'd just say, as part of the prompt, to maintain character consistency, and it worked relatively well, though I do think my human pilot aged up a little.

 Multiple Characters Speaking in the Same Shot

 Having both characters talking in the same scene was surprisingly easy with Grok. I was able to 'freestyle' a conversation using simple prompts that described what was happening in the scene and who I wanted to say specific dialogue.

On the free version of Grok you only get six second clips so I simply prompted each part of the conversation in a sort of order, creating another six second clip each time. 

I didn't actually prompt for specific dialogue unless I needed a character to say something to lead the conversation forward... or to end it like I did in the clip prompt screen shot shown below. 

An example screen shot of my final prompt in Grok Imagine.
My final conversation ending prompt where the alien excuses 
himself in need of the bathroom.
 
Grok chose the voice for each character and seemed to maintain the same voice for each across multiple clips. This may be because everything was generated within the same conversation thread. 
 
Were I to start a new project with these characters they may end up with different voices. If that was the case I would consider voice replacement with ElevenLabs.

The Final Edit

After generating nine different clips using the same image of my two characters sitting in a bar I brought all the clips into my video editor (Shotcut). I edited them together so the whole thing wasn't just one long scene with jump cuts. The final edit is below followed by all the source clips so you can see the output Grok Imagine generated.
 
A Space Pilot Walks Into a Bar... Gemini, Grok Demo 
 
 
The final scene is nothing particularly spectacular but it does demonstrate that it is possible to animate conversations with multiple characters in a scene. It may be somewhat harder if you were sticking to a rigid script but it is fun that you can kind of freestyle a conversation together that allows for unexpected ad-libs, as well as scripted, key moments.
 
To some degree, it really feels like you're directing actors, who aren't afraid to go off script, rather than getting too frustrated because they're not saying your words exactly. For me this was a very fun bit of experimentation.
 
 
o---o--- ---o--- o---

Did you find this article useful?
Subscribe to my newsletter and get the
latest articles delivered to your inbox.

Comments

Popular posts from this blog

Eight 2D Animation Apps For Your Phone or Tablet Mobile Device

M obile productivity apps have become so capable that they can be great alternatives to their PC/MAC equivalents or serve as great tools in their own right when you're away from your desk. While some apps simply mimic their desktop counterparts, others offer well thought out, touch-friendly interfaces that are easier and more fun to use. Every so often I check out what's available for 2D animation for Android devices, since that's what I use, that can complement my workflow with Reallusion's Cartoon Animator 5. Some may be available for Apple devices as well. Below I've listed six free (F) apps (with optional paid (P) upgrades) on the Google Play Store that you might want to explore. Some are just fun apps on their own while others may be useful as part of your workflow on bigger animation projects. Not all are exclusively animation apps and could be used on any production. JotterPad (F/P) The name JotterPad makes this sound like a notepad application but it's ...

The Family Guy Method - Animating Talking Hand Gestures in Cartoon Animator

Once you start getting into character animation you learn pretty quickly that people don't just speak with their mouths. Hand gestures and movements play a pretty important part of how people communicate too. The problem is, animating hand gestures and movements is extremely time consuming... and who knows what gestures and movements should be used and when? In Reallusion's Cartoon Animator I use pre-animated talking character motions that I chop and move gestures around so the arm and hand movements 'feel' right based on my own understanding of body language (and I also act out dialogue to get a sense of what arm and hand movements I might make with what's being spoken). Recently I came across a video by the creator of Culpamland Extra , an online animated series, in which they briefly outlined how they animate talking using the Family Guy Method. I'd never heard of this, and if you try to search for it online you'll be hard pressed to find anything. So I...

Review: Animaker - 10X Better than other Online Animation Video Making software (#DIY)... or is it?

Animaker's bold claim, right on its homepage is that it's  10X Better than other Online Animation Video Making software (#DIY). Also featured on their homepage is a cool promotional video that's dynamic, full of charming lip synced characters, with high quality animation that matches perfectly to the story being told. If I could make anything even half as good with their studio, I'll at least buy that they're better than most of their competitors. Let's see if they live up to their tagline 'Animated Videos, Done Right!' Animaker is a flash based, cloud animation studio application that gives you access to an entire library of thousands of characters, props, backgrounds, sounds and more, to create almost any kind of 2D animated video. In fact they make the bold claim that theirs is the largest animated library in the world of any similar online application (it's not... or if it actually is, it's not as versatile as other comparable librari...

Inochi2D - Free Open Source 2D VTuber Avatar Rigging and Puppeteering Software (Part 1)

Inochi2D Creator - Free Open Source VTuber Software. If you've been looking for a way to live perform as a 2D cartoon avatar on camera, whether it be for a live stream or for pre-recorded content like educational videos, then VTuber software is a low cost (or even no cost) option worth looking into. In my previous post, How to Become a VTuber - 2D and 3D Software for Creating and Controlling Your Avatar , I took a brief look at the relatively new but completely free and open source Inochi2D  which I thought showed great potential for my own needs of creating a live performance character rig for my own TET Avatar that I use for all my promotional materials. While it is possible to live perform my character using Cartoon Animator itself, Reallusion's MotionLive2D capture system isn't great - with lip sync in particular. More importantly though, I can't exactly teach people how to use Cartoon Animator if I'm using Cartoon Animator to control my Avatar. What is Inochi2D...

Inochi2D - Free Open Source 2D VTuber Avatar Rigging and Puppeteering Software (Part 2 - Inochi2D Session)

In part one of my deep dive into the free VTuber software, Inochi2D , I focused mainly on Inochi2D Creator, which is used for rigging your character avatar in the correct file format for use with Inochi2D Session, the puppeteering part of the software. The two sides of the software are still very much in development and the documentation, particularly for Session, is very thin on the ground. To the point where I don't think I could even do a comprehensive tutorial because I'm not sure I'm even doing things right, and the software could change significantly in a single update. As a result, in this part of my Inochi2D deep dive I'm changing tact from presenting my finished Cartoon Animator TET Avatar, and will be summarizing my experience of getting Session up and running using OpenSeeFace as the recommended webcam motion capture software. To do this I will be using  the TET avatar I created in my review of Mannequin , since that can be exported as a full, ready to go r...

Software Review: CrazyTalk 8 Pipeline by Reallusion

Reallusion released the newest version of its CrazyTalk software during December 2015 and I was fortunate enough to get my hands on a free copy CrazyTalk 8 Pipeline version after buying a truckload of Reallusion content in order to get the special deal. If you're not familiar with CrazyTalk it's basically specialized software for creating and animating talking heads quickly. CrazyTalk 8 extends the original capability of creating a semi 3D head from a 2D image to creating true 3D heads that can be imported into iClone and Reallusion's new 3D Character Creator (Pipeline version only).

The Ultimate Independent Animator's App and Resource List - Animation and Video Life

Image created with Cartoon Animator 4. Being an independent animator is not like a studio animation job. There's so much more to do that is indirectly related to the actual task of animating. Over the years I've sought out many apps, tools, and services that can help me achieve that one single task, expressing myself through animation. Below is my Ultimate Independent Animator's Resource List for 2024 (last updated Oct 2024). It started out as a list of free or low cost apps that could help you in every stage of producing either 2D or 3D animation, and then just kind of grew from there. You may not have been looking for a Time Management App as much as you needed something to get you started in 3D animation but when those commissioned projects start coming in you'll have a head start on maximizing your time. All the apps and services on this list had to meet two main criteria: They had to be useful and relevant to an Indy Animator/artist. The base app/se...