T2IG AI Studio — Feature Guide

Everything you need to know about creating AI-generated images, videos, and avatar content.

Table of Contents
1. Studio Tabs
2. Image Generation
3. Video Generation
4. Avatar Videos
5. Avatar Backgrounds
6. Avatar Style & Positioning
7. Voice Selection & Emotion
8. Text Overlay
9. Captions, Test Mode & Matting
10. Smart Video Pipeline
11. Text-to-Speech
12. Formats & Aspect Ratios
13. Credits & Costs
14. Pro Tips

1. Studio Tabs

The studio has five main tabs across the top. Each one opens a different creative tool:

Tab | What It Does | Cost
Image | Generate images from text prompts using Flux, SDXL, and other AI models | 5 credits
Video | Generate short videos from text or images using Kling, Runway, etc. | 10 credits
Avatar | Create talking head videos with AI avatars or your own photos | 15 credits
Smart Video | Fully automated video pipeline: scripting, visuals, narration, music | 25 credits
Voice | Text-to-speech with 2,000+ voices from HeyGen, ElevenLabs, Azure | 3 credits

2. Image Generation

Type a description of the image you want. Use the Inspire button for random creative prompts, or Enhance to improve your prompt with AI. Choose your model, style, and aspect ratio in the settings panel below the prompt.

Tip: Be specific about what you want. "A photorealistic portrait of a woman in golden hour light, shallow depth of field" works better than "woman photo".

3. Video Generation

Describe the video scene you want, or start from an image. Select your video model (Kling, Runway, etc.) and aspect ratio. Videos typically take 1–3 minutes to render.

You can also use the Create Video button on any image card in your feed to animate an existing image.

4. Avatar Videos

The Avatar tab has two sub-tabs:

Preset Avatars

Choose from 1,200+ professional AI avatars. Use the search bar and gender filter to find the right one. Hover over any avatar to see a large preview with video animation.

Photo to Talking

Choose from 5,800+ stock talking photos, or upload your own image. Your photo will be animated to speak your script with realistic lip-sync.

Script limits: Preset avatars support up to 5,000 characters. Custom photo avatars (AV4) are limited to 630 characters.
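
If you prepare scripts outside the studio, a quick length check can catch an over-long script before you paste it in. The sketch below is illustrative only: the two limits come from this guide, but the function and key names are hypothetical, and it assumes the studio counts characters the same way Python's len() does, which this guide does not confirm.

```python
# Limits taken from this guide; names are illustrative, not part of T2IG.
SCRIPT_LIMITS = {
    "preset": 5000,       # preset avatars
    "custom_photo": 630,  # custom photo avatars (AV4)
}


def check_script(script: str, avatar_kind: str) -> tuple[bool, int]:
    """Return (fits, characters_over_limit) for the given avatar kind."""
    limit = SCRIPT_LIMITS[avatar_kind]
    over = max(0, len(script) - limit)
    return over == 0, over


if __name__ == "__main__":
    fits, over = check_script("Hello and welcome to the demo.", "custom_photo")
    print(fits, over)  # True 0
```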

5. Avatar Backgrounds

The background section has three modes, accessible via tabs:

Color

Pick from 8 preset colors: Dark, Green Screen, White, Navy, Purple, Midnight, Cream, and Red. Use the color picker circle for any custom color. Green Screen (#00b140) is useful if you plan to composite the avatar onto other footage later.

Image

Switch to the Image tab and paste a direct URL to any image. The image will be placed behind your avatar. Supports .jpg, .png, and other standard image formats.

You can also paste a video URL (.mp4, .webm, .mov) — the system will automatically detect it and loop the video behind your avatar.
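
This guide does not say how the studio tells a video URL from an image URL. One plausible approach, shown here purely as an assumption, is to look at the file extension in the URL path. The function name is hypothetical; the three extensions are the ones listed above.

```python
import os
from urllib.parse import urlparse

# Video extensions named in this guide.
VIDEO_EXTENSIONS = {".mp4", ".webm", ".mov"}


def background_kind(url: str) -> str:
    """Classify a background URL as 'video' or 'image' by its path extension.

    Assumption: detection is extension-based. Query strings are ignored
    because only the URL path is inspected.
    """
    path = urlparse(url).path
    ext = os.path.splitext(path)[1].lower()
    return "video" if ext in VIDEO_EXTENSIONS else "image"


if __name__ == "__main__":
    print(background_kind("https://example.com/clips/loop.mp4"))  # video
    print(background_kind("https://example.com/bg/office.png"))   # image
```

A check like this is only a heuristic: a URL with no extension would be treated as an image, so a direct link that ends in the file name is the safer choice.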

Search

Switch to the Search tab to find professional backgrounds from Pexels. Use the quick-tag buttons (Office, Nature, Studio, City, Abstract, Gradient) or type your own search. Click any photo to select it as your background.

How it works: When you select a Pexels image or paste a URL, it overrides the solid color. To go back to a solid color, click any color preset on the Color tab — this clears the image URL.
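
The override rule can be pictured as a small piece of state. This is a minimal sketch of the behavior described above, not the studio's actual code: the class and method names are hypothetical, and the default color is arbitrary (it uses the Green Screen hex value because that is the only one this guide gives).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class BackgroundState:
    """Hypothetical model of the background picker."""
    color: str = "#00b140"           # Green Screen hex from this guide
    image_url: Optional[str] = None  # set by the Image or Search tab

    def select_image(self, url: str) -> None:
        # A pasted URL or a Pexels pick overrides the solid color.
        self.image_url = url

    def select_color(self, hex_color: str) -> None:
        # Clicking a color preset clears the image URL.
        self.color = hex_color
        self.image_url = None

    def effective(self) -> tuple[str, str]:
        """Return what would actually be rendered behind the avatar."""
        if self.image_url:
            return ("image", self.image_url)
        return ("color", self.color)


if __name__ == "__main__":
    bg = BackgroundState()
    bg.select_image("https://example.com/office.jpg")
    print(bg.effective())  # ('image', 'https://example.com/office.jpg')
    bg.select_color("#ffffff")
    print(bg.effective())  # ('color', '#ffffff')
```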

6. Avatar Style & Positioning

Style | Description
Full Body | Default; shows the avatar from the waist up with natural body language
Close-Up | Zoomed in on the face and shoulders; great for personal, direct-to-camera content
Circle | Avatar appears in a circular frame; ideal for overlaying on presentations or web content

For portrait (9:16) format, the avatar is automatically scaled up by 25% to fill the vertical frame, so it doesn't appear small and floating.
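
As a worked example of the portrait adjustment: a 25% scale-up means multiplying the avatar's base scale by 1.25. The helper below is a hypothetical sketch; only the 25% figure and the 9:16 trigger come from this guide.

```python
PORTRAIT_SCALE = 1.25  # "scaled up by 25%", per this guide


def avatar_scale(aspect_ratio: str, base_scale: float = 1.0) -> float:
    """Return the avatar scale factor for a given aspect ratio.

    Only portrait (9:16) output gets the extra 25%; other formats keep
    the base scale unchanged.
    """
    if aspect_ratio == "9:16":
        return base_scale * PORTRAIT_SCALE
    return base_scale


if __name__ == "__main__":
    print(avatar_scale("9:16"))  # 1.25
    print(avatar_scale("16:9"))  # 1.0
```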

7. Voice Selection & Emotion

The voice picker shows only HeyGen-compatible voices. Use the search bar, language dropdown, and gender filter to find the right voice. Click the play button on any voice card to preview it.

Voice Emotion

Add emotional tone to the AI voice. Not all voices support all emotions — if unsupported, the voice will fall back to its natural tone.

Emotion | Best For
Natural | Default neutral delivery; works for everything
Excited | Product launches, announcements, high-energy content
Friendly | Onboarding, tutorials, customer-facing content
Serious | News, reports, corporate announcements
Soothing | Meditation, wellness, ASMR-style content
Broadcaster | News anchor style: polished and authoritative
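
The fallback rule for unsupported emotions can be sketched as follows. This is an illustration under stated assumptions: the function name and the idea of a per-voice "supported" set are hypothetical, and the sketch treats Natural as the voice's natural tone. The six emotion names are the ones in the table above.

```python
EMOTIONS = ("Natural", "Excited", "Friendly", "Serious", "Soothing", "Broadcaster")


def resolve_emotion(requested: str, supported: set[str]) -> str:
    """Return the emotion that would actually be used for a voice.

    If the voice does not support the requested emotion, fall back to
    the natural tone, as this guide describes.
    """
    if requested not in EMOTIONS:
        raise ValueError(f"unknown emotion: {requested!r}")
    return requested if requested in supported else "Natural"


if __name__ == "__main__":
    voice_supports = {"Natural", "Friendly"}  # made-up example voice
    print(resolve_emotion("Friendly", voice_supports))  # Friendly
    print(resolve_emotion("Soothing", voice_supports))  # Natural
```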

8. Text Overlay

Burn a title or subtitle directly into the video. Type your text in the overlay field, then configure:

Setting | Options
Position | Top, Center, or Bottom of the frame
Size | Small (20pt), Medium (28pt), Large (40pt), XL (56pt)
Bold | Toggle bold weight on/off

Note: Text overlay is white on a semi-transparent background. For best readability, use darker backgrounds or position the text where the avatar isn't standing.

9. Captions, Test Mode & Matting

Captions

Check the Captions box to automatically burn subtitles into the video. HeyGen generates captions from your script and synchronizes them with the audio. This is rendered server-side — no additional processing needed.

Remove Photo BG (Matting)

Check Remove Photo BG to strip the original background from the talking photo or avatar. This is useful when combining with a custom background — it prevents the avatar's original background from showing through.

Test Mode

Check Test Mode to generate a free, low-resolution preview. This does NOT consume credits. Use it to test avatar + voice combinations, check background placement, or preview text overlays before committing to a full render.

Workflow tip: Use Test Mode to dial in your settings (avatar, voice, background, text), then uncheck it for the final high-quality render.

10. Smart Video Pipeline

The Smart Video tab is a fully automated pipeline that takes a topic, URL, or raw text and produces a complete video with multiple scenes, AI-generated visuals, voiceover narration, and background music.

Choose a directive style (Explainer, Viral, Story, Product, Listicle, News), set the number of scenes, write your prompt or paste a URL, and hit generate. The alien spinner pipeline will show real-time progress through each stage.

11. Text-to-Speech

The Voice tab gives you access to 2,000+ voices from HeyGen, ElevenLabs, and Azure. Type your text, select a voice, and generate an audio file. Voices can be filtered by language, gender, and provider.

12. Formats & Aspect Ratios

Format | Resolution | Best For
Landscape (16:9) | 1280 × 720 | YouTube, presentations, websites
Portrait (9:16) | 720 × 1280 | TikTok, Instagram Reels, YouTube Shorts
Square (1:1) | 1080 × 1080 | Instagram posts, LinkedIn, social ads
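
If you prepare background images or footage to match these formats, it can help to confirm that a resolution really reduces to the intended aspect ratio. The lookup table below copies the values from this guide; the function name is hypothetical.

```python
from math import gcd

# Resolutions (width, height) as listed in this guide.
RESOLUTIONS = {
    "16:9": (1280, 720),
    "9:16": (720, 1280),
    "1:1": (1080, 1080),
}


def reduced_ratio(width: int, height: int) -> str:
    """Reduce width:height to lowest terms, e.g. 1280x720 gives '16:9'."""
    divisor = gcd(width, height)
    return f"{width // divisor}:{height // divisor}"


if __name__ == "__main__":
    for name, (w, h) in RESOLUTIONS.items():
        print(name, f"{w}x{h}", reduced_ratio(w, h) == name)
```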

13. Credits & Costs

Action | Cost
Image generation | 5 credits
Video generation | 10 credits
Avatar video | 15 credits
Smart video pipeline | 25 credits
Text-to-speech | 3 credits
Test mode avatar | Free (0 credits)

You are charged only for generations that succeed. If a job fails, its credits are returned to your balance.
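
To estimate what a batch of work will cost, the price list and the two rules above (failed jobs cost nothing, Test Mode avatar renders are free) can be combined into a small calculator. This is a rough sketch: the action keys and function names are hypothetical, and only the credit amounts come from this guide.

```python
# Credit prices from this guide; the keys are illustrative names.
COSTS = {
    "image": 5,
    "video": 10,
    "avatar": 15,
    "smart_video": 25,
    "tts": 3,
}


def job_cost(action: str, succeeded: bool = True, test_mode: bool = False) -> int:
    """Net credits consumed by a single job."""
    if not succeeded:
        return 0  # failed jobs end up costing nothing
    if action == "avatar" and test_mode:
        return 0  # Test Mode avatar previews are free
    return COSTS[action]


def total_cost(jobs: list[tuple[str, bool, bool]]) -> int:
    """Sum net credits over (action, succeeded, test_mode) tuples."""
    return sum(job_cost(a, s, t) for a, s, t in jobs)


if __name__ == "__main__":
    session = [
        ("image", True, False),   # 5
        ("avatar", True, True),   # 0, test render
        ("avatar", True, False),  # 15, final render
        ("video", False, False),  # 0, job failed
    ]
    print(total_cost(session))  # 20
```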

14. Pro Tips

Combine features for maximum impact: Use a Pexels "office" background + Close-Up avatar style + Friendly emotion + Captions for a professional talking-head video that looks like it was produced in a studio.

Test before you commit: Always use Test Mode first to verify your avatar, voice, and background look good together. Test Mode is free and renders in seconds.

Green screen for compositing: Use the Green Screen color preset if you plan to key out the background in a video editor later. Combine with Remove Photo BG (matting) for a clean key.

Portrait for social media: Choose 9:16 Portrait format for TikTok, Reels, and Shorts. The avatar auto-scales to fill the vertical frame.

Hover to preview: Hover over any avatar or talking photo thumbnail to see a large preview. Avatars show a video animation preview from their Bunny CDN cache.
