Perso
Perso AI is an AI-powered video localization platform that provides natural-sounding AI dubbing, voice cloning, pixel-perfect lip-sync, automatic subtitles and real-time script editing to translate and localize videos across 33+ languages and interactive AI experiences via an SDK and physical AI agents.
Perso is video software teams evaluate for video. Use this page to review pricing, integration signals, and the best alternatives before you commit.
Used in These Packs
Quick Overview
Best for: Video
What it does
Video software for decision-makers comparing workflow fit and alternatives.
Best fit
Video
Pricing snapshot
Freemium from $0
Next step
Compare Perso with similar tools before you shortlist it.
Compare this tool before you shortlist it
Review alternatives, pricing posture, and workflow fit side by side.
Perso
Perso AI is a video translation, localization and dubbing platform that uses machine learning to deliver natural voice cloning, precise lip-sync, automated subtitle generation and editable translated scripts. It targets content creators, education providers, marketers, enterprises and media studios who want to scale multilingual video production quickly and cost-effectively. In addition to cloud-based AI dubbing, Perso provides an Interactive SDK and physical AI "Human Station" products for interactive, multimodal conversational experiences in retail, travel, public venues and other spaces. Perso emphasizes speed (minutes, not weeks), high voice-match fidelity, enterprise security (SOC 2) and multi-format export for global distribution.
Perso AI is an AI-powered video localization platform that provides natural-sounding AI dubbing, voice cloning, pixel-perfect lip-sync, automatic subtitles and real-time script editing to translate and localize videos across 33+ languages and interactive AI experiences via an SDK and physical AI agents.
Own this listing?
Claim this page to add pricing, features, screenshots, and verified owner details.
Claim this listingKey Features
Natural Voice Cloning
AI reproduces the speaker's unique voice characteristics, accent and tone to create natural-sounding dubbed audio that preserves speaker identity.
Pixel-Perfect Lip Sync
Advanced lip-sync generation aligns translated audio with mouth movements for seamless, native-feeling video playback.
One-Click Translation
Upload video, choose target languages and let Perso transcribe, translate, clone voices and generate dubbed audio automatically.
Real-Time Script Editing
Edit AI-generated translations directly in the interface and instantly regenerate dubbing, lip-sync and subtitles to refine tone and terminology.
33+ Languages (AI Dubbing)
Supports over 33 languages for video localization and claims reach across many global markets; Interactive SDK supports 100+ languages for conversational experiences.
Multi-Speaker Support
Handles videos with multiple speakers, preserving distinct voices and enabling multi-speaker dubbing workflows.
Multi-Format Export & Subtitles
Export localized content in MP4, MOV, WebM and audio formats (WAV), with embedded subtitles or separate SRT files.
Project Storage & Booster Processing
Plans include project storage windows, booster concurrent processing and queue controls to accelerate production and parallelize jobs.
Interactive SDK & AI Human Station
SDK and physical AI agents for deploying conversational AI experiences in physical spaces and across screens with gaze processing and emotional intelligence.
Enterprise Controls & Security
Enterprise plans offer dedicated managers, priority support, multi-team workspace management, SOC 2 compliance and encryption in transit and at rest.
Pricing
Free plan with 1 minute of fast dubbing for new users (5 free minutes referenced elsewhere on site); includes watermark and basic features.
Free
$0- One-time fast speed 1 minute for new users
- Total dubbing time: 1 min
- Max length per video: 1 min
- 30-day project storage
Starter
$6.99 / month (monthly billing only)- Fast speed allocation: 15 min (monthly reset)
- Total dubbing limit: 15 min
- Max length per video: 5 min
- Unlimited project storage
Creator
$21 / month billed yearly ($252/year) — $29 monthly (promotional yearly price shown)- Fast speed allocation: 30 min (monthly reset)
- Unlimited low-speed dubbing content creation
- Max length per video: 15 min
- Unlimited project storage
Pro
$44 / month billed yearly ($528/year) — $59 monthly (promotional yearly price shown)- Fast speed allocation: 60 min (monthly reset)
- Max length per video: 30 min
- Unlimited project storage
- Booster concurrent processing up to 2, booster queue up to 3
Enterprise
Custom pricing (contact sales)- High-volume capacity with custom plans and dedicated infrastructure
- 1,000+ min/mo discounts with dedicated infrastructure
- Dedicated manager and priority response
- Multi-team workspace management and exclusive support resources
Use Cases
Content Creators & Social
Localize YouTube, TikTok, Instagram and short-form videos to reach international audiences with native-sounding dubbed audio and accurate lip-sync.
E-Learning & Training
Localize courses, training videos and webinars at scale to expand into new markets and reduce localization costs.
Enterprise & Internal Communications
Translate product demos, corporate comms and onboarding content for global teams while preserving speaker identity and tone.
Marketing & E-Commerce
Produce localized ads, product demos and customer testimonial videos to improve conversion rates and engagement in target regions.
Media & Entertainment
Localize documentaries, series and films for distribution with faster turnaround than traditional dubbing workflows.
Retail & Public Venues (Interactive AI)
Deploy Perso Interactive AI Human Stations and SDK in stores, airports and exhibitions to provide multilingual, context-aware customer interactions and information services.
Integrations
Interactive SDK
SDK to embed Perso's interactive, multilingual conversational experiences into web and device experiences.
LLM Model APIs
Supports multiple LLM options and APIs (the site references customization with providers such as OpenAI GPT-3.5 and HyperCLOVA X).
Content Sources (YouTube / Cloud Storage)
Direct dubbing from video URLs and support for sources like YouTube and cloud storage integrations (Google Drive referenced).
Export & Subtitles
Integration with downstream publishing via multi-format exports (MP4, MOV, WebM) and SRT subtitle files for workflows and CMS ingestion.
Benefits
Limitations
Frequently Asked Questions
What languages does Perso AI support for AI dubbing?
Does Perso AI include lip-syncing, voice-over and voice cloning?
Can I edit the AI-translated script?
Can I dub videos directly from YouTube or Google Drive?
What is Perso AI Dubbing?
Getting Started
- 1 Step 1 Upload Video or Audio Files: Drag & drop files or provide a URL (supports MP4, MOV, MP3, WAV and common sources such as YouTube).
- 2 Step 2 Select a Language: Choose a target language and let Perso transcribe, translate, voice clone and generate dubbed audio.
- 3 Step 3 Download your Dubbed Video: Export localized video (MP4) or audio (WAV), and optionally download SRT subtitle files.
Support
Contact / Sales
Contact Us / Talk to Sales for enterprise inquiries and custom pricing; the site has a sales contact form.
Docs & Blog
Blog & Insights, guides and product resources are available on the website for onboarding and best practices.
Community
Community resources and media/press sections are referenced on the site for additional support and updates.
Enterprise Customer Success
Enterprise plans offer a dedicated customer success manager and priority support channels.
API
Not available in provided content (site references LLM model APIs and Interactive SDK—contact Perso or check developer docs for full API references).
Compare Perso with similar tools
See how it stacks up against alternatives
Related Tools
View all 81 →Videoideas
VideoIdeas.ai is an AI-powered content platform built for YouTube creators that generates video ideas, full scripts, short-form content, ad scripts, channel analysis, and style cloning to help creators produce engaging videos faster and grow their channels.
Latentsync
LatentSync is an AI-powered video lip-synchronization framework that uses audio-conditioned latent diffusion models to produce precise, natural-looking lip motion alignment for videos across multiple languages and formats.
Premium Alternatives
Retouchpro
Retouchpro (AI Photo Generator) is a web-based AI image generation and editing platform for creators, influencers, and agencies that produces photorealistic and stylized images in seconds using multiple top image models and community-driven templates.
Aidancevideo
AI Dance Video is a web tool that turns any still photo (people, pets, or objects) into a short, shareable dancing video using motion-control AI models — aimed at social creators and casual users who want quick, humorous dance clips.
Hairstyleai
HairstyleAI is a virtual AI-powered hairstyle try-on service for men and women that generates photorealistic images of you in different haircuts so you can preview styles before committing to a real haircut.
Closerscopy
ClosersCopy is an AI-powered copywriting platform that helps marketers, copywriters, and teams generate long-form content, sales copy, ads, emails and SEO-optimized blog posts using proprietary AI models, customizable frameworks and a library of templates.