Perso
Perso AI is an AI-powered video localization platform that provides natural-sounding AI dubbing, voice cloning, pixel-perfect lip-sync, automatic subtitles and real-time script editing to translate and localize videos across 33+ languages and interactive AI experiences via an SDK and physical AI agents.
Perso is video software teams evaluate for video. Use this page to review pricing, integration signals, and the best alternatives before you commit.
Used in These Packs
Quick Overview
Best for: Video
What it does
Video software for decision-makers comparing workflow fit and alternatives.
Best fit
Video
Pricing snapshot
Freemium from $0
Next step
Compare Perso with similar tools before you shortlist it.
Compare this tool before you shortlist it
Review alternatives, pricing posture, and workflow fit side by side.
Perso
Perso AI is a video translation, localization and dubbing platform that uses machine learning to deliver natural voice cloning, precise lip-sync, automated subtitle generation and editable translated scripts. It targets content creators, education providers, marketers, enterprises and media studios who want to scale multilingual video production quickly and cost-effectively. In addition to cloud-based AI dubbing, Perso provides an Interactive SDK and physical AI "Human Station" products for interactive, multimodal conversational experiences in retail, travel, public venues and other spaces. Perso emphasizes speed (minutes, not weeks), high voice-match fidelity, enterprise security (SOC 2) and multi-format export for global distribution.
Perso AI is an AI-powered video localization platform that provides natural-sounding AI dubbing, voice cloning, pixel-perfect lip-sync, automatic subtitles and real-time script editing to translate and localize videos across 33+ languages and interactive AI experiences via an SDK and physical AI agents.
Own this listing?
Claim this page to add pricing, features, screenshots, and verified owner details.
Claim this listingKey Features
Natural Voice Cloning
AI reproduces the speaker's unique voice characteristics, accent and tone to create natural-sounding dubbed audio that preserves speaker identity.
Pixel-Perfect Lip Sync
Advanced lip-sync generation aligns translated audio with mouth movements for seamless, native-feeling video playback.
One-Click Translation
Upload video, choose target languages and let Perso transcribe, translate, clone voices and generate dubbed audio automatically.
Real-Time Script Editing
Edit AI-generated translations directly in the interface and instantly regenerate dubbing, lip-sync and subtitles to refine tone and terminology.
33+ Languages (AI Dubbing)
Supports over 33 languages for video localization and claims reach across many global markets; Interactive SDK supports 100+ languages for conversational experiences.
Multi-Speaker Support
Handles videos with multiple speakers, preserving distinct voices and enabling multi-speaker dubbing workflows.
Multi-Format Export & Subtitles
Export localized content in MP4, MOV, WebM and audio formats (WAV), with embedded subtitles or separate SRT files.
Project Storage & Booster Processing
Plans include project storage windows, booster concurrent processing and queue controls to accelerate production and parallelize jobs.
Interactive SDK & AI Human Station
SDK and physical AI agents for deploying conversational AI experiences in physical spaces and across screens with gaze processing and emotional intelligence.
Enterprise Controls & Security
Enterprise plans offer dedicated managers, priority support, multi-team workspace management, SOC 2 compliance and encryption in transit and at rest.
Pricing
Free plan with 1 minute of fast dubbing for new users (5 free minutes referenced elsewhere on site); includes watermark and basic features.
Free
$0- One-time fast speed 1 minute for new users
- Total dubbing time: 1 min
- Max length per video: 1 min
- 30-day project storage
Starter
$6.99 / month (monthly billing only)- Fast speed allocation: 15 min (monthly reset)
- Total dubbing limit: 15 min
- Max length per video: 5 min
- Unlimited project storage
Creator
$21 / month billed yearly ($252/year) — $29 monthly (promotional yearly price shown)- Fast speed allocation: 30 min (monthly reset)
- Unlimited low-speed dubbing content creation
- Max length per video: 15 min
- Unlimited project storage
Pro
$44 / month billed yearly ($528/year) — $59 monthly (promotional yearly price shown)- Fast speed allocation: 60 min (monthly reset)
- Max length per video: 30 min
- Unlimited project storage
- Booster concurrent processing up to 2, booster queue up to 3
Enterprise
Custom pricing (contact sales)- High-volume capacity with custom plans and dedicated infrastructure
- 1,000+ min/mo discounts with dedicated infrastructure
- Dedicated manager and priority response
- Multi-team workspace management and exclusive support resources
Use Cases
Content Creators & Social
Localize YouTube, TikTok, Instagram and short-form videos to reach international audiences with native-sounding dubbed audio and accurate lip-sync.
E-Learning & Training
Localize courses, training videos and webinars at scale to expand into new markets and reduce localization costs.
Enterprise & Internal Communications
Translate product demos, corporate comms and onboarding content for global teams while preserving speaker identity and tone.
Marketing & E-Commerce
Produce localized ads, product demos and customer testimonial videos to improve conversion rates and engagement in target regions.
Media & Entertainment
Localize documentaries, series and films for distribution with faster turnaround than traditional dubbing workflows.
Retail & Public Venues (Interactive AI)
Deploy Perso Interactive AI Human Stations and SDK in stores, airports and exhibitions to provide multilingual, context-aware customer interactions and information services.
Integrations
Interactive SDK
SDK to embed Perso's interactive, multilingual conversational experiences into web and device experiences.
LLM Model APIs
Supports multiple LLM options and APIs (the site references customization with providers such as OpenAI GPT-3.5 and HyperCLOVA X).
Content Sources (YouTube / Cloud Storage)
Direct dubbing from video URLs and support for sources like YouTube and cloud storage integrations (Google Drive referenced).
Export & Subtitles
Integration with downstream publishing via multi-format exports (MP4, MOV, WebM) and SRT subtitle files for workflows and CMS ingestion.
Benefits
Limitations
Frequently Asked Questions
What languages does Perso AI support for AI dubbing?
Does Perso AI include lip-syncing, voice-over and voice cloning?
Can I edit the AI-translated script?
Can I dub videos directly from YouTube or Google Drive?
What is Perso AI Dubbing?
Getting Started
- 1 Step 1 Upload Video or Audio Files: Drag & drop files or provide a URL (supports MP4, MOV, MP3, WAV and common sources such as YouTube).
- 2 Step 2 Select a Language: Choose a target language and let Perso transcribe, translate, voice clone and generate dubbed audio.
- 3 Step 3 Download your Dubbed Video: Export localized video (MP4) or audio (WAV), and optionally download SRT subtitle files.
Support
Contact / Sales
Contact Us / Talk to Sales for enterprise inquiries and custom pricing; the site has a sales contact form.
Docs & Blog
Blog & Insights, guides and product resources are available on the website for onboarding and best practices.
Community
Community resources and media/press sections are referenced on the site for additional support and updates.
Enterprise Customer Success
Enterprise plans offer a dedicated customer success manager and priority support channels.
API
Not available in provided content (site references LLM model APIs and Interactive SDK—contact Perso or check developer docs for full API references).
Compare Perso with similar tools
See how it stacks up against alternatives
Related Tools
View all 81 →Aiconverthub
AI Convert Hub is a browser-based file conversion service for video, audio, and images that offers batch uploads, codec-aware presets, FFmpeg-based encoding with AI tuning, and secure transfers for creators and developers.
Cinemadrop
CinemaDrop is an all-in-one AI filmmaking studio that turns a single idea into a full script, storyboard, and finished video using generative image, video and audio models with built-in character & scene consistency.
Videocaption
Videocaption is a browser-based tool that auto-generates and burns stylish, editable subtitles into videos for social and professional use, using AI transcription and support for .srt/.vtt files. It's aimed at creators, marketers, educators and teams who need fast, accessible captioning without heavy desktop software.
Premium Alternatives
automateclips
AutomateClips is an AI-powered video generator that transforms app walkthroughs into viral-ready content featuring virtual influencers, designed to showcase app features and drive downloads on platforms like TikTok, Instagram, and YouTube.
seogeek
seoGEEK is an all-in-one SEO and digital marketing tool designed for web developers, SEO experts, and digital marketing agencies. It offers advanced AI-powered features for content creation, keyword analysis, project management, and advertising optimization to streamline workflows and grow businesses.
Seeyourbaby
SeeYourBaby is an AI-powered baby generator that predicts a future child's likely appearance from photos of two parents, delivering multiple high-resolution boy and girl images via email with a one-time payment.
Deepwander
Deepwander is an AI-powered companion for personal growth that guides interactive self-reflection to help users explore thoughts, emotions, and behaviors and arrive at clarity and practical next steps.
botgauge
BotGauge is an AI-driven autonomous QA solution that delivers over 80% test coverage within two weeks, enabling faster, more reliable end-to-end testing with human-verified accuracy. It is designed for engineering teams seeking to automate testing without the need for scripting or large QA teams.
Stringartgenerator
String Art Generator is a web-based tool that converts photos into precise, printable string-art patterns using an advanced algorithm, customizable settings, and high-resolution exports for hobbyists and professionals.
Soulmatedrawing
Soulmate Drawing generates a personalized soulmate sketch by combining your birth details, astrological archetypes, and an AI artist to create a hand-drawn style digital portrait for entertainment and self-reflection.