Perso

Perso

Perso AI is an AI-powered video localization platform that provides natural-sounding AI dubbing, voice cloning, pixel-perfect lip-sync, automatic subtitles and real-time script editing to translate and localize videos across 33+ languages and interactive AI experiences via an SDK and physical AI agents.

Perso is video software teams evaluate for video. Use this page to review pricing, integration signals, and the best alternatives before you commit.

Freemium API Enterprise 80/100
#81 in Video (81 tools)
Added 3 months ago
18223 directory views this week

Quick Overview

Best for: Video

What it does

Video software for decision-makers comparing workflow fit and alternatives.

Best fit

Video

Pricing snapshot

Freemium from $0

Next step

Compare Perso with similar tools before you shortlist it.

Compare this tool before you shortlist it

Review alternatives, pricing posture, and workflow fit side by side.

Perso

Perso AI is a video translation, localization and dubbing platform that uses machine learning to deliver natural voice cloning, precise lip-sync, automated subtitle generation and editable translated scripts. It targets content creators, education providers, marketers, enterprises and media studios who want to scale multilingual video production quickly and cost-effectively. In addition to cloud-based AI dubbing, Perso provides an Interactive SDK and physical AI "Human Station" products for interactive, multimodal conversational experiences in retail, travel, public venues and other spaces. Perso emphasizes speed (minutes, not weeks), high voice-match fidelity, enterprise security (SOC 2) and multi-format export for global distribution.

Perso AI is an AI-powered video localization platform that provides natural-sounding AI dubbing, voice cloning, pixel-perfect lip-sync, automatic subtitles and real-time script editing to translate and localize videos across 33+ languages and interactive AI experiences via an SDK and physical AI agents.

Own this listing?

Claim this page to add pricing, features, screenshots, and verified owner details.

Claim this listing

Key Features

Natural Voice Cloning

AI reproduces the speaker's unique voice characteristics, accent and tone to create natural-sounding dubbed audio that preserves speaker identity.

Pixel-Perfect Lip Sync

Advanced lip-sync generation aligns translated audio with mouth movements for seamless, native-feeling video playback.

One-Click Translation

Upload video, choose target languages and let Perso transcribe, translate, clone voices and generate dubbed audio automatically.

Real-Time Script Editing

Edit AI-generated translations directly in the interface and instantly regenerate dubbing, lip-sync and subtitles to refine tone and terminology.

33+ Languages (AI Dubbing)

Supports over 33 languages for video localization and claims reach across many global markets; Interactive SDK supports 100+ languages for conversational experiences.

Multi-Speaker Support

Handles videos with multiple speakers, preserving distinct voices and enabling multi-speaker dubbing workflows.

Multi-Format Export & Subtitles

Export localized content in MP4, MOV, WebM and audio formats (WAV), with embedded subtitles or separate SRT files.

Project Storage & Booster Processing

Plans include project storage windows, booster concurrent processing and queue controls to accelerate production and parallelize jobs.

Interactive SDK & AI Human Station

SDK and physical AI agents for deploying conversational AI experiences in physical spaces and across screens with gaze processing and emotional intelligence.

Enterprise Controls & Security

Enterprise plans offer dedicated managers, priority support, multi-team workspace management, SOC 2 compliance and encryption in transit and at rest.

Pricing

Free Tier Available

Free plan with 1 minute of fast dubbing for new users (5 free minutes referenced elsewhere on site); includes watermark and basic features.

Free

$0
  • One-time fast speed 1 minute for new users
  • Total dubbing time: 1 min
  • Max length per video: 1 min
  • 30-day project storage

Starter

$6.99 / month (monthly billing only)
  • Fast speed allocation: 15 min (monthly reset)
  • Total dubbing limit: 15 min
  • Max length per video: 5 min
  • Unlimited project storage

Creator

$21 / month billed yearly ($252/year) — $29 monthly (promotional yearly price shown)
  • Fast speed allocation: 30 min (monthly reset)
  • Unlimited low-speed dubbing content creation
  • Max length per video: 15 min
  • Unlimited project storage

Pro

$44 / month billed yearly ($528/year) — $59 monthly (promotional yearly price shown)
  • Fast speed allocation: 60 min (monthly reset)
  • Max length per video: 30 min
  • Unlimited project storage
  • Booster concurrent processing up to 2, booster queue up to 3

Enterprise

Custom pricing (contact sales)
  • High-volume capacity with custom plans and dedicated infrastructure
  • 1,000+ min/mo discounts with dedicated infrastructure
  • Dedicated manager and priority response
  • Multi-team workspace management and exclusive support resources

Use Cases

Content Creators & Social

Localize YouTube, TikTok, Instagram and short-form videos to reach international audiences with native-sounding dubbed audio and accurate lip-sync.

E-Learning & Training

Localize courses, training videos and webinars at scale to expand into new markets and reduce localization costs.

Enterprise & Internal Communications

Translate product demos, corporate comms and onboarding content for global teams while preserving speaker identity and tone.

Marketing & E-Commerce

Produce localized ads, product demos and customer testimonial videos to improve conversion rates and engagement in target regions.

Media & Entertainment

Localize documentaries, series and films for distribution with faster turnaround than traditional dubbing workflows.

Retail & Public Venues (Interactive AI)

Deploy Perso Interactive AI Human Stations and SDK in stores, airports and exhibitions to provide multilingual, context-aware customer interactions and information services.

Integrations

Interactive SDK

SDK to embed Perso's interactive, multilingual conversational experiences into web and device experiences.

LLM Model APIs

Supports multiple LLM options and APIs (the site references customization with providers such as OpenAI GPT-3.5 and HyperCLOVA X).

Content Sources (YouTube / Cloud Storage)

Direct dubbing from video URLs and support for sources like YouTube and cloud storage integrations (Google Drive referenced).

Export & Subtitles

Integration with downstream publishing via multi-format exports (MP4, MOV, WebM) and SRT subtitle files for workflows and CMS ingestion.

Benefits

Reach global audiences in 33+ languages without remaking content
Reduce dubbing costs by up to ~90% compared to traditional production
Translate and publish in minutes rather than days or weeks
High voice-match fidelity (company claims ~98% voice match) to preserve speaker identity
Increased engagement—native-language content drives higher conversions (company cites ~3x improvement)
Enterprise-grade security: SOC 2 compliance and encryption in transit and at rest

Limitations

Free plan and entry tiers impose short fast-speed dubbing limits (e.g., 1 min one-time for free, 15 min/month for Starter) and maximum per-video lengths depending on plan.
Higher-resolution exports (4K) and larger fast-speed allocations require Pro or Enterprise plans.
Exact API documentation, rate limits and developer docs are not provided in the supplied content and require contacting Perso or checking developer resources.

Frequently Asked Questions

What languages does Perso AI support for AI dubbing?
Perso supports 33+ languages for AI dubbing and the Interactive SDK / AI Human Station supports communication in 100+ languages via various LLM integrations.
Does Perso AI include lip-syncing, voice-over and voice cloning?
Yes — Perso provides natural voice cloning, AI lip-sync (pixel-perfect mouth alignment) and voice-over generation as integrated features.
Can I edit the AI-translated script?
Yes — Perso includes real-time script editing so you can refine translations and instantly regenerate dubbing, lip-sync and subtitles.
Can I dub videos directly from YouTube or Google Drive?
Yes — the platform supports dubbing from URLs and references direct dubbing from YouTube and cloud sources like Google Drive.
What is Perso AI Dubbing?
Perso AI Dubbing is the end-to-end AI workflow for translating, voice cloning, lip-syncing and exporting localized video content quickly and at scale.

Getting Started

  1. 1 Step 1 Upload Video or Audio Files: Drag & drop files or provide a URL (supports MP4, MOV, MP3, WAV and common sources such as YouTube).
  2. 2 Step 2 Select a Language: Choose a target language and let Perso transcribe, translate, voice clone and generate dubbed audio.
  3. 3 Step 3 Download your Dubbed Video: Export localized video (MP4) or audio (WAV), and optionally download SRT subtitle files.

Support

Contact / Sales

Contact Us / Talk to Sales for enterprise inquiries and custom pricing; the site has a sales contact form.

Docs & Blog

Blog & Insights, guides and product resources are available on the website for onboarding and best practices.

Community

Community resources and media/press sections are referenced on the site for additional support and updates.

Enterprise Customer Success

Enterprise plans offer a dedicated customer success manager and priority support channels.

API

Available: Yes
Documentation:

Not available in provided content (site references LLM model APIs and Interactive SDK—contact Perso or check developer docs for full API references).

Compare Perso with similar tools

See how it stacks up against alternatives

Related Tools

View all 81 →
Freemium
Zeemo

Zeemo

Zeemo is an AI-driven video creation and captioning platform that helps creators generate viral videos, AI-powered captions, and idea-to-video content—aimed at boosting views and saving editing time.

Video
Contact for pricing
Klipmeapp

Klipmeapp

Klipme is a web-based visual AI clip maker that automatically creates short-form promotional clips (TikToks, Reels, Shorts) and summaries from long-form video using generative and visual AI technologies.

Video
Free
Videoideas

Videoideas

VideoIdeas.ai is an AI-powered content platform built for YouTube creators that generates video ideas, full scripts, short-form content, ad scripts, channel analysis, and style cloning to help creators produce engaging videos faster and grow their channels.

Video
Free
Animoto

Animoto

Animoto is a web-based, drag-and-drop video maker that helps individuals and businesses create professional-looking videos quickly using templates, stock media, screen recording, and simple editing tools—no advanced editing skills required.

Video
High-growth
Free
Latentsync

Latentsync

LatentSync is an AI-powered video lip-synchronization framework that uses audio-conditioned latent diffusion models to produce precise, natural-looking lip motion alignment for videos across multiple languages and formats.

Video
Freemium
Hitpaw

Hitpaw

HitPaw is a consumer and professional multimedia software suite focused on AI-driven video, photo, and audio tools — highlighted by VikPea, a cloud-accelerated AI video enhancer capable of upscaling to 4K/8K, stabilizing, colorizing, repairing, and batch-processing footage.

Video
Contact for pricing
Klap

Klap

Klap is a tool for turning longer videos into short, shareable clips intended to drive viral engagement, aimed at creators, marketers, and publishers looking to repurpose video content for short-form platforms.

Video
Contact for pricing
Berrycast

Berrycast

Berrycast is a video screen recorder for Windows and Mac designed to capture and share screen recordings for tutorials, demos, and asynchronous communication.

Video

Premium Alternatives

Paid
Buildai

Buildai

BuildAI, ekiplerin proje yönetimi, blog ve içerik yönetimi, medya dosyaları, form oluşturma ve esnek key-value veri depolama ihtiyaçlarını tek bir platformda toplayan modern bir çok amaçlı yönetim aracıdır.

Productivity
Enterprise-ready
Paid
Retouchpro

Retouchpro

Retouchpro (AI Photo Generator) is a web-based AI image generation and editing platform for creators, influencers, and agencies that produces photorealistic and stylized images in seconds using multiple top image models and community-driven templates.

Image & Design
Enterprise-ready High-growth
Paid
Podfy

Podfy

Podfy.ai converts text and audio into fully edited videos (with narration, subtitles, effects and soundtrack) in minutes, aimed at creators who want to mass-produce content for platforms like YouTube, TikTok and Instagram.

Text-to-Video
Paid
runrly

runrly

Runrly is an AI-powered marketing platform offering on-demand marketing teams for startups and lean brands, enabling fast, scalable campaign execution with real-time insights and predictable subscription pricing.

Marketing
Paid
Aidancevideo

Aidancevideo

AI Dance Video is a web tool that turns any still photo (people, pets, or objects) into a short, shareable dancing video using motion-control AI models — aimed at social creators and casual users who want quick, humorous dance clips.

Video Generation
Paid
Hairstyleai

Hairstyleai

HairstyleAI is a virtual AI-powered hairstyle try-on service for men and women that generates photorealistic images of you in different haircuts so you can preview styles before committing to a real haircut.

Image & Design
Paid
Drawmy

Drawmy

DrawMy.Pet is an AI-powered service that generates custom pet portraits and social-media-ready video reels in 50+ styles with fast (often 24-hour) delivery, secure payment, and a money-back guarantee.

Generative Art
Paid
Closerscopy

Closerscopy

ClosersCopy is an AI-powered copywriting platform that helps marketers, copywriters, and teams generate long-form content, sales copy, ads, emails and SEO-optimized blog posts using proprietary AI models, customizable frameworks and a library of templates.

Copywriting

Explore Related Categories

Explore by Outcome