Perso

Perso AI is an AI-powered video localization platform. It provides natural-sounding AI dubbing, voice cloning, pixel-perfect lip-sync, automatic subtitles and real-time script editing to translate and localize videos across 33+ languages, and it also offers interactive AI experiences via an SDK and physical AI agents.

Perso is video localization software. Use this page to review pricing, integration signals, and the best alternatives before you commit.

Freemium · API · Enterprise · 80/100
#81 in Video (81 tools)
Added 3 months ago
17894 directory views this week

Quick Overview

Best for: Video

What it does

Video translation, localization and dubbing with voice cloning, lip-sync, subtitle generation and editable translated scripts.

Best fit

Video

Pricing snapshot

Freemium from $0

Next step

Compare Perso with similar tools before you shortlist it.

Perso

Perso AI is a video translation, localization and dubbing platform that uses machine learning to deliver natural voice cloning, precise lip-sync, automated subtitle generation and editable translated scripts. It targets content creators, education providers, marketers, enterprises and media studios who want to scale multilingual video production quickly and cost-effectively. In addition to cloud-based AI dubbing, Perso provides an Interactive SDK and physical AI "Human Station" products for interactive, multimodal conversational experiences in retail, travel, public venues and other spaces. Perso emphasizes speed (minutes, not weeks), high voice-match fidelity, enterprise security (SOC 2) and multi-format export for global distribution.

Key Features

Natural Voice Cloning

AI reproduces the speaker's unique voice characteristics, accent and tone to create natural-sounding dubbed audio that preserves speaker identity.

Pixel-Perfect Lip Sync

Advanced lip-sync generation aligns translated audio with mouth movements for seamless, native-feeling video playback.

One-Click Translation

Upload video, choose target languages and let Perso transcribe, translate, clone voices and generate dubbed audio automatically.

Real-Time Script Editing

Edit AI-generated translations directly in the interface and instantly regenerate dubbing, lip-sync and subtitles to refine tone and terminology.

33+ Languages (AI Dubbing)

Supports 33+ languages for video localization. The Interactive SDK separately supports 100+ languages for conversational experiences.

Multi-Speaker Support

Handles videos with multiple speakers, preserving distinct voices and enabling multi-speaker dubbing workflows.

Multi-Format Export & Subtitles

Export localized content in MP4, MOV, WebM and audio formats (WAV), with embedded subtitles or separate SRT files.
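For readers unfamiliar with the separate SRT files mentioned above, the following is a minimal sketch of how subtitle cues are laid out in the SRT format: a running index, a time range, the text, then a blank line. The `Cue` type and function names are illustrative assumptions and are not part of Perso's product or API.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    start_ms: int  # cue start, in milliseconds
    end_ms: int    # cue end, in milliseconds
    text: str


def format_timestamp(ms: int) -> str:
    """Render milliseconds as the SRT timestamp HH:MM:SS,mmm."""
    hours, rest = divmod(ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    seconds, millis = divmod(rest, 1_000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def to_srt(cues: list[Cue]) -> str:
    """Serialize cues as SRT blocks: index, time range, text, blank line."""
    blocks = []
    for index, cue in enumerate(cues, start=1):
        time_range = f"{format_timestamp(cue.start_ms)} --> {format_timestamp(cue.end_ms)}"
        blocks.append(f"{index}\n{time_range}\n{cue.text}\n")
    return "\n".join(blocks)
```

Calling `to_srt([Cue(0, 1500, "Hello")])` produces a single block starting with `1`, followed by `00:00:00,000 --> 00:00:01,500` and the text line.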

Project Storage & Booster Processing

Plans include project storage windows, booster concurrent processing and queue controls to accelerate production and parallelize jobs.

Interactive SDK & AI Human Station

SDK and physical AI agents for deploying conversational AI experiences in physical spaces and across screens with gaze processing and emotional intelligence.

Enterprise Controls & Security

Enterprise plans offer dedicated managers, priority support, multi-team workspace management, SOC 2 compliance and encryption in transit and at rest.

Pricing

Free Tier Available

Free plan with 1 minute of fast dubbing for new users (5 free minutes referenced elsewhere on site); includes watermark and basic features.

Free

$0
  • One-time allowance of 1 minute of fast-speed dubbing for new users
  • Total dubbing time: 1 min
  • Max length per video: 1 min
  • 30-day project storage

Starter

$6.99 / month (monthly billing only)
  • Fast speed allocation: 15 min (monthly reset)
  • Total dubbing limit: 15 min
  • Max length per video: 5 min
  • Unlimited project storage

Creator

$21 / month billed yearly ($252/year), or $29 / month billed monthly (promotional yearly price shown)
  • Fast speed allocation: 30 min (monthly reset)
  • Unlimited low-speed dubbing
  • Max length per video: 15 min
  • Unlimited project storage

Pro

$44 / month billed yearly ($528/year), or $59 / month billed monthly (promotional yearly price shown)
  • Fast speed allocation: 60 min (monthly reset)
  • Max length per video: 30 min
  • Unlimited project storage
  • Booster concurrent processing up to 2, booster queue up to 3

Enterprise

Custom pricing (contact sales)
  • High-volume capacity with custom plans and dedicated infrastructure
  • Volume discounts at 1,000+ min/month
  • Dedicated manager and priority response
  • Multi-team workspace management and exclusive support resources
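The yearly and monthly prices quoted for Creator and Pro can be compared with a little arithmetic. The sketch below uses only the figures listed above; the dictionary layout and function name are illustrative, and the prices are promotional, so they may change.

```python
# Prices copied from the plan list above (USD).
PLANS = {
    "Creator": {"yearly_per_month": 21, "monthly": 29},
    "Pro": {"yearly_per_month": 44, "monthly": 59},
}


def yearly_savings(plan: str) -> tuple[int, int, float]:
    """Return (yearly_total, monthly_total_over_12_months, percent_saved)."""
    prices = PLANS[plan]
    yearly_total = prices["yearly_per_month"] * 12
    monthly_total = prices["monthly"] * 12
    percent_saved = 100 * (monthly_total - yearly_total) / monthly_total
    return yearly_total, monthly_total, round(percent_saved, 1)


for name in PLANS:
    print(name, yearly_savings(name))
```

On these figures, yearly billing costs $252 versus $348 for Creator (about 27.6% less) and $528 versus $708 for Pro (about 25.4% less).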

Use Cases

Content Creators & Social

Localize YouTube, TikTok, Instagram and short-form videos to reach international audiences with native-sounding dubbed audio and accurate lip-sync.

E-Learning & Training

Localize courses, training videos and webinars at scale to expand into new markets and reduce localization costs.

Enterprise & Internal Communications

Translate product demos, corporate comms and onboarding content for global teams while preserving speaker identity and tone.

Marketing & E-Commerce

Produce localized ads, product demos and customer testimonial videos to improve conversion rates and engagement in target regions.

Media & Entertainment

Localize documentaries, series and films for distribution with faster turnaround than traditional dubbing workflows.

Retail & Public Venues (Interactive AI)

Deploy Perso Interactive AI Human Stations and SDK in stores, airports and exhibitions to provide multilingual, context-aware customer interactions and information services.

Integrations

Interactive SDK

SDK to embed Perso's interactive, multilingual conversational experiences into web and device experiences.

LLM Model APIs

Supports multiple LLM options and APIs (the site references customization with providers such as OpenAI GPT-3.5 and HyperCLOVA X).

Content Sources (YouTube / Cloud Storage)

Direct dubbing from video URLs and support for sources like YouTube and cloud storage integrations (Google Drive referenced).

Export & Subtitles

Integration with downstream publishing via multi-format exports (MP4, MOV, WebM) and SRT subtitle files for workflows and CMS ingestion.

Benefits

Reach global audiences in 33+ languages without remaking content
Reduce dubbing costs by up to 90% compared to traditional production (company claim)
Translate and publish in minutes rather than days or weeks
High voice-match fidelity (company claims ~98% voice match) to preserve speaker identity
Increased engagement—native-language content drives higher conversions (company cites ~3x improvement)
Enterprise-grade security: SOC 2 compliance and encryption in transit and at rest

Limitations

Free plan and entry tiers impose short fast-speed dubbing limits (e.g., 1 min one-time for free, 15 min/month for Starter) and maximum per-video lengths depending on plan.
Higher-resolution exports (4K) and larger fast-speed allocations require Pro or Enterprise plans.
API documentation, rate limits and developer docs are not included in this listing; contact Perso or check its developer resources.
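The per-plan limits above can be turned into a quick check of which self-serve tier covers a given workload. This is a rough sketch based only on the fast-speed allocations and per-video maximums listed under Pricing; it ignores the unlimited low-speed dubbing on Creator, the 4K export restriction and booster processing, and the names used are illustrative.

```python
from typing import Optional

# (name, fast-speed minutes per month, max minutes per video), cheapest first.
# Figures come from the Pricing section; Free's 1 minute is a one-time allowance.
TIERS = [
    ("Free", 1, 1),
    ("Starter", 15, 5),
    ("Creator", 30, 15),
    ("Pro", 60, 30),
]


def cheapest_plan(video_minutes: float, monthly_minutes: float) -> Optional[str]:
    """Return the first tier whose limits cover the request, else None."""
    for name, fast_minutes, max_video in TIERS:
        if video_minutes <= max_video and monthly_minutes <= fast_minutes:
            return name
    return None  # beyond Pro: custom Enterprise pricing would apply
```

For example, a 10-minute video within 20 fast-speed minutes a month lands on Creator, while a 45-minute video exceeds every self-serve tier and returns `None`.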

Frequently Asked Questions

What languages does Perso AI support for AI dubbing?
Perso supports 33+ languages for AI dubbing and the Interactive SDK / AI Human Station supports communication in 100+ languages via various LLM integrations.
Does Perso AI include lip-syncing, voice-over and voice cloning?
Yes — Perso provides natural voice cloning, AI lip-sync (pixel-perfect mouth alignment) and voice-over generation as integrated features.
Can I edit the AI-translated script?
Yes — Perso includes real-time script editing so you can refine translations and instantly regenerate dubbing, lip-sync and subtitles.
Can I dub videos directly from YouTube or Google Drive?
Yes — the platform supports dubbing from URLs and references direct dubbing from YouTube and cloud sources like Google Drive.
What is Perso AI Dubbing?
Perso AI Dubbing is the end-to-end AI workflow for translating, voice cloning, lip-syncing and exporting localized video content quickly and at scale.

Getting Started

  1. Upload Video or Audio Files: Drag & drop files or provide a URL (supports MP4, MOV, MP3, WAV and common sources such as YouTube).
  2. Select a Language: Choose a target language and let Perso transcribe, translate, voice clone and generate dubbed audio.
  3. Download your Dubbed Video: Export localized video (MP4) or audio (WAV), and optionally download SRT subtitle files.
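Before uploading in step 1, it can help to check locally that a file uses one of the input formats named above. The sketch below is a hypothetical pre-flight check, not part of Perso's tooling; the set of extensions is taken from this listing, and the service itself may accept other formats.

```python
from pathlib import Path

# Input formats named in step 1 of this listing; the real service may accept more.
SUPPORTED_INPUTS = {".mp4", ".mov", ".mp3", ".wav"}


def is_supported_input(filename: str) -> bool:
    """True if the file extension is one of the formats listed in step 1."""
    return Path(filename).suffix.lower() in SUPPORTED_INPUTS
```

The comparison is case-insensitive, so `talk.MP4` passes while `clip.webm` (listed only as an export format) does not.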

Support

Contact / Sales

Contact Us / Talk to Sales for enterprise inquiries and custom pricing; the site has a sales contact form.

Docs & Blog

Blog & Insights, guides and product resources are available on the website for onboarding and best practices.

Community

Community resources and media/press sections are referenced on the site for additional support and updates.

Enterprise Customer Success

Enterprise plans offer a dedicated customer success manager and priority support channels.

API

Available: Yes
Documentation: Not included in this listing. The site references LLM model APIs and the Interactive SDK; contact Perso or check its developer docs for full API references.

Related Tools

Freemium
Imobie

iMobie’s FocuSee is an AI-powered screen recorder for Windows and macOS that automatically polishes recordings with auto-zoom, cursor tracking, AI audio enhancement, subtitles, background removal and one-click exports for social platforms.

Video
High-growth
Freemium
Colossyan

Colossyan is an AI-driven platform for creating presenter-led videos, interactive training, and full learning programs from documents, slides, or scripts, aimed at teams producing onboarding, compliance, and enablement content at scale.

Video
Freemium
Visla

Visla is an AI-powered video workflow platform for teams that captures, creates, edits, and collaborates on branded videos at scale for marketing, training, sales, support, product, HR, and education.

Video
High-growth
Free
Tubebuddy

TubeBuddy Assistant is an AI-powered tool that turns YouTube videos with captions into interactive conversations—providing instant summaries, smart timestamps, and Q&A to extract insights and save viewing time.

Video
Freemium
Aiconverthub

AI Convert Hub is a browser-based file conversion service for video, audio, and images that offers batch uploads, codec-aware presets, FFmpeg-based encoding with AI tuning, and secure transfers for creators and developers.

Video
Freemium
Cinemadrop

CinemaDrop is an all-in-one AI filmmaking studio that turns a single idea into a full script, storyboard, and finished video using generative image, video and audio models with built-in character & scene consistency.

Video
High-growth
Contact for pricing
Klap

Klap is a tool for turning longer videos into short, shareable clips intended to drive viral engagement, aimed at creators, marketers, and publishers looking to repurpose video content for short-form platforms.

Video
Freemium
Videocaption

Videocaption is a browser-based tool that auto-generates and burns stylish, editable subtitles into videos for social and professional use, using AI transcription and support for .srt/.vtt files. It's aimed at creators, marketers, educators and teams who need fast, accessible captioning without heavy desktop software.

Video

Premium Alternatives

Paid
automateclips

AutomateClips is an AI-powered video generator that transforms app walkthroughs into viral-ready content featuring virtual influencers, designed to showcase app features and drive downloads on platforms like TikTok, Instagram, and YouTube.

Video Generation
Paid
seogeek

seoGEEK is an all-in-one SEO and digital marketing tool designed for web developers, SEO experts, and digital marketing agencies. It offers advanced AI-powered features for content creation, keyword analysis, project management, and advertising optimization to streamline workflows and grow businesses.

SEO
Paid
Buildai

BuildAI is a modern multi-purpose management tool that brings teams' project management, blog and content management, media files, form building, and flexible key-value data storage needs together on a single platform.

Productivity
Enterprise-ready
Paid
Seeyourbaby

SeeYourBaby is an AI-powered baby generator that predicts a future child's likely appearance from photos of two parents, delivering multiple high-resolution boy and girl images via email with a one-time payment.

Image & Design
Paid
Deepwander

Deepwander is an AI-powered companion for personal growth that guides interactive self-reflection to help users explore thoughts, emotions, and behaviors and arrive at clarity and practical next steps.

Chat
High-growth
Paid
botgauge

BotGauge is an AI-driven autonomous QA solution that delivers over 80% test coverage within two weeks, enabling faster, more reliable end-to-end testing with human-verified accuracy. It is designed for engineering teams seeking to automate testing without the need for scripting or large QA teams.

Automation
Paid
Stringartgenerator

String Art Generator is a web-based tool that converts photos into precise, printable string-art patterns using an advanced algorithm, customizable settings, and high-resolution exports for hobbyists and professionals.

Design Generators
High-growth
Paid
Soulmatedrawing

Soulmate Drawing generates a personalized soulmate sketch by combining your birth details, astrological archetypes, and an AI artist to create a hand-drawn style digital portrait for entertainment and self-reflection.

Design Generators
High-growth
