What It Does
AutoCaption transcribes speech from audio or video files and generates styled caption layers directly in After Effects. It uses AI-powered speech recognition (Deepgram Nova-3 or OpenAI Whisper Large v3) to convert spoken words into text layers that you can edit, animate, and style like any native After Effects element.
The plugin sends the audio to a server for transcription, so an internet connection is required, then creates the captions locally as real text layers. Nothing is baked in, so you keep full control over timing, wording, fonts, colors, and animation after generation.
Key Features
Five caption pacing modes let you match different speech rhythms. One Word places each word on its own layer for maximum impact. Two Words and Three Words offer balanced pacing. Smart Parts groups 2-5 words based on natural speech rhythm. Sentences splits by pauses and logical structure.
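The pacing modes can be pictured as different ways of grouping word-level timestamps into captions. The sketch below is illustrative only and is not AutoCaption's actual implementation: the `Word` type, the function names, and the 0.6-second pause threshold are all assumptions made for the example. It shows fixed-size grouping (as in One Word, Two Words, or Three Words) and a simple pause-based split in the spirit of Sentences mode.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    text: str
    start: float  # seconds
    end: float    # seconds


def group_fixed(words: List[Word], size: int) -> List[List[Word]]:
    """Group words into consecutive chunks of `size` words."""
    if size < 1:
        raise ValueError("size must be at least 1")
    return [words[i:i + size] for i in range(0, len(words), size)]


def group_by_pause(words: List[Word], min_pause: float = 0.6) -> List[List[Word]]:
    """Start a new caption whenever the silence before a word reaches `min_pause`."""
    groups: List[List[Word]] = []
    for word in words:
        if groups and word.start - groups[-1][-1].end < min_pause:
            groups[-1].append(word)
        else:
            groups.append([word])
    return groups


def caption_text(group: List[Word]) -> str:
    """Join the words of one group into a caption string."""
    return " ".join(w.text for w in group)


# Hypothetical transcript with a long pause after "welcome".
words = [
    Word("Hello", 0.0, 0.4), Word("and", 0.5, 0.6), Word("welcome", 0.7, 1.1),
    Word("Let's", 2.0, 2.3), Word("begin", 2.4, 2.8),
]

two_word = [caption_text(g) for g in group_fixed(words, 2)]
# ["Hello and", "welcome Let's", "begin"]

by_pause = [caption_text(g) for g in group_by_pause(words)]
# ["Hello and welcome", "Let's begin"]
```

Fixed-size grouping ignores pauses, which is why "welcome" and "Let's" end up on the same caption in the two-word result. A rhythm-aware mode such as Smart Parts would presumably combine both ideas, but how the plugin does that is not documented here.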
Text layer modes give you control over complexity. Generate captions as a single text layer for simpler projects or multiple layers when you need granular animation control.
30+ ready-to-use caption styles cover social media formats, explainers, kinetic typography, and motion-driven content. Every style is fully editable with customizable fonts, colors, timing, and animation parameters.
Auto font detection recognizes non-Latin scripts (Arabic, Chinese, Japanese, Korean, Hebrew, Thai) and suggests appropriate fonts automatically.
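One plausible way to detect the script of a transcript is to look at the Unicode names of its characters. The following is a minimal sketch under that assumption, not the plugin's documented method. The `detect_script` function, the prefix table, and the Noto font suggestions are hypothetical choices for illustration.

```python
import unicodedata
from collections import Counter
from typing import Optional

# Unicode character-name prefixes mapped to a script label (illustrative).
SCRIPT_PREFIXES = {
    "ARABIC": "Arabic",
    "HEBREW": "Hebrew",
    "THAI": "Thai",
    "HANGUL": "Korean",
    "HIRAGANA": "Japanese",
    "KATAKANA": "Japanese",
    "CJK": "Chinese",
}

# Hypothetical suggestions; the fonts AutoCaption proposes may differ.
SUGGESTED_FONTS = {
    "Arabic": "Noto Sans Arabic",
    "Hebrew": "Noto Sans Hebrew",
    "Thai": "Noto Sans Thai",
    "Korean": "Noto Sans KR",
    "Japanese": "Noto Sans JP",
    "Chinese": "Noto Sans SC",
}


def detect_script(text: str) -> Optional[str]:
    """Return the dominant non-Latin script in `text`, or None if there is none."""
    counts: Counter = Counter()
    for ch in text:
        name = unicodedata.name(ch, "")  # "" for unnamed characters
        for prefix, script in SCRIPT_PREFIXES.items():
            if name.startswith(prefix):
                counts[script] += 1
                break
    if not counts:
        return None
    # Kana appears only in Japanese, so it outweighs shared CJK ideographs.
    if counts["Japanese"]:
        return "Japanese"
    return counts.most_common(1)[0][0]


def suggest_font(text: str) -> Optional[str]:
    """Suggest a font for the detected script, or None for Latin-only text."""
    script = detect_script(text)
    return SUGGESTED_FONTS.get(script) if script else None
```

Japanese needs the special case because it mixes kana with ideographs that also appear in Chinese. Without it, a kanji-heavy line could be misread as Chinese.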
Pre-compose option groups subtitle layers into a single composition, keeping your timeline organized.
Multi-language support covers 99 languages through Whisper Large v3 Turbo with automatic language detection. Best results come from clear speech and clean audio. The plugin can also detect sung vocals and generate captions from songs when lyrics are audible.
No usage limits. Unlike credit-based services, AutoCaption has no restrictions on minutes processed or captions generated. Use it as much as needed within your license.
No API key required. The plugin works out of the box after activation without creating external accounts or managing API credentials.
Technical Details
- Upload limit is 200 MB per file.
- Processing time depends on audio length.
- Background noise and audio quality affect transcription accuracy.
- Large text layer counts may impact After Effects performance.
- Requires internet connection for caption generation.
- Compatible with After Effects CC 2019 or later.
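If you batch-process footage, a quick size check before upload can save a failed attempt. This is a small sketch, assuming the 200 MB limit is counted in binary megabytes (200 × 1024 × 1024 bytes). The listing does not say which definition applies, and the function names are hypothetical.

```python
import os

# Assumption: 200 MB interpreted as 200 * 1024 * 1024 bytes.
MAX_UPLOAD_BYTES = 200 * 1024 * 1024


def fits_upload_limit(size_bytes: int, limit: int = MAX_UPLOAD_BYTES) -> bool:
    """Return True if a file of `size_bytes` is within the upload limit."""
    return 0 <= size_bytes <= limit


def file_fits_upload_limit(path: str) -> bool:
    """Check a file on disk against the upload limit."""
    return fits_upload_limit(os.path.getsize(path))
```

Files over the limit could be trimmed or exported as audio only before transcription, since only the speech track matters for captions.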
Who It’s For
AutoCaption suits editors working on social media content, YouTube videos, training materials, or any project requiring accurate subtitles. The multiple pacing modes fit different content types, from fast-paced reels to long-form interviews. The style library speeds up delivery for clients who need caption variations.
Pricing
A 7-day free trial provides full feature access with no credit card required.
- Monthly: $8 per month, cancel anytime
- Yearly: $80 per year (saves 2 months compared to monthly)
- Perpetual: $150 one-time payment for a lifetime license
All tiers include complete feature access with no usage limits. Upgrade pricing may be available for existing customers.
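To compare the tiers, the arithmetic behind the listed prices can be worked out as follows. The sketch assumes prices stay constant over time and ignores any upgrade pricing or taxes.

```python
import math

MONTHLY = 8      # USD per month
YEARLY = 80      # USD per year
PERPETUAL = 150  # USD, one-time

# Yearly plan versus twelve monthly payments.
yearly_savings = MONTHLY * 12 - YEARLY   # 96 - 80 = 16 USD
months_saved = yearly_savings / MONTHLY  # 2.0 months of the monthly price

# First billing period at which cumulative payments reach or exceed
# the perpetual price.
breakeven_months = math.ceil(PERPETUAL / MONTHLY)  # 19 months on the monthly plan
breakeven_years = math.ceil(PERPETUAL / YEARLY)    # 2 years on the yearly plan
```

Under these assumptions, the perpetual license costs less than the monthly plan from month 19 onward ($152 versus $150) and less than the yearly plan from the second year ($160 versus $150).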