DEFINITIONS for VOICE ACTORS

Standard for Voice Actors →

Voice actors today face a credibility challenge: how do you prove your work is authentically yours when AI can clone voices instantly? This standard draws a clear line between human performance and machine generation.

QUICK REFERENCE

The VerifiedHuman label requires VH3 (Human-Led) or higher on the Human-AI Spectrum
Essential question: "Who gave voice to this?"
Acceptable AI use: Noise reduction, EQ, compression, mastering, audio cleanup
Not acceptable: AI-generated or AI-cloned voices you claim as your performance
The human voice remains distinct from AI simulation

QUICK NAV

Foreword →

Definitions in the standard →

Other definitions →

Interpretation →

Essential vocalization →

FOREWORD

The foundation of this standard is Essential Vocalization: who gave voice to this?

Voice acting is unique. Unlike writers, voice actors typically don't write their scripts—they perform them. The craft lies in bringing words to life through vocal performance. Voiceover must be heard to be experienced; even closed captions can only reference the words, not the inflection, emotion, and nuance that voice actors provide.

When a human performs vocal work, they are the essential vocalizer. When a machine generates it, the machine is the essential vocalizer.

Vocal work means the production of vocal elements for live performance, broadcast, or recording—narration, dialogue, or voice-over that has been performed, recorded, or broadcast.

The human voice remains distinct from AI simulation. Modern voice production involves technical processes in which AI tools may be employed—from studio recording enhancement to post-production processing—but these tools support, rather than replace, the voice actor's performance.

We support voice actors who use AI in non-standard ways.

For instance, some voice actors may experiment with AI tools for creative effects or stylized productions. They may find the VerifiedHuman label unhelpful on that piece, but use it for traditional voice work.

In this way, we hope to be helpful and encouraging to all voice actors.

HUMAN-AI COLLABORATION LEVELS

Our 5-level Human-AI Spectrum shows who led the creative process. All VerifiedHuman-labeled work must meet or exceed VH3, meaning the voice actor performed the vocal work.

Note: Voice acting differs from other creative domains. The core question isn't whether you transformed AI material—it's whether a human voice or a synthetic voice speaks. At VH3 and above, the voice must be human.

VH5 - Entirely Human-Performed

The voice actor performed all vocal content without generative AI
Traditional tools (noise reduction, EQ, compression, mastering) are fine
The work reflects the performer's voice, interpretation, and emotional expression

VH4 - Human-Performed, AI-Enhanced

The voice actor performed all vocal content
Generative AI may assist with non-performance tasks: script generation, research, session notes, technical documentation
The performer's voice and artistic choices dominate; AI supports the workflow but doesn't generate vocal content

VH3 - Human-Led, AI-Assisted

The voice actor leads the performance; AI contributes to production but doesn't replace the human voice
AI may assist with significant production elements (heavy processing, timing adjustments), but the core vocal performance is human
The final work reflects the performer's voice and interpretation—not AI-generated or AI-cloned vocals
If the voice itself is AI-generated, it's not human performance

Work below VH3 is not eligible for the VerifiedHuman label.

USING CONTENT OF UNKNOWN ORIGIN (AIU)

Sometimes, voice actors work with audio elements where the origin is uncertain. In rare cases, a voice actor may suspect that AI has generated some audio components they are working with, but the origin, time, place, or purpose is unknown or uncertain. When using audio elements of unknown origin, add the AIU (AI-Unknown) tag to your VH level designation, for example, VH4-AIU.

In the spirit of this standard, the question remains: Did you perform it, or did AI generate it? We believe the voice actor's intention to create and present vocal work as their own should always remain with them.

We acknowledge the hard work of skilled professionals—women and men who have honed their skills and mastered this craft in various languages. We appreciate your human voices and are grateful to be able to help you mark them as appropriately yours.

DEFINITIONS

Here are definitions of words and ideas used in the Standard for Voice Actors.

Standard

A statement of commitment to specific creative practices

Voice Actor/Voiceover Artist

The performer of vocal work

Represent

Claim, present, or share work as your own

Vocal work

Vocal elements produced for live performance, broadcast, or recording—narration, dialogue, or voice-over that has been performed, recorded, or broadcast

Team

A group of people working together

Intellectual property

A creative work to which one has legal rights, such as copyright or trademark

Essential

The fundamental elements or characteristics of something

Essentially vocalized

When a voice actor performs vocal elements to create vocal work. The test: Who gave voice to this? (see Essential Vocalization below)

Essential vocal elements

Sounds made by the vocal tract—can be heard as talking, singing, laughing, crying, screaming, humming, mumbling, or shouting

Human

Noun: a person; Adjective: originating from a person

Generative AI

AI systems trained on existing content that generate new text, images, audio, code, or video in response to prompts. Examples include ChatGPT, ElevenLabs, and Resemble AI.

Machine learning

Algorithms that learn from data to make predictions or generate content

Other generative processes

Any other AI or machine learning process that creates new content

OTHER DEFINITIONS

Other definitions of words and ideas related to the Standard for Voice Actors.

Artificial Intelligence (AI)

A field of computer science focused on creating systems that can perform tasks typically requiring human intelligence, such as learning, reasoning, and generating content.

Voice synthesis/Text-to-speech (TTS)

AI systems that generate spoken audio from text input, creating synthetic voices rather than recording human performance.

Voice cloning

AI systems trained on recordings of a specific human voice to generate new speech that mimics that person's voice.

Inflection

The rise and fall of pitch in the voice; changes in tone used to convey meaning, emotion, or emphasis.

Timbre

The characteristic quality of a voice, independent of pitch or volume, that makes each human voice distinctive.

INTERPRETATION

Here are questions voice actors commonly face when working with AI tools. The voice actor is responsible for adhering to their own values and exercising their own judgment.

Q: Can I use AI to clean up my recordings and still qualify for VerifiedHuman?

Here's how to evaluate this approach:

ACCEPTABLE: Using AI for technical post-processing (noise reduction, EQ, compression, mastering) while you performed the entire vocal work. AI enhanced your recording; you created the performance.

GRAY AREA: Using heavy pitch correction or vocal processing that significantly alters your natural voice. Ask yourself: Is this still recognizably my performance?

NOT ACCEPTABLE: Using AI voice cloning or synthesis to generate vocal performances. If AI generated the voice, you didn't perform it—even if you wrote the script or provided the voice sample.

The question remains: Who performed it?

REAL-WORLD SCENARIO: Audiobook Narration with AI Enhancement

Sarah narrates a 10-hour audiobook. She uses AI to:

Remove background noise and mouth clicks (audio cleanup)
Normalize volume levels across chapters (mastering)
Reduce echo in her home studio (acoustic correction)
Apply light compression for consistency (production enhancement)

Sarah performed every word, character voice, and emotional inflection herself.

VERDICT: VH4-VH5 (Human-Performed, AI-Enhanced / Entirely Human-Performed)

WHY IT QUALIFIES: AI assisted with technical post-production, but Sarah performed all vocal work. She is the essential vocalizer.

ESSENTIAL VOCALIZATION

In vocal performance, artists use their minds and distinctive physical and verbal traits to bring human expressions to life, crafting unique soundscapes. These come alive in movies, radio, audiobooks, and more.

The essential question of vocalization is: Who (or what) gave voice to this sound?

RATIONALE

We hear and experience spoken vocalizations—either human voices or voices simulated by a machine.
When a human's voice can be experienced live or in recorded media, the result is vocal work.

ASSUMPTION OF ESSENTIAL HUMAN VOCALIZATION

If a human (and not a machine) gives voice to words and sounds, then a human is the essential vocalizer.

VerifiedHuman helps voice actors certify their work as human-made. Established in April 2023, the standard provides a clear framework for communicating who led the creative process. Free to join, pay what you can. Creators certified across 25+ countries worldwide.

Standard for Voice Actors →

EXPLORE OTHER DEFINITIONS

Writers → | Visual Artists → | Musicians →

LEARN MORE

VH Human-AI Spectrum →

J. Michael Collins

Gary DuBois

Dean Allen

Cacala Carvalho

David Hollandsworth

Patricia Neumann

Natalie Tyer

Wendy Ballentyne

Jeremy DV Boyd

DEFINITIONS for VOICE ACTORS

Standard for Voice Actors →

Voice actors today face a credibility challenge: how do you prove your work is authentically yours when AI can clone voices instantly? This standard draws a clear line between human performance and machine generation.

Standard for Voice Actors →