Google hires Hume AI staff to add voice, emotion to DeepMind models

By AI Daily Post Edited by Brian Petersen, Editor-in-Chief

January 22, 2026 • Updated: July 4, 2026 • 3 min read

Google DeepMind has hired Alan Cowen, the founder of Hume AI, along with other members of the startup's staff. The move signals a clear shift: the lab plans to build voice and emotion-recognition technology directly into its core models.

Hume AI has spent millions training systems to pick up subtle emotional cues in speech. Its incoming CEO, Andrew Ettinger, puts it plainly: voice is set to become a primary way people talk to AI.

Cowen and the other Hume AI recruits will help Google DeepMind integrate voice and emotional intelligence into its latest models, according to sources who spoke on the condition of anonymity as they aren't authorized to speak publicly about the deal. Hume AI has invested millions in developing models and tools to hone realistic voice interfaces and to detect emotions in the voices of users. The company trains its models by having experts annotate emotional cues in real conversations.

"Voice is going to become a primary interface for AI, that is absolutely where it's headed," says Andrew Ettinger, an experienced investor and executive who is taking over as the CEO of Hume AI.

Google Acquires Top Talent From AI Voice Startup Hume AI in Licensing Deal - WIRED AI

This is a strategic shift for Google. The focus moves beyond raw data processing toward making AI sound and feel human. The specific goal is to build models that detect subtle vocal cues such as frustration and fatigue.

If Ettinger's prediction holds and voice becomes a primary interface, this underlying emotional technology will shape how millions of people engage with Google's products.

Common Questions Answered

What is OCTAVE and what makes it unique among speech-language models?

OCTAVE (Omni-Capable Text and Voice Engine) is Hume AI's next-generation speech-language model. It can generate not just voices but entire personalities from brief prompts or recordings. Unlike traditional text-to-speech systems, OCTAVE can create multiple interacting AI personalities with distinct characteristics such as gender, age, accent, and emotional intonation, while maintaining the capabilities of a frontier large language model.

How detailed can OCTAVE's voice and personality generation be?

OCTAVE can generate extremely nuanced voices and personalities with remarkable specificity, from a 'gravelly male voice as if gargling hot asphalt' to a 'New Zealand female wellness coach with a soothing, deliberately slow therapeutic voice'. The model can emulate precise vocal characteristics including vocational speaking styles, emotional tones, and even specific accent variations with professional-level detail.

What are the key capabilities of OCTAVE for developers and creators?

OCTAVE is designed to power AI systems that can communicate richly with humans while following detailed instructions and using tools. The model is well-suited for applications like creating multi-character audiobooks, generating podcast dialogues, producing video voiceovers, and developing conversational agents with highly customizable vocal and personality characteristics.
