Lyria 3: AI Transforms Images into Custom Music Tracks
Lyria 3 supports image‑to‑music input, shaping audio in Google AI Studio
Google’s latest foray into generative sound arrives with a model that pushes past the usual text prompts. Lyria 3, unveiled under the “Build with Lyria 3, our newest music generation model” banner, promises creators a way to tie visual cues directly to audio output. While earlier versions required you to describe a vibe in words, this iteration lets you drop an image and let the system infer mood, style and atmosphere.
The shift matters because it blurs the line between visual and auditory storytelling, offering a more intuitive workflow for designers, game developers, and marketers who already juggle graphics and sound. Google AI Studio is rolling out a dedicated music‑generation experience so users can test the feature right away. That immediate access lowers the barrier to experimentation, turning a concept that once felt speculative into a hands‑on tool.
The result? A preview of how multimodal inputs could reshape creative pipelines—if the technology lives up to its promise.
- Multimodal image-to-music input: Beyond text, Lyria 3 supports multimodal inputs. You can provide an image to influence the mood, style and atmosphere of the audio. Try Lyria 3 in Google AI Studio
To help developers start experimenting immediately, Google is also launching a new music generation experience in AI Studio.
Using a paid API key, this dedicated workspace provides a first-class environment to create with Lyria 3 and explore its advanced features like image-to-music. Inside the playground, you can explore two powerful creation modes for music:

- Text mode: Describe the music you want to hear using natural language, including parameters like Tempo or Key.
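As a rough illustration of the text-mode workflow, the sketch below folds parameters like Tempo and Key into a natural-language prompt and packages it as a JSON request body. The article does not publish the Lyria 3 wire format, so the model identifier (`"lyria-3"`) and every field name here are assumptions for illustration, not the official Gemini API schema.

```python
import json

# NOTE: hypothetical request shape. The real Gemini API schema for Lyria 3
# is not specified in this article; model name and fields are assumptions.
LYRIA_MODEL = "lyria-3"  # assumed model identifier

def build_text_mode_request(description, tempo_bpm=None, key=None):
    """Fold musical parameters such as Tempo and Key into a text prompt."""
    prompt = description
    if tempo_bpm is not None:
        prompt += f" Tempo: {tempo_bpm} BPM."
    if key is not None:
        prompt += f" Key: {key}."
    body = {"model": LYRIA_MODEL, "prompt": prompt}
    return json.dumps(body)

payload = build_text_mode_request(
    "A warm lo-fi track with soft piano and vinyl crackle.",
    tempo_bpm=72,
    key="F major",
)
```

The point of the helper is simply that text mode accepts musical constraints as plain language; whatever the production endpoint looks like, the prompt itself carries the Tempo and Key information.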
Is this the next step for AI‑driven composition? Lyria 3 and its Pro variant have entered public preview through the Gemini API and Google AI Studio, offering developers a model that claims deep musical awareness paired with structural coherence. The rollout includes a new music‑generation experience that lets users test high‑fidelity pieces—vocals, verses and choruses—while the system aims to keep consistency from the opening note to the final bar.
Beyond text prompts, Lyria 3 accepts image inputs, allowing an uploaded picture to shape mood, style and atmosphere, a feature highlighted in the Studio preview. Yet, it’s unclear whether the multimodal approach will translate into broader adoption or meaningful improvements over existing tools. The preview status means performance limits and real‑world robustness remain unverified.
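To make the image-input idea concrete, here is a minimal sketch of attaching an image to a music-generation request by inlining it as base64. Again, the field names (`image`, `mime_type`, `data`) and the model identifier are hypothetical; the article does not document the actual Lyria 3 request format.

```python
import base64
import json

def build_image_mode_request(image_bytes, mime_type="image/png", hint=""):
    """Encode an image inline so it can shape mood, style and atmosphere.

    Hypothetical payload shape -- field names are assumptions, not the
    documented Gemini API surface for Lyria 3.
    """
    body = {
        "model": "lyria-3",  # assumed model identifier
        "prompt": hint,      # optional text alongside the image
        "image": {
            "mime_type": mime_type,
            "data": base64.b64encode(image_bytes).decode("ascii"),
        },
    }
    return json.dumps(body)

# Stand-in bytes (the PNG magic number); in practice you would read a real
# file, e.g. open("cover.png", "rb").read().
fake_png = b"\x89PNG\r\n\x1a\n"
request = build_image_mode_request(fake_png, hint="Moody, rain-soaked city at night")
```

Combining an image with a short text hint, as above, mirrors how the Studio preview describes the feature: the picture supplies the mood while any accompanying text can narrow the style.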
For teams ready to experiment now, the platform provides immediate access, but developers will need to assess how well the model integrates with their pipelines and whether the promised musical continuity holds up under diverse use cases.
Common Questions Answered
How does Lyria 3 differ from previous music generation models in terms of input?
Lyria 3 introduces a multimodal approach that allows image-to-music input, moving beyond traditional text-based prompts. Creators can upload an image and have the AI generate music that captures the mood, style, and atmosphere suggested by the visual input.
Where can developers access and experiment with Lyria 3's music generation capabilities?
Google has launched a dedicated music generation experience in Google AI Studio, where developers can access Lyria 3 through a paid API key. This workspace provides a comprehensive environment for exploring the model's advanced features, including the innovative image-to-music generation.
What makes Lyria 3's music generation approach unique in terms of musical coherence?
Lyria 3 aims to maintain structural consistency throughout a musical piece, ensuring that the generated audio remains coherent from the opening note to the final bar. The model claims to have deep musical awareness, allowing it to create high-fidelity compositions that include nuanced elements like vocals, verses, and choruses.