Gemini uses Model Context Protocol to book Uber and order DoorDash
Why does this matter? Because a conversational AI that can summon an Uber or fire off a DoorDash order without opening another app changes how we think about “chat‑first” experiences. Gemini, Google’s flagship model, now advertises the ability to complete those transactions from a single screen.
The trick isn’t magic; it’s a piece of middleware that lets the model speak the same language as the services it contacts. While the tech is impressive, the user never sees the back‑and‑forth between Gemini and the third‑party APIs. Instead, the interface jumps straight to the checkout confirmation you’d expect after you tap “confirm.” In other words, the heavy lifting happens behind the scenes, invisible to the end‑user.
That invisible layer is what the quote below refers to—the open‑source protocol that makes the hand‑off possible.
If there's a Model Context Protocol (MCP) integration--the open-source universal language that lets LLMs talk to third-party apps--then Gemini can run the task in the backend. (In this instance, you wouldn't see the whole process play out; you'd just see the final checkout step appear after you make the request.) There are also "App Functions" developers can build that allow Gemini to interface with their apps in a structured way. But if neither exists, Gemini can open the app itself and navigate through the buttons, text boxes, and menus to complete the task.
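MCP is built on JSON-RPC 2.0: a client (here, the model's runtime) invokes a tool exposed by a server with a `tools/call` request. As a minimal sketch of that message shape--the tool name and arguments below are hypothetical, since the actual Uber and DoorDash integrations aren't publicly documented--a request might look like this:

```python
import json

def mcp_tool_call(tool_name, arguments, request_id=1):
    """Build a JSON-RPC 2.0 'tools/call' request, the message shape
    an MCP client sends to invoke a tool exposed by an MCP server."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }

# Hypothetical tool and arguments, for illustration only.
request = mcp_tool_call(
    "request_ride",
    {"pickup": "123 Main St", "dropoff": "Airport Terminal 2"},
)
print(json.dumps(request, indent=2))
```

The point of the shared shape is that any service exposing its actions this way can be driven by any MCP-capable model, which is what lets the hand-off stay invisible to the user.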
"This is the first time we're doing this on Android with applications, and so getting this right is really important," Samat says. "I think it's an exciting step forward in technology. We sort of view this as the beginning of a new era of mobile intelligence, and Android is where we think you see the future first." Privacy concerns abound when it comes to granting Gemini access to your apps.
Samat says that's why Google hasn't included any overly sensitive apps in this first batch for task automation. He says this data is not used for advertising, and that users can delete the data that Gemini sees. "We do think it's really important that people have trust in the system, and that comes from having control and transparency of what it's doing." While the smartphone screen is still required at the moment to complete the task, Samat envisions a future where you can start these tasks through other devices--say, a pair of smart glasses, an AI pendant, or even a car.
(There will be several new Android XR-powered smart glasses launching this year.) He says the company is looking at ways to enable final authentication on those other devices.
Can a chatbot really handle a ride-hailing request without opening the app? Google says Gemini can, thanks to the open-source Model Context Protocol, which lets the model talk directly to services like Uber and DoorDash. Unlike Siri's half-hearted attempt that merely opened the Uber app, or Google Assistant's clunky "order my usual" feature that was later pulled, Gemini performs the transaction in the background and only presents the final checkout screen.
The article notes that the backend work is hidden from the user, which could simplify the experience. But the description leaves open how authentication, error handling, and user consent are managed. Moreover, the claim hinges on an MCP integration existing: if a third-party service doesn't adopt the protocol, Gemini's ability to act on its behalf may be limited.
The approach marks a step toward more seamless AI‑driven actions, but practical reliability and privacy safeguards have yet to be demonstrated. As the technology matures, its real‑world usefulness will need careful evaluation.
Further Reading
- Google builds Gemini right into Android, adding contextual awareness within apps - Engadget