Skip to main content
xAI introduces autonomous coding feature in Grok Build, enabling multi-step programming with verification for AI-driven goal

Editorial illustration for xAI adds /goal to Grok Build for autonomous multi-step coding with verification

xAI adds /goal to Grok Build for autonomous multi-step...

xAI adds /goal to Grok Build for autonomous multi-step coding with verification

2 min read

xAI just dropped a new mode called /goal inside its Grok Build terminal agent. The idea is simple: give the agent a sizable coding task, then let it run on its own until it finishes and verifies the result. Most developers today have to ping‑pong with an AI—prompt, watch it act, then manually check each step. /goal flips that loop, bundling planning, execution and verification behind a single command and a visible progress checklist.

While the mode isn’t a separate product, it lives inside Grok Build, xAI’s CLI‑based coding assistant that works against your local codebase, reading files, plugins and MCP servers out of the box. Verification isn’t limited to code review; the agent can also inspect webpages or run scripts to confirm completion. Users can steer a live run with status, pause, resume or clear.

Access isn’t free—only SuperGrok or X Premium Plus subscribers can enable the feature. The rollout hints at a shift toward more autonomous, end‑to‑end AI‑driven development workflows.

Every checklist item then shows as checked.

The Verification Step

The detail worth noting is verification. It continues until the task is completed and verified.

Verification, per xAI, can take three forms. The agent may review the code it produced. It may execute scripts to test the result.

This matters for autonomous runs. An agent that only edits files may report success before the change works. Building verification into the run means the agent tests its own output before it finishes.

Interactive Demo

Use Cases With Examples

Module migration is the main example.

Why this matters

We see xAI extending Grok Build with a new /goal mode that promises autonomous, long‑running coding sessions. The idea is simple: give the agent a sizable implementation task and then step back, letting it iterate until every checklist item appears checked. In practice, most coding work still involves a back‑and‑forth dance—prompt, act, verify—so the built‑in verification step is the feature that draws attention.

It claims to continue verification until the task is both completed and confirmed, which could reduce manual oversight for developers. Yet we wonder how reliable that verification is across diverse codebases, and whether the “checked” status truly reflects functional correctness or merely passes a superficial test. For founders, the promise of reduced supervision may look attractive, but the trade‑off between autonomy and hidden errors remains unclear.

Researchers might find a testbed for studying iterative AI‑driven development, though the lack of detail on verification criteria leaves open questions. Ultimately, the addition of /goal adds a new layer to AI‑assisted programming, but its practical impact will depend on how well the verification holds up in real projects.

Further Reading