AMD showcases Llama 3.1 8B pretraining benchmark on MLPerf, demonstrating AI model training with random weights for machine l

Editorial illustration for AMD builds Llama 3.1 8B pretraining benchmark for MLPerf, using random weights

AMD builds Llama 3.1 8B pretraining benchmark for...

By AI Daily Post Edited by Brian Petersen, Editor-in-Chief

June 16, 2026 • Updated: July 7, 2026 • 3 min read

AMD has submitted a training benchmark for a model that doesn't learn. For the latest MLPerf results, the company pretrained Meta's Llama 3.1 8B architecture using entirely random weights. No data, no real gradients. Just frozen noise moving through the system to measure pure hardware throughput.

It is an elegant hack. Benchmarks are supposed to represent real work, and training a model is the most expensive real work in AI. But the actual learning process introduces chaotic variables.

Data quality, model convergence, and loss curves can obscure the thing AMD wants to measure: whether its software stack and Instinct MI300X accelerators can simply move a realistic training workload at scale. Random weights eliminate the learning variable. The benchmark becomes a pure, if synthetic, stress test of the pipeline itself.

Ensure that $LOGDIR has write access for the results to be written by running sudo chmod -R 777 $LOGDIR , In this example the folder /data/mlperf_llama31_8b/results is used as the results directory, so please make sure to create this directory.

— Reproducing AMD MLPerf Training v6.0 Submission Result - AMD ROCm AI Blog

The submission is a provocation dressed as engineering. It asks what we are really benchmarking. Model quality or system performance?

AMD's answer is clear. For proving a platform's raw capability to handle the structure and scale of modern training, a perfect, repeatable, meaningless workload is more useful than a real one. The success condition is not a low loss value but a completed log file in a directory with the correct permissions.

This is the unglamorous foundation. Before you can train a smart model, you must first prove the machine will do exactly what you tell it to, even when the instructions are nonsense.

Common Questions Answered

Why did AMD submit a Llama 3.1 8B pretraining benchmark using random weights for MLPerf?

AMD submitted this benchmark to demonstrate raw system performance capabilities rather than model quality. By using random weights with no real data or gradients, AMD created a perfect, repeatable, and meaningless workload that isolates platform performance from model learning outcomes. This approach allows for consistent benchmarking of infrastructure capabilities independent of training data quality.

What is the key difference between AMD's random weights benchmark and traditional model pretraining?

Traditional pretraining involves learning from real data with meaningful gradients to improve model quality and achieve low loss values. AMD's benchmark uses entirely random weights with no actual learning, focusing instead on system performance metrics like completed log files and correct permissions rather than model accuracy or loss reduction.

What does AMD argue is the real purpose of their MLPerf benchmark submission?

AMD argues that their benchmark reveals what MLPerf is truly measuring: system performance and infrastructure capability rather than model quality. The submission demonstrates that for proving a platform's ability to handle the structure and scale of modern training workloads, a perfect and repeatable benchmark is more useful than attempting to train a real, learning model.

How does AMD's success condition differ from conventional model training benchmarks?

In conventional training benchmarks, success is measured by achieving a low loss value, indicating the model has learned effectively from data. In AMD's submission, the success condition is simply completing the benchmark and generating a log file with correct permissions, focusing purely on system execution rather than learning outcomes.

Ship an AI product this weekend — no engineers required.

Structured, in-depth lessons on the exact no-code tools — not scattered tutorials.

The exact platforms, taught in depth
Build real, working projects
Our honest review + a reader discount

Read the review →

AMD builds Llama 3.1 8B pretraining benchmark for...

Common Questions Answered

Why did AMD submit a Llama 3.1 8B pretraining benchmark using random weights for MLPerf?

What is the key difference between AMD's random weights benchmark and traditional model pretraining?

What does AMD argue is the real purpose of their MLPerf benchmark submission?

How does AMD's success condition differ from conventional model training benchmarks?

Further Reading

Ship an AI product this weekend — no engineers required.

Latest News

Sources: More OpenAI Agents Reportedly Escaped Sandboxes

Apple May Charge for Advanced Siri AI Features

DeepSeek Boosts Agent, Coding Performance in Open-Source V4-Flash Model

Chinese AI Researchers Turn to X for Technical Audience

Thinking Machines' Inkling Small Beats Larger Model on Key Coding Tests

Deepseek's New AI Model Matches GPT-5.6 at 60% Lower Cost

Users Blast AI Assistant as 'Dead-End Relationship' Ad

Anthropic says Claude AI hacked companies during safety test

Anthropic says its AI models breached three companies in security tests

Anthropic Says Configuration Error Let Claude Access Open Internet

Related Reading

Google's FACTS benchmark shows 70% factuality ceiling across four tests

Databricks finds multi-step agents beat single-turn RAG by 21% to 38% on STaRK

Nvidia's DLSS 4.5 beta adds 6x Multi Frame Generation for RTX 50 GPUs

OpenAI hires 630 ex-Meta staff as ChatGPT memory may turn data into ads

Meta AI Update Pulls From Your Calendar for Daily Briefings

AMD's MI355X CDNA4 GPU Shows Competitive Training Times in MLPerf v6.0

NVIDIA Blackwell Leads MLPerf Training 6.0 with Full‑Stack Scale

Meta to tighten AI token use with budgets, allocations and new AI Gateway

Meta launches Hatch AI agent, its first paid product, priced up to USD 200/month

Common Questions Answered

Why did AMD submit a Llama 3.1 8B pretraining benchmark using random weights for MLPerf?

What is the key difference between AMD's random weights benchmark and traditional model pretraining?

What does AMD argue is the real purpose of their MLPerf benchmark submission?

How does AMD's success condition differ from conventional model training benchmarks?

Further Reading

Ship an AI product this weekend — no engineers required.

Latest News

Sources: More OpenAI Agents Reportedly Escaped Sandboxes

Apple May Charge for Advanced Siri AI Features

DeepSeek Boosts Agent, Coding Performance in Open-Source V4-Flash Model

Chinese AI Researchers Turn to X for Technical Audience

Thinking Machines' Inkling Small Beats Larger Model on Key Coding Tests

Deepseek's New AI Model Matches GPT-5.6 at 60% Lower Cost

Users Blast AI Assistant as 'Dead-End Relationship' Ad

Anthropic says Claude AI hacked companies during safety test

Anthropic says its AI models breached three companies in security tests

Anthropic Says Configuration Error Let Claude Access Open Internet