AI News Archive - Browse Page 16 of 160
Browse AI news articles covering LLMs, tools, research, and industry trends
AWS Agent Toolkit Shows Invocation, Success, UserError, SystemError Stats
AWS’s new Agent Toolkit tries to curb a familiar problem: agents that can spin up a Terraform script or a Lambda handler but do so on stale...
AMD Ryzen AI Max+ runs 122B‑parameter models locally with 128 GB UMA
Why does this matter? Because running today’s frontier open‑weight models no longer fits comfortably inside the 8–24 GB of VRAM that most discrete...
Semantic Search Model Assigns Class Labels and Confidence Scores to Critiques
“Beauty will save the world”—Fyodor Dostoevsky’s line opens a surprisingly practical discussion about how machines find meaning in text.
Synthetic 1,000‑Customer Dataset Uses Gender and Income to Test Bias
Machine‑learning pipelines, whether they run a classic classifier or a massive language model, carry a hidden risk: they can inherit the prejudices...
Pope Leo urges humanity amid AI-driven economic and social upheaval
Pope Leo XIV used his first major papal document, released Monday, to sound an alarm about artificial intelligence.
SciAtlas Introduces Large-Scale Knowledge Graph to Aid Automated Research
SciAtlas arrives as a response to the sheer volume of scholarly output that now spans dozens of fields.
Google outperforms OpenAI on math benchmark, winning 9 to 1 ratio
Google’s DeepMind team rolled out AlphaProof Nexus, an AI that pairs a large language model with the Lean proof assistant, and it has now produced...
Hotz warns AI coding agents could be costly despite 10x productivity boost
George Hotz, the programmer known for his work on tinygrad, has spent the last six months testing AI‑driven coding agents and comes away uneasy.
Accurate source citations boost AI answer quality, study finds
Why does this matter? Because getting the right answer isn’t enough if you can’t point to where it came from.
Google Antigravity 2.0 Retains Gemini CLI Features as Antigravity Plugins
Google Antigravity 2.0 landed on May 19 at I/O 2026, and it isn’t just an update—it’s a whole‑new platform.
FuRA uses spectral preconditioning with full‑rank SVD for efficient fine‑tuning
Fine‑tuning large language models has split into two camps. Full‑parameter updates give the model complete freedom but often overfit when data are...
Positional copying dominates answer readout in 1‑3B LMs on GSM8K
Why do tiny, instruction‑tuned models need a chain‑of‑thought prompt to solve math at all?
Study Introduces Orchestration Overhead Index to Measure AI Energy Costs
Current AI energy benchmarks still count watts per model call or per training epoch.
StepFun launches StepAudio 2.5 Realtime, evaluated via mobile app raters
Why does this matter? Because StepFun, a Shanghai‑based AI lab, just dropped StepAudio 2.5 Realtime, an end‑to‑end speech model that takes audio in...
Guide Shows How Python Connects to Existing AI Models via Custom Requests
Why does this matter? Because anyone who’s ever stared at a blank IDE can now see a clear path to an AI‑powered assistant.
Create a Claude Cowork‑Style Browser Agent with Playwright MCP and Claude Desktop
Claude Cowork moves AI out of the chat window and into the user’s own computer. Instead of answering questions, it actually clicks buttons, fills...
ByteDance study: LMMs answer questions better than full-page transcription
Multimodal AI models are being pushed to read ever‑longer documents—think PDFs that span hundreds of pages or video streams that run for hours.
Anthropic may keep supplying Claude to NSA despite Pentagon risk flag
Why does this matter? The Pentagon has labeled Anthropic a “supply chain risk,” yet the NSA may still receive its Claude models.
Claude Code auto‑creates AI scaling algorithms; new control allocates compute
Here's the thing: scaling large language models at inference time has usually been a hand‑crafted exercise.
SuperClaude workflow ranks security issues, details attack vectors, gives fixes
Here’s the thing: the SuperClaude Framework adds a structured layer to Anthropic’s API, turning raw model calls into a repeatable development...