AI News Archive - Browse Page 31 of 160
Browse AI news articles covering LLMs, tools, research, and industry trends
GPT-5.5 scores 71.4% on expert cybersecurity tasks, edging Mythos Preview's 68.6%
Why does this matter? Because the latest round of AI‑driven security tests pits OpenAI’s GPT‑5.5 against the much‑talked‑about Mythos Preview in a...
OpenAI activates default marketing cookies for free ChatGPT users
OpenAI has quietly shifted how it handles data for the millions of people who use its free ChatGPT interface.
Harvard study finds OpenAI's o1 and 4o outdiagnose ER doctors in 76‑patient test
Harvard’s latest foray into clinical AI pits cutting‑edge language models against seasoned physicians in a real‑world setting.
OpenAI releases Symphony, an open-source spec to let agents self‑manage tasks
While developers have been wrestling with endless ticket queues, the bottleneck isn’t the code—it’s the human eyes needed to triage each item.
2021 EDEN-unbiased quantizer beats 2026 successor in average accuracy
Why does a 2021 quantizer still matter when a 2026 version is on the books? The answer lies in how the two methods treat the numbers they compress.
Elon Musk and Sam Altman head to court in legal fight over OpenAI’s future
Why does a courtroom showdown between Elon Musk and Sam Altman matter to anyone outside the tech bubble?
US benchmark shows China lagging; Deepseek model underperforms private tests
The Center for AI Security and Innovation (CAISI), housed within NIST, has just released a multi‑domain benchmark that pits China’s newest...
AI tracks surge on streaming services, says Deezer researcher Manuel Moussa
Why does this matter? Because streaming platforms are suddenly awash in songs that no human ever wrote.
Eight tech giants sign Pentagon AI contracts; Anthropic warns of legal loopholes
Eight tech giants have just inked Pentagon contracts aimed at building an “AI‑first fighting force” that will operate across classified networks.
Microsoft adds AI legal agent to Word to flag contract risks and suggest edits
Microsoft is turning its flagship word processor into a first‑line legal assistant.
LlamaIndex CEO: AI scaffolding collapses as models surpass humans on massive data
The LlamaIndex chief executive has a blunt assessment: the middle‑tier “scaffolding” that once glued data‑preparation tools to large language models...
Anthropic could secure USD 900 billion-plus valuation in two‑week round, sources say
Anthropic is on the brink of a financing sprint that could push its market cap past $900 billion, and the window to close the deal may be as tight as...
ChatGPT's 'Nerdy' tweak rewards goblin metaphors in answers, study finds
Why does a whimsical goblin keep popping up in ChatGPT’s answers? A recent analysis of the model’s output uncovered a pattern that traces back to a...
xAI releases Grok 4.3 with low price and fast voice‑cloning suite
xAI has rolled out Grok 4.3 with a price tag that undercuts most competitors, and it bundles a voice‑cloning feature that claims to generate speech...
Portfolio now serves as resume: GitHub and personal site essential
If you're a technical person, GitHub matters and a personal website matters. In a hiring landscape awash with AI‑generated résumés, the signal that...
Google DeepMind AI co‑clinician beats GPT‑5.4 in blind tests, lags docs
DeepMind’s latest “AI co‑clinician” has just outscored GPT‑5.4 in a series of blind assessments, yet it still falls short of seasoned doctors.
Musk vs. Altman trial begins as DOJ cuts voting‑rights unit, AI job panic examined
The federal courtroom in San Francisco opened its doors this week to a case that pits two of Silicon Valley’s most visible AI figures against each...
Anthropic benchmark says Claude matches experts, 23 tasks remain ambiguous
Anthropic has rolled out a fresh benchmark aimed at gauging Claude’s performance against seasoned bioinformatics specialists.
SMG releases smg-grpc-proto on PyPI; vLLM integrates via PR #36169
SMG has taken a concrete step toward modular LLM serving by publishing its gRPC definitions as a PyPI package named smg‑grpc‑proto.
Grok Voice Think Fast 1.0 lets non‑programmers design agents via console.x.ai
Grok Voice Think Fast 1.0 promises a shortcut for anyone curious about voice‑driven AI, sidestepping the usual code‑heavy entry barrier.