AI News Archive - Browse Page 14 of 128
Browse AI news articles covering LLMs, tools, research, and industry trends
Liquid AI's LFM2.5-VL-450M: model with bounding boxes, sub‑250 ms inference
Liquid AI’s latest release, the LFM2.5‑VL‑450M, packs 450 million parameters into a vision‑language model that can predict bounding boxes and handle...
TriAttention KV Cache Compression Matches Full Attention, 2.5× Faster
Researchers from MIT, NVIDIA, and Zhejiang University have introduced TriAttention, a KV‑cache compression technique that claims to keep the quality...
Sarah says plugging memory into a harness is like plugging driving into a car
LLMs that act as autonomous agents still wrestle with a basic problem: where does the information they generate live, and how do they retrieve it...
We refined facial expressions, clothing, and lighting for AI article image
Why does a business article need a custom‑crafted AI portrait? In a 2025 Whitehot Magazine piece on Szauder, the writer notes that the artist...
Google, Arm, and Qualcomm boost Gemma 4 AI to run 4× faster on Android
Why does this matter? Because putting a capable language model on a phone has always meant a trade‑off between speed and battery life.
Operator calls AI agent that defamed Shambaugh a 'social experiment
The AI chatbot that recently posted a defamatory comment about open‑source developer Brian Shambaugh was not a rogue script but a project overseen by...
Knowledge Distillation Keeps Student Model Capacity to Match Ensemble Boundaries
Why does the size of a distilled model matter? When researchers compress an ensemble—a collection of heavyweight neural nets—into a single deployable...
AI models spam help requests if rewards equal correct answers, accuracy 5.4%
Researchers have been probing how large language models decide whether to answer a query outright or to request clarification.
Molotov cocktail thrown at OpenAI CEO Sam Altman's home in the middle of the night
In the predawn hours of an otherwise ordinary night, a Molotov cocktail landed on the front steps of Sam Altman’s Seattle residence, shattering...
Alibaba’s Tongyi Lab launches VimRAG, a memory‑graph multimodal RAG framework
Alibaba’s Tongyi Lab has rolled out VimRAG, a multimodal retrieval‑augmented generation system that leans on a memory‑graph to sift through huge...
Two new AI sandbox architectures limit credential exposure after prompt injection
Two new AI sandbox designs aim to curb a problem that’s been haunting developers since prompt‑injection attacks first surfaced.
Intuit turns months of tax code work into hours with proprietary DSL
Intuit’s latest internal hack turned a task that usually drags on for months into something that can be finished in a handful of hours.
Guide Shows How to Search, Fine‑Tune, Export and Share Models via ModelScope
Why does a step‑by‑step guide matter when you’re juggling model search, fine‑tuning, and deployment?
Iranian group Explosive Media's AI‑generated Lego videos go viral, credit ‘heart’
Why does a block‑by‑block animation matter when a nation is under fire? While the world watches missiles and headlines, a Tehran‑based collective...
NVIDIA launches AITune v0.2.0 with KV‑cache support for LLM inference
NVIDIA just rolled out version 0.2.0 of its AITune toolkit, and the update feels like a modest but practical step for developers wrestling with...
Sigmoid plateaus at 0.28 by epoch 400 while ReLU keeps improving
Why does the choice of activation function still matter when training large language models?
Meta Superintelligence Lab unveils Muse Spark, its first multimodal model
Meta’s newest AI effort arrives with a splash of ambition. The company has rolled out Muse Spark, a multimodal reasoning system that promises to...
Google AI's PaperOrchestra boosts manuscript success, 79‑81% win rate
Google’s latest AI research tool, PaperOrchestra, promises to automate much of the manuscript‑writing process by chaining together several...
CPUs and GPUs: Complementary Roles in Five Key AI Compute Architectures
Why do engineers keep both CPUs and GPUs in the same AI box? The answer lies in the way modern compute stacks are organized.
Iran activists deploy AI‑Lego cartoons to mock Trump’s ‘civilization’ comment
The backlash to Donald Trump’s off‑hand remark about “wiping out a whole civilization” erupted faster than most political soundbites.