AI News Archive - Browse Page 17 of 160
Browse AI news articles covering LLMs, tools, research, and industry trends
Agent explores once, then compiles branch‑free recipe to bypass LLM thereafter
Rahul Vir and Reya Vir lay out where the industry is headed. The AI‑prototype era is over; today’s teams are shipping autonomous agents that replace...
D&B rebuilds 642 million‑business database after AI agents hit limits
Why did D&B have to start from scratch? The answer lies in a data architecture that was never meant for autonomous agents.
Meta launches Forum: Reddit‑style advice within Facebook groups, AI‑assisted
Meta has rolled out a new iPhone‑only app called Forum, shifting Facebook Groups out of the main platform and into a standalone space.
CopilotKit launches AG-UI to bridge agent‑human interaction layer
Why does this matter? Because the tools that let autonomous agents talk to people have finally found a stable foundation.
AgentCo-op imports and refines searched workflows via component grounding
Designing multi‑agent workflows in open‑ended scientific settings has never been straightforward.
LLM‑RL Agent Manages CAD, CAE and Geometry Revision for Closed‑Loop Optimization
Why does this matter? In many manufacturing pipelines, designers bounce between CAD models and CAE analyses, only to hit a stubborn “semantic gap”...
SOLAR introduced as self‑optimizing autonomous agent for continual learning
LLMs have cracked many benchmarks, yet they stumble when the data they meet keeps changing.
Language Models Forecast Research Success Using 11,488 Comparative Idea Pairs
Why does it matter when a model can guess which experiment will work before any lab work begins?
OpenAI’s Q1 2026 adjusted margin slips to –122%, burning USD 1.22 per USD 1 earned
OpenAI’s first‑quarter numbers paint a stark picture. The adjusted operating margin slipped to minus 122 percent, meaning the firm lost $1.22 for...
VSAS‑Bench Introduces Standardized Real‑Time Evaluation for Visual Assistants
Streaming visual assistants are finally getting a benchmark that matches their real‑time nature.
F_Call_Analysis_Planner forwards Parent_Instruction to generate Selection_Rule
Most of the results were wrong. Even worse, the AI quickly learned which numerical ranges looked plausible and began spitting out convincing‑but...
Temporal Contrastive Transformer embeddings boost financial crime detection
Why does this matter? Financial institutions are constantly hunting for patterns that betray illicit activity, yet most detection pipelines still...
Quantum ML Hits Data Input Bottleneck: Processors Can't Read Images, Text
Quantum Machine Learning promises speedups, but the first hurdle appears before any quantum circuit runs: getting data onto the machine.
Experimental MLX Delegate Enables PyTorch Models on Apple Silicon GPUs
Apple Silicon has become a popular platform for running large language models locally.
OSCToM uses RL to generate adversarial scenarios testing high-order Theory of Mind
Why do large language models still stumble when asked to untangle layered social reasoning?
Gemma 4 Executes Sequential Tool Calls to Inspect Folder and Compute Results
In a recent Machine Learning Mastery tutorial, the authors showed a language model that could fetch weather, news, currency rates and the time from...
Alibaba's Qwen3.7-Max runs 35 hrs, self‑monitors reward‑hacking, supports Claude Code
Why does this matter? Because most large language models stumble when asked to keep a single thread of thought alive for hours on end.
Researchers use triplet loss to train high-quality Horn logic embeddings
The paper posted on arXiv (2605.20467v1) tackles a practical bottleneck in automated reasoning: turning logical statements into compact numeric...
Cohere launches Command A+ 218B sparse MoE, runs on two H100 GPUs using QAD
Cohere just put Command A+ on the table, an open‑source model aimed squarely at enterprise‑grade agentic workflows.
Gemini 3.5 Flash Shows Fast Responses in Free Account Tests
Google’s latest LLM family arrives with Gemini 3.5, and the first model on deck is Gemini 3.5 Flash.