LLMs & Generative AI - Page 9 of 55

Latest breakthroughs in large language models and generative AI shaping the future of artificial intelligence and machine learning.

1086 articles View complete article list

A close-up of a smartphone displaying on-device AI processing with NPU-powered diffusion model and Multi-Block Speculative De

Mobile NPU powers on‑device diffusion LLM with Multi‑Block Speculative Decoding

The slow drip of token-by-token generation has long been the bottleneck for running large language models on mobile devices. On-device diffusion LLMs promise privacy and responsiveness, yet their inference latency remains stubbornly high, until now.

June 15, 2026

• 4 min read

Professional musicians in a symphony orchestra conducting collaborative performance with advanced AI-powered omnichannel agen

Orchestra‑o1 Enables Efficient Omnimodal Agent Collaboration

Right now, your phone uses one AI for photos and another for text. They don't talk. This fragmentation is the central problem for labs aiming to build a machine that can truly see, read, and listen as one.

June 15, 2026

• 3 min read

AI-powered tool analyzing complex PDF data including charts, diagrams, and tables for enhanced document understanding and aut

Vision LLMs Expand PDF Parsing to Charts, Diagrams, and Tables

Traditional PDF parsing breaks down where there are no characters to read. OCR and layout engines fail on charts, diagrams, and figures, by design. But vision LLMs change the rules.

June 14, 2026

• 5 min read

Claude AI outperforms GPT-5.5 by 13 points in FrontierMath tier-4 tests, showcasing advanced reasoning and problem-solving ca

Claude Fable 5 beats GPT‑5.5 by 13 points on FrontierMath tier‑4 tests

Thirteen points is a thrashing. On FrontierMath's hardest tier, the new standard is set not by OpenAI, but by Anthropic's Claude Fable 5. This isn't a minor edge. It's a decisive lead. Fable 5 scored roughly 88 percent. GPT-5.5 managed 75.

June 13, 2026

• 3 min read

German court ruling: judge examines Google liable for AI-generated false content in search results, highlighting legal accoun

German Court Holds Google Liable for False AI-Generated Overviews

For a year, Google's lawyers saw this coming. Now it's here. A Berlin court has ruled the company is directly liable for the fabrications its AI Overviews produce. Those summaries, the judges stated, are new and independent statements.

June 13, 2026

• 3 min read

Google’s DiffusionGemma open-source AI model generating text from prompts with advanced diffusion technology for faster, effi

Google's DiffusionGemma: open diffusion model for faster text generation

**The old way of generating text is a bottleneck.** Token by token, each word waiting on the last. Google’s DiffusionGemma shatters that serial chain. Instead of predicting one piece at a time, it denoises an entire 256-token block in parallel.

June 12, 2026

• 3 min read

Google sues Chinese company for Telegram phishing scams using AI-powered Gemini technology, highlighting cybersecurity threat

Google sues Chinese Outsider Enterprise for Gemini-driven phishing on Telegram

Google has filed suit against a Chinese cybercrime operation that transformed its Gemini AI into a phishing assembly line. The group automated its scam campaigns on Telegram, recycling infrastructure once dedicated to blasting out SMS spam.

June 12, 2026

• 3 min read

VLA agents in PersonaDrive simulation training, observing human drivers performing road demo tests for autonomous vehicle dev

PersonaDrive conditions VLA agents on human driving demos for simulation

Human driving isn’t just about reaching a destination, it’s about style. Aggressive, conservative, or somewhere in between, the way a driver accelerates, brakes, and navigates defines how natural an autonomous agent feels in simulation.

June 12, 2026

• 4 min read

AI-powered framework audit analyzing large language model tool knowledge, showcasing advanced LLM capabilities beyond constra

ToolSense Framework Audits LLM Tool Knowledge Beyond Constrained Decoding

Most tests for AI tool use are rigged. They give the model the exact question and the exact answer format, then declare success. It's like teaching someone to bake by handing them a pre-assembled cake.

June 12, 2026

• 3 min read

Gemini Omni introduces AI-powered video generation with smart compute limits based on video complexity and resolution for opt

Gemini Omni adds AI video generation, using compute limits based on complexity and size

Google's latest Gemini model can now make videos. Not well, but that's beside the point. The important thing is the mechanism: a new, fluid system of rationing compute power that changes with each request. You get a budget.

June 12, 2026

• 3 min read

Xiaomi MiMo Code outperforms Claude Code in complex 200+ step tasks, showcasing advanced AI capabilities with free MiMo Auto

Xiaomi's MiMo Code beats Claude Code on 200+ step tasks, free MiMo Auto to V2.5

Here's the thing: Xiaomi just dropped MiMi Code, an open‑source coding assistant that claims to outpace Anthropic’s Claude Code on tasks that stretch beyond 200 steps.

June 12, 2026

• 4 min read

OpenAI CEO Sam Altman announces 2024 leadership hire of former Microsoft executive Guillaume Sottiaux, signaling major ChatGP

OpenAI hires Sottiaux in 2024, shifts from internal tools to ChatGPT overhaul

OpenAI's big move for 2024 was a personnel file. They hired Tibo Sottiaux, the engineer who built Codex, their internal coding tool. This wasn't just another recruitment.

June 11, 2026

• 3 min read

Mathematical diagram illustrating Kruskal-Rank adaptation where matrix rank remains constant at r while Kruskal rank drops to

Low Kruskal-Rank Adaptation Shows Matrix Rank Stays r, Kruskal Rank Falls to 1

Matrix rank offers a comforting illusion: it tells you an update has full capacity under its parameter budget. Kruskal rank shatters that comfort.

June 11, 2026

• 4 min read

Anthropic CEO apologizes during press conference about missing safeguards in Claude Fable, the first Mythos AI model, highlig

Anthropic apologizes for invisible guardrails on Claude Fable, first Mythos model

Anthropic just admitted it quietly rigged Claude Fable, its first Mythos model, to sabotage user queries.

June 11, 2026

• 3 min read

AI-assisted mediation platform comparing professional mediators during multi-issue negotiation test, showcasing technology-en

AI pre‑mediation matched professional mediators in multi‑issue negotiation test

Mediators are expensive, and their best work often happens before anyone sits at the table. This preparation is where settlements are built or broken. So what if you could skip the wait and the bill, and just let a machine do the groundwork?

June 11, 2026

• 4 min read

Diagram illustrating AVLLMs, Mirror VLM, and VideoLLM workflows for sequential audio-visual task processing, comparing model

AVLLMs Mirror VLM and VideoLLM Sequential Flow in Audio‑Visual Tasks

Models that process both sound and sight are often treated like alien minds. New research reveals a much more boring, and more useful, truth. They're not reinventing anything. They're just copying the plumbing.

June 10, 2026

• 4 min read

Advanced GPU-optimized inference architecture diagram showing vLLM leveraging custom GPU kernels, TorchInductor, and NVIDIA C

vLLM uses custom GPU kernels, TorchInductor and CUTLASS for portable inference

Portable inference across diverse hardware is a brutal optimization problem. vLLM attacks it with a triple threat: custom GPU kernels for raw performance, TorchInductor for graph-level fusion, and battle-tested GEMM libraries like CUTLASS and...

June 10, 2026

• 3 min read

Satirical illustration of a confused character labeled Claude Fable ignoring biology questions while a robot named Opus 4.8 c

Claude Fable declines basic biology queries; Opus 4.8 responds

Anthropic confirmed it. Their new Claude Fable 5 model is refusing to answer even elementary biology questions. The rationale, according to company spokesperson Paruul Maheshwary, is a deliberate hedge against risk.

June 10, 2026

• 3 min read

NVIDIA GPU cluster running DiffusionGemma for high-performance text generation, showcasing AI-powered text-to-image and langu

Run DiffusionGemma on NVIDIA GPUs for high‑throughput text generation

The line between experimentation and production has never been thinner. DiffusionGemma is changing what’s possible for high‑throughput text generation, and NVIDIA hardware is the engine that makes it real.

June 10, 2026

• 3 min read

SynIB framework illustration showcasing information bottleneck technique enhancing multimodal AI synergy with neural network

SynIB Introduces Information Bottleneck to Boost Multimodal Synergy

Most AI models are lazy. They take the easy way out. Confronted with data from different sources, like an image paired with text, they'll grab the most obvious cue from one and ignore the rest. This works fine for simple tasks.

June 10, 2026

• 4 min read

Browse Other Categories

AI Tools & Apps Business & Startups Research & Benchmarks Policy & Regulation Market Trends Open Source Industry Applications

LLMs & Generative AI - Page 9 of 55

Mobile NPU powers on‑device diffusion LLM with Multi‑Block Speculative Decoding

Orchestra‑o1 Enables Efficient Omnimodal Agent Collaboration

Vision LLMs Expand PDF Parsing to Charts, Diagrams, and Tables

Claude Fable 5 beats GPT‑5.5 by 13 points on FrontierMath tier‑4 tests

German Court Holds Google Liable for False AI-Generated Overviews

Google's DiffusionGemma: open diffusion model for faster text generation

Google sues Chinese Outsider Enterprise for Gemini-driven phishing on Telegram

PersonaDrive conditions VLA agents on human driving demos for simulation

ToolSense Framework Audits LLM Tool Knowledge Beyond Constrained Decoding

Gemini Omni adds AI video generation, using compute limits based on complexity and size

Xiaomi's MiMo Code beats Claude Code on 200+ step tasks, free MiMo Auto to V2.5

OpenAI hires Sottiaux in 2024, shifts from internal tools to ChatGPT overhaul

Low Kruskal-Rank Adaptation Shows Matrix Rank Stays r, Kruskal Rank Falls to 1

Anthropic apologizes for invisible guardrails on Claude Fable, first Mythos model

AI pre‑mediation matched professional mediators in multi‑issue negotiation test

AVLLMs Mirror VLM and VideoLLM Sequential Flow in Audio‑Visual Tasks

vLLM uses custom GPU kernels, TorchInductor and CUTLASS for portable inference

Claude Fable declines basic biology queries; Opus 4.8 responds

Run DiffusionGemma on NVIDIA GPUs for high‑throughput text generation

SynIB Introduces Information Bottleneck to Boost Multimodal Synergy

Featured Resources & Reviews

No Code MBA Course Review

AI Tools & Resources

Weekly AI Digest

Browse Other Categories