LLMs & Generative AI - Page 3 of 48
Latest breakthroughs in large language models and generative AI shaping the future of artificial intelligence and machine learning.
Latest breakthroughs in large language models and generative AI shaping the future of artificial intelligence and machine learning.
Audio‑visual large language models promise to decode hours‑long video, but their inference cost climbs with every extra frame and sound snippet. The culprit?
PathoSage arrives at a moment when multimodal large language models are being tested on the gritty details of tissue slides.
Apple just rolled out the third generation of its foundation models, a suite it calls AFM 3. The lineup includes five models built in partnership with Google, split between on‑device and server‑based versions.
Why does this matter? When pre‑training frontier LLMs stretches across trillions of tokens and thousands of accelerators, every percentage point of step time translates into days of compute and hefty expense.
Why does this matter? As AI moves from answering questions to handling whole workflows, we’re trusting models with the very files we rely on—legal contracts, codebases, research notes. A new study shows that trust may be misplaced.
Four new, concrete tricks can tighten the loop between you and Claude Code, the AI‑driven coding assistant that’s been gaining traction among developers.
Jensen Huang says the token market is splitting into clear value tiers. Why does that matter? Until recently, generative AI behaved like any other SaaS product: you paid a monthly subscription, opened a chat, asked a question, and got an answer.
OpenAI is gearing up for its most extensive redesign of ChatGPT since the chatbot first hit the public eye in 2022.
Why does a neural net sometimes crawl when asked to capture a sharp spike? The answer lies in a phenomenon first highlighted in 2019: the spectral bias.
Fine‑tuning open‑weight language models into niche assistants has become routine, yet each round of task‑specific training can erode the safeguards baked into the original system.
Diffusion‑based language models generate text by repeatedly refining tokens, yet once a token is written it cannot be changed. This creates a “stability lag”: early choices remain vulnerable even after later processing.
Why does evaluating an LLM for the classroom matter? Traditional tests ask whether a model can spit out the right answer, but teaching is more than that.
Why does reliable multi‑step workflow matter for LLMs? Because the promise of autonomous agents hinges on more than impressive prompts—it hinges on consistency.
Why do multi‑agent systems built on large language models still choke on token bloat? Most designs rely on predefined roles, pipelines and turn‑taking schedules, yet they leave the actual messages between agents as free‑form text. The result?
Elon Musk’s xAI has been quietly leaning on Anthropic’s Claude to shape its own coding models. According to a report from The Information, the startup fed Claude’s output directly into its pre‑training pipeline for months.
Why do we care how a language model weighs tomorrow against today? As AI systems start handling choices that span weeks, months or even years, understanding the inner mechanics of those trade‑offs becomes more than an academic curiosity.
Errorquake‑10k puts 10,000 LLM responses on a 0‑4 severity scale, spanning eight domains and five difficulty tiers.
Thanks to contemporary large language models, natural‑language processing has become a core component of modern AI systems. Search engines, chatbots and automated routing all lean on NLP techniques to turn raw text into actionable data.
DeepSpeed just added support for Muon Optimizer, a tool that’s gaining traction in frontier AI labs. Moonshot AI, for example, has already integrated Muon into the training pipeline for its Kimi‑K2‑Thinking foundation model.
Anthropic says its flagship model, Claude, now writes more than 90 percent of the company’s production code—a stark shift from the way the startup has traditionally built software.
Learn to build AI-powered apps without coding. Our comprehensive review of No Code MBA's course.
Curated collection of AI tools, courses, and frameworks to accelerate your AI journey.
Get the week's most important AI news delivered to your inbox every week.