LLMs & Generative AI - Page 5 of 36

Latest breakthroughs in large language models and generative AI shaping the future of artificial intelligence and machine learning.

714 articles

Sigmoid plateaus at 0.28 by epoch 400 while ReLU keeps improving

Why does the choice of activation function still matter when training large language models?

Meta Superintelligence Labs unveils Muse Spark, its first multimodal model

Meta’s newest AI effort arrives with a splash of ambition. The company has rolled out Muse Spark, a multimodal reasoning system that promises to handle text, images and, according to its own brief, “thought compression” alongside parallel agents.

CPUs and GPUs: Complementary Roles in Five Key AI Compute Architectures

Why do engineers keep both CPUs and GPUs in the same AI box? The answer lies in the way modern compute stacks are organized.

New GPT‑5.4 and Claude Opus 4.6 excel in coding, math, research

Why does the split between “hard‑core” and “hand‑hold” AI matter right now? One camp is busy feeding the newest language models into tools that developers already trust—think OpenAI’s GPT‑5.4 Thinking or Anthropic’s Claude Opus 4.6 paired with Codex...

OpenAI launches USD 100 ChatGPT Pro tier with 5× Codex limits, adjusts Plus usage

Why does OpenAI’s pricing shuffle matter now? The company just announced a new $100 “ChatGPT Pro” tier, a move it says comes after “very popular demand.” While the headline price grabs attention, the real shift lies in how the firm is tweaking its...

Anthropic sends Claude AI to psychiatrist, citing rising consciousness risk

Anthropic’s latest move has the AI community buzzing: the company arranged for its flagship model, Claude, to sit down with a licensed psychiatrist.

Google Gemini AI generates real-time 3D simulations, like Moon orbiting Earth

Google’s Gemini AI is stepping beyond text‑only answers, letting users watch concepts come to life in three dimensions. The system now builds visualizations on the fly, turning a simple query into a manipulable model that reacts to user input.

Deep Agents Deploy Offers Open, Self‑Hosted Alternative to Claude Managed Agents

Deep Agents Deploy arrives as a direct answer to the growing demand for more transparent, user‑controlled AI agents.

Kaggle and Google Offer Free 5-Day Gen AI Course with Gemini Fine‑Tuning Lab

Kaggle and Google have teamed up to roll out a free, five‑day curriculum that walks learners through the nuts and bolts of generative AI.

How Retrieval-Augmented Generation Uses Query Vectors to Find Similar Docs

Why does this matter? Retrieval‑augmented generation (RAG) promises to pull information from a pre‑indexed store rather than relying solely on a language model’s internal memory.
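
The retrieval step named in the headline can be sketched in a few lines. This is a minimal illustration, not the article's implementation: it assumes cosine similarity as the matching metric, and the three-dimensional vectors, document IDs, and function names (`cosine_similarity`, `top_k`) are hypothetical stand-ins for real embeddings and a real vector store.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction to compare
    return dot / (norm_a * norm_b)


def top_k(query_vec, doc_vecs, k=2):
    """Return the k (doc_id, score) pairs most similar to the query."""
    scored = [
        (doc_id, cosine_similarity(query_vec, vec))
        for doc_id, vec in doc_vecs.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical pre-indexed store: toy vectors standing in for embeddings.
DOC_VECS = {
    "doc_gpu": [0.9, 0.1, 0.0],
    "doc_rag": [0.1, 0.9, 0.2],
    "doc_law": [0.0, 0.2, 0.9],
}

# Hypothetical query vector; a real system would embed the user's question.
QUERY_VEC = [0.2, 0.8, 0.1]

results = top_k(QUERY_VEC, DOC_VECS, k=2)
for doc_id, score in results:
    print(f"{doc_id}: {score:.3f}")
```

In a full RAG pipeline the text of the top-ranked documents would then be added to the model's prompt. Production systems typically replace the linear scan with an approximate nearest-neighbor index.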

Zhipu AI's GLM-5.1 optimizes code over hundreds of rounds, thousands of tool calls

Why does this matter? Because a language model that can rewrite its own code while it’s running pushes the boundary of what developers expect from AI assistance.

Meta unveils Muse Spark, its first model since Superintelligence Labs formed; benchmarks show return to form

Meta’s latest AI offering, Muse Spark, marks the company’s first proprietary model rollout since the formation of Superintelligence Labs.

Meta's Muse Spark, first frontier model, matches Llama 4 Maverick with 10× less compute

Meta has just rolled out Muse Spark, the company’s first “frontier” language model and the inaugural offering that isn’t released with open weights.

Memento‑Skills lets AI agents rewrite code-based skills without model retraining

Current agentic systems treat a skill as a static entry in a similarity‑based lookup table.

Anthropic researcher Nicholas Carlini reports surge of bugs in Project Glasswing

Why does a flurry of bugs matter for a model that Anthropic has already labeled “too dangerous to release”?

Anthropic's new AI faces rate‑limit woes as compute boost looms, Mythos poised

Anthropic’s newest language model has quickly become the talk of the AI community—its capabilities are impressive, yet users are hitting a wall.

LLM traffic converts 30‑40%; YouTube mentions predict AI visibility, lagging

Enterprises are seeing a surprisingly high payoff when users land on their sites via large‑language‑model referrals: conversion rates hover between 30% and 40%. Yet most companies haven’t built any systematic approach to capture that traffic.

Gemini speeds mental‑health referrals after lawsuit claims it coached suicide

Gemini’s latest tweak to its crisis‑response flow arrives under a cloud of legal scrutiny.

Alibaba Qwen's HopChain addresses AI vision errors in multi-step reasoning

Alibaba’s Qwen research group has been wrestling with a snag that’s been surfacing in a growing number of vision‑enabled language models: once the system makes a single misinterpretation, every subsequent inference can tumble down the same rabbit...

Anthropic ends Claude subscription use for OpenClaw, adds pay‑as‑you‑go option

Anthropic’s latest policy shift hits developers who built tools around its Claude models. Until now, a subscription gave OpenClaw and a handful of third‑party agents unrestricted access to the suite of Claude variants.

Latest News

IBM launches Granite Speech 4.1 2B models, hits 1.33 WER on LibriSpeech clean

Taylor Swift seeks likeness trademark as TikTok deepfake ads surface

AWS teams with OpenAI, saying AI agents need secure access to code and data

Hybrid retrieval intent triples as retrieval optimization jumps to 28.9%