LLMs & Generative AI
Latest breakthroughs in large language models and generative AI shaping the future of artificial intelligence and machine learning.
Google is widening the reach of its Meet AI assistant, moving it from a niche Android‑only test to a feature anyone can tap during a face‑to‑face session.
OpenAI’s latest benchmark results have nudged the company back to the top of the image‑generation leaderboard, a spot it briefly lost to competitors earlier this year.
The buzz around artificial intelligence has moved from tech circles to town halls, and it’s doing so at a speed that even seasoned pollsters find unsettling.
OpenAI just rolled out a refreshed version of the image generation engine that sits behind ChatGPT.
Why does a simple coffee order feel like a tech maze? Starbucks rolled out a ChatGPT‑powered app that promises conversational ordering, yet the experience often stalls at the very first request.
OpenCode’s latest update brings the Qwen3‑Coder model into its toolbox, but the addition isn’t automatic. To get the new engine running, users must point the IDE at the right configuration file.
Yelp’s first foray into conversational AI arrived in early 2024, when a modest chatbot helped users sift through listings for plumbers, electricians and other service providers.
Why does this matter? Because the Qwen 3.6‑35B‑A3B demo isn’t just another language model showcase—it stitches together multimodal inference, thinking‑control, tool calling, MoE routing, RAG and session persistence into a single pipeline.
Microsoft’s Phi‑4‑Mini has been making the rounds in developer forums as a compact, 3.8‑billion‑parameter model that packs a surprising amount of capability into a decoder‑only architecture.
OpenAI just rolled out GPT‑5.4‑Cyber, a fine‑tuned version aimed squarely at verified security teams.
Moonshot AI and researchers from Tsinghua have rolled out a new cross‑datacenter KVCache system they call PrfaaS. The design promises to keep large language models humming efficiently, even as request loads ebb and flow.
Running a 1‑bit language model on a consumer‑grade GPU used to feel like a niche experiment.
Anthropic just rolled out Claude Opus 4.7, a model that promises sharper code generation, higher‑resolution vision and longer‑horizon reasoning.
OpenAI's latest API documentation offers a glimpse of the company's model roadmap, including a possible preview of GPT-4o's capabilities.
Microsoft's latest open-source tool promises to simplify document processing for developers and data professionals.
Why does this matter? Because turning raw meeting transcripts into actionable data used to be a manual slog.
Why does a tiny JSON object matter in a world where LLMs swallow gigabytes of context?
Crawl4AI has been moving from isolated snippets to end‑to‑end pipelines that actually scrape, clean and structure data.
Why does a chatbot need to remember you beyond the last exchange? While most AI assistants reset after each session, developers are experimenting with a layer that stores snippets of past interactions, stitching them into a coherent narrative that...
Why does a transformer matter for a lattice of spins? While the hype around large language models is loud, the real test is whether they can capture quantum correlations that have long frustrated histories.
Learn to build AI-powered apps without coding. Our comprehensive review of No Code MBA's course.