LLMs & Generative AI
Latest breakthroughs in large language models and generative AI shaping the future of artificial intelligence and machine learning.
OpenAI just rolled out a refreshed version of the image generation engine that sits behind ChatGPT.
Why does a simple coffee order feel like a tech maze? Starbucks rolled out a ChatGPT‑powered app that promises conversational ordering, yet the experience often stalls at the very first request.
OpenCode’s latest update brings the Qwen3‑Coder model into its toolbox, but the addition isn’t automatic. To get the new engine running, users must point the IDE at the right configuration file.
Yelp’s first foray into conversational AI arrived in early 2024, when a modest chatbot helped users sift through listings for plumbers, electricians and other service providers.
Why does this matter? Because the Qwen 3.6‑35B‑A3B demo isn’t just another language model showcase—it stitches together multimodal inference, thinking‑control, tool calling, MoE routing, RAG and session persistence into a single pipeline.
Microsoft’s Phi‑4‑Mini has been making the rounds in developer forums as a compact, 3.8‑billion‑parameter model that packs a surprising amount of capability into a decoder‑only architecture.
OpenAI just rolled out GPT‑5.4‑Cyber, a fine‑tuned version aimed squarely at verified security teams.
Moonshot AI and researchers from Tsinghua have rolled out a new cross‑datacenter KVCache system they call PrfaaS. The design promises to keep large language models humming efficiently, even as request loads ebb and flow.
Running a 1‑bit language model on a consumer‑grade GPU used to feel like a niche experiment.
Anthropic just rolled out Claude Opus 4.7, a model that promises sharper code generation, higher‑resolution vision and longer‑horizon reasoning.
OpenAI's latest API documentation offers a tantalizing glimpse into the company's future model roadmap, revealing a potential preview of GPT-4o's capabilities.
Microsoft's latest open-source tool promises to simplify document processing for developers and data professionals.
Why does this matter? Because turning raw meeting transcripts into actionable data used to be a manual slog.
Why does a tiny JSON object matter in a world where LLMs swallow gigabytes of context?
Crawl4AI has been moving from isolated snippets to end‑to‑end pipelines that actually scrape, clean and structure data.
Why does a chatbot need to remember you beyond the last exchange? While most AI assistants reset after each session, developers are experimenting with a layer that stores snippets of past interactions, stitching them into a coherent narrative that...
Why does a transformer matter for a lattice of spins? While the hype around large language models is loud, the real test is whether they can capture quantum correlations that have long frustrated histories.
Why does the way you feed a prompt into an open‑weight model matter? While the GPT‑OSS repository ships with the core model, it leaves developers to figure out the plumbing that turns a user’s message into tokens and then pushes those tokens through...
Schematik’s new “Cursor for Hardware” platform has caught the eye of big‑name AI labs and venture firms alike.
Google’s new Auto‑Diagnose system promises to sift through integration‑test failures using a large language model, then hand its findings back to developers.