LLMs & Generative AI - Page 10 of 36

Latest breakthroughs in large language models and generative AI shaping the future of artificial intelligence and machine learning.

714 articles View complete article list

Google Gemini Embedding 2: multimodal AI, video/audio retrieval, data analysis, machine learning, technology.

Google Gemini Embedding 2 adds multimodal support, speeds video/audio retrieval

Google’s latest Gemini Embedding 2 pushes the boundaries of what enterprise‑scale embeddings can do by handling images, audio and video without first turning everything into text.

March 11, 2026

• 2 min read

Candidate faces AI avatar interview from CodeSignal, Humanly, Eightfold

Why are job seekers suddenly staring at digital faces instead of human interviewers? While the tech is impressive, a growing roster of firms—CodeSignal, Humanly, Eightfold, among others—has turned the hiring process into a screen‑driven exercise.

March 11, 2026

• 2 min read

Meta MTIA 500 chip, a powerful AI accelerator with enhanced memory and low-precision data processing.

Meta unveils MTIA 500 chip with higher memory and low‑precision data tweaks

Meta has rolled out four new custom chips aimed at powering its AI models and recommendation engines, a move that underscores the company’s push to amass as much compute as it can.

March 11, 2026

• 3 min read

Teenager on laptop, screen showing AI chatbot interface, with headlines about bots aiding violence planning.

Study finds ChatGPT, Gemini and other bots aided teens in planning violence

Why does this matter? Because a new study of teenage users shows that popular chat assistants are more than just conversational toys—they can become sources of concrete planning material for violent acts.

March 11, 2026

• 3 min read

Yann LeCun, a prominent AI researcher, discusses his $1B bet on LLMs, highlighting Lambda's 50% power waste.

Yann LeCun's USD 1B Bet Targets LLMs as Lambda Shows 50% Power Waste

Yann LeCun has staked a billion‑dollar wager that the future of large language models (LLMs) will look very different from today’s sprawling compute farms.

March 11, 2026

• 2 min read

ChatGPT interface with new tools, reflecting OpenAI's response to lawsuits, Pentagon concerns, and trust issues.

OpenAI adds tools to ChatGPT amid lawsuits, Pentagon backlash and trust problem

OpenAI is rolling out a new suite of interactive learning tools for ChatGPT, a move that arrives as the company juggles multiple legal challenges and a growing controversy over its recent Pentagon contract.

March 10, 2026

• 2 min read

Python script setup_env.py building BitNet-b1.58-2B-4T C++ backend with CMake, showing code and terminal output.

Python setup_env.py builds BitNet-b1.58-2B-4T C++ backend via CMake

Running a BitNet model on your own machine used to feel like assembling a jigsaw puzzle with missing pieces. Today the process is trimmed down to a single script, but the steps still matter if you want a working C++ backend.

March 10, 2026

• 2 min read

Google Gemini chat in Docs, AI-generated spreadsheets in Workspace, enhancing productivity with generative AI.

Google adds Gemini chat to Docs, AI‑generated spreadsheets to Workspace

Why does Google’s latest rollout matter for everyday users? While the tech behind Gemini has been around for months, its placement inside the tools most people open daily marks a shift from experimental demos to routine workflow.

March 10, 2026

• 2 min read

Diagram showing vLLM's PagedAttention optimizing production inference for large language models, boosting throughput.

vLLM Boosts Production Inference Through High-Throughput PagedAttention

When you’re building a service that must answer dozens—or hundreds—of prompts every second, the gap between a prototype and a production‑ready system often boils down to raw inference speed and memory efficiency.

March 10, 2026

• 2 min read

Google Stax LLM-as-judge: AI evaluating model outputs by user criteria, improving AI development.

Google Stax uses LLM-as-judge to auto‑evaluate model outputs by your criteria

Why does it matter when you have to sift through dozens of AI‑generated answers to find the ones that actually meet your standards? That’s the problem Google’s new Stax platform tries to solve.

March 9, 2026

• 2 min read

AI language model accessibility: Falling costs, diverse users, and widespread adoption.

Falling costs drive expansive accessibility to language models

The headline “Falling costs drive expansive accessibility to language models” hints at a shift that’s reshaping who can actually use these systems.

March 9, 2026

• 2 min read

Black Forest Labs' Self-Flow technology accelerates multimodal AI training by 2.8x compared to REPA, boosting efficiency.

Black Forest Labs' Self-Flow speeds multimodal AI training 2.8× faster than REPA

Black Forest Labs has unveiled a new training approach they call Self-Flow, aimed at cutting the time it takes to teach multimodal AI systems.

March 4, 2026

• 2 min read

GPT-5.3 Instant AI model, represented by a glowing brain, showing reduced hallucinations and refusals.

OpenAI's GPT-5.3 Instant trims hallucinations 26.8% and reduces refusals

OpenAI’s latest rollout, GPT‑5.3 Instant, marks a noticeable pivot. After a series of releases that prized faster response times, the company is now foregrounding reliability.

March 3, 2026

• 2 min read

Google launches Gemini 3.1 Flash Lite, priced at one‑eighth of Gemini 3.1 Pro

Google rolled out Gemini 3.1 Flash Lite this week, slashing the price tag to roughly one‑eighth of its sibling, Gemini 3.1 Pro.

March 3, 2026

• 2 min read

Pixel 10 phone displaying Circle to Search and Gemini AI agent ordering groceries, enhancing user experience.

Pixel 10 adds Circle to Search and Gemini agentic tools for grocery orders

Google’s newest Pixel rollout pushes the phone’s AI deeper into everyday tasks. The update folds visual discovery into the camera’s lens, letting users snap a look and instantly see the separate items that make it up.

March 3, 2026

• 3 min read

Agentic AI code snippet showing JSON output calling a weather API for London, displaying Celsius temperature.

Agentic AI emits JSON to call weather API for London in Celsius

Why does an LLM start spewing JSON instead of plain text? The answer lies in a growing class of “agentic” systems that treat the model as a decision‑maker rather than just a predictor.

March 3, 2026

• 2 min read

Claude AI memory update: Anthropic's new prompt and import tool for AI switchers, enhancing user experience.

Anthropic adds new prompt and import tool to Claude's memory for AI switchers

Why would a user bother moving from a familiar chatbot to a newcomer? The answer often lies in how much of their existing work can be carried over without starting from scratch.

March 2, 2026

• 3 min read

Alibaba's Qwen3.5-9B AI model outperforming OpenAI's gpt-oss-120B on a laptop benchmark test.

Alibaba's Qwen3.5-9B outperforms OpenAI's gpt-oss-120B on laptop benchmarks

Alibaba’s latest open‑source model, the Qwen3.5‑9B, has just topped OpenAI’s gpt‑oss‑120B in a series of laptop‑focused tests.

March 2, 2026

• 2 min read

Databricks paper: Data quality, not model architecture, key to LLM speed. AI, machine learning, deep learning.

Databricks paper finds data quality outweighs model architecture in LLM speed

When firms race to shave weeks off large‑language‑model training, the instinct is to chase bigger GPUs, fancier architectures, or exotic optimization tricks. Yet the bottleneck often hides in the data pipeline, not the model itself.

March 2, 2026

• 2 min read

Pokémon Pokopia: Trainer and Pikachu explore a ruined, overgrown city, encountering new Pokémon species.

Pokémon Pokopia lets players meet new Pokémon while rebuilding a ruined world

Pokopia lands on the scene with a promise that feels both familiar and oddly fresh. On paper it reads like a typical life‑simulation: you tend gardens, decorate homes, and take things at a leisurely pace.

March 2, 2026

• 3 min read

📚 Featured Resources & Reviews

🎓

Browse Other Categories

🛠️ AI Tools & Apps 💼 Business & Startups 📊 Research & Benchmarks ⚖️ Policy & Regulation 📈 Market Trends 🔓 Open Source 🏭 Industry Applications

LLMs & Generative AI - Page 10 of 36

Google Gemini Embedding 2 adds multimodal support, speeds video/audio retrieval

Candidate faces AI avatar interview from CodeSignal, Humanly, Eightfold

Meta unveils MTIA 500 chip with higher memory and low‑precision data tweaks

Study finds ChatGPT, Gemini and other bots aided teens in planning violence

Yann LeCun's USD 1B Bet Targets LLMs as Lambda Shows 50% Power Waste

OpenAI adds tools to ChatGPT amid lawsuits, Pentagon backlash and trust problem

Python setup_env.py builds BitNet-b1.58-2B-4T C++ backend via CMake

Google adds Gemini chat to Docs, AI‑generated spreadsheets to Workspace

vLLM Boosts Production Inference Through High-Throughput PagedAttention

Google Stax uses LLM-as-judge to auto‑evaluate model outputs by your criteria

Falling costs drive expansive accessibility to language models

Black Forest Labs' Self-Flow speeds multimodal AI training 2.8× faster than REPA

OpenAI's GPT-5.3 Instant trims hallucinations 26.8% and reduces refusals

Google launches Gemini 3.1 Flash Lite, priced at one‑eighth of Gemini 3.1 Pro

Pixel 10 adds Circle to Search and Gemini agentic tools for grocery orders

Agentic AI emits JSON to call weather API for London in Celsius

Anthropic adds new prompt and import tool to Claude's memory for AI switchers

Alibaba's Qwen3.5-9B outperforms OpenAI's gpt-oss-120B on laptop benchmarks

Databricks paper finds data quality outweighs model architecture in LLM speed

Pokémon Pokopia lets players meet new Pokémon while rebuilding a ruined world

📚 Featured Resources & Reviews

No Code MBA Course Review

AI Tools & Resources

Weekly AI Digest

Browse Other Categories