Research & Benchmarks - Page 3 of 16
Academic AI research, performance benchmarks, scientific breakthroughs, and peer-reviewed studies advancing artificial intelligence frontiers.
Academic AI research, performance benchmarks, scientific breakthroughs, and peer-reviewed studies advancing artificial intelligence frontiers.
Why does the hype around AI productivity often feel out of step with what actually gets delivered? While labs showcase glossy numbers, the underlying data tells a quieter story.
Nvidia’s latest software rollout nudges its deep‑learning super‑sampling tech into a new performance tier.
Why does it matter when a chatbot mirrors the tone you expect? Researchers set out to see whether an AI’s conversational style could sway how people own up to mistakes or choose to settle disputes.
Why does it matter when a system tells you it “sees” something it never actually looked at?
Cohere’s newest speech‑to‑text system hits a 5.4 % word error rate, a figure that sits at the low end of what many enterprises consider acceptable for live‑customer interactions.
The roundup of free web APIs just added a service that’s quietly reshaped how autonomous agents pull data from the internet.
Meta’s latest push into open‑source AI isn’t just another research paper; it’s a bundle of tools aimed at developers and marketers alike.
The AI community has been wrestling with a simple question: how do we move from flashy prototypes to systems that people can actually rely on?
Why does it matter when a chatbot tells you what you want to hear? A new study, published under the title “Sycophantic AI can undermine human judgment,” probes exactly that question.
Early AI agents often treat every exchange as a line in a ledger, appending each utterance to a growing transcript.
Mozilla’s latest open‑source effort, cq, aims to give autonomous agents a place to post solutions and borrow tricks the way developers turn to Stack Overflow.
Why does this matter now? As AI models swell and GPU farms push power envelopes, designers are turning to liquid‑cooled chassis to keep chips from throttling.
Keeping up with large‑language‑model news feels like chasing a moving target. Every day a new model drops, a startup announces funding, or a research paper lands on arXiv, and the signal-to‑noise ratio on mainstream feeds can be overwhelming.
The courtroom is waiting, but the stakes stretch far beyond any single verdict. Two teenagers stand accused of using generative AI to produce explicit images of their classmates, a practice that has already ignited a wave of parental outrage and a...
At the industry’s biggest developer showcase this year, every booth was flashing AI demos, yet the actual game line‑up remained conspicuously human‑crafted.
Hachette’s decision to pull the horror title *Shy Girl* has sparked a debate that extends beyond the publisher’s internal review of artificial‑intelligence tools.
Scale AI has rolled out Voice Showdown, a benchmark that moves beyond synthetic tests and puts voice assistants through everyday scenarios.
Why does this matter? As AI‑generated media proliferates, distinguishing authentic material from synthetic output becomes a practical challenge for platforms, regulators, and end users alike.
Google has begun swapping out the text that sits atop its search results with copy generated by its own language models.
The latest benchmark study uncovers a widening gap between boardrooms and living rooms. Executives across sectors are mapping every workflow for a possible AI upgrade, touting efficiency gains and new product lines in every earnings call.
Learn to build AI-powered apps without coding. Our comprehensive review of No Code MBA's course.
Curated collection of AI tools, courses, and frameworks to accelerate your AI journey.
Get the week's most important AI news delivered to your inbox every week.