📂 Category
Research & Benchmarks News Archive - Page 3 of 5
475 articles in this category • Page 3 of 5
- 201. Nvidia's DLSS 4.5 beta adds 6x Multi Frame Generation for RTX 50 GPUs
- 202. AI sycophancy cuts apologies, raises double‑downs; lifts moral trust
- 203. AI models fabricate image descriptions; benchmarks miss the shortcuts
- 204. Cohere's open-weight ASR model reaches 5.4% WER, ready for production use
- 205. Free API that evolved from slow web search to top AI tool, beyond scraping
- 206. Meta unveils open-source brain AI, adds Scrunch site audit and Suno v5.5
- 207. AI assurance experts meet to build infrastructure for safe, high‑quality systems
- 208. Study finds overly flattering AI advice can impair users' judgment
- 209. xMemory reduces token usage and context bloat versus MemGPT's raw logging
- 210. Mozilla dev launches cq, a Stack Overflow‑style hub for agents
- 211. Liquid‑cooled AI systems make storage an active cooling and GPU partner
- 212. 10 X Accounts for LLM Updates, Including the ‘Largest AI Newsletter’
- 213. Teens await sentencing for AI‑generated nude images as parents sue school
- 214. Developers say AI‑generated games feel unlike human‑made; audiences don't connect
- 215. Hachette withdraws Shy Girl horror novel amid AI usage concerns
- 216. Scale AI's Voice Showdown ranks Qwen ahead of top models, highlights failures
- 217. SynthID uses steganography to embed hidden watermarks in data
- 218. Google Search experiments with AI-generated headlines, may expand rollout
- 219. Growing cultural disconnect as companies race to deploy AI rapidly
- 220. Deep AI adopters reshape workflow, borrowing product‑manager tactics
- 221. NVIDIA DGX Spark expands node support to four, doubling memory capacity
- 222. Google's MusicFX DJ Enables Real-Time Controllable AI Music Generation
- 223. Paper identifies simple games that defeat AlphaGo and AlphaChess training
- 224. NVIDIA Cosmos Transfer Enables Scalable Synthetic Data for Physical AI
- 225. Trump Administration Signals Possible Additional Sanctions on Anthropic at Hearing
- 226. YouTube extends AI deepfake detection to politicians, journalists
- 227. Karpathy releases open-source Autoresearch, runs hundreds of AI tests nightly
- 228. AI spots trends but misses significance, keeping humans essential
- 229. Large CUDA Tiles Reduce Flash Attention TFLOPS by 18‑43% Across Sequences
- 230. KV cache compaction cuts LLM memory 50×, chunked processing long contexts
- 231. AI system flags probable matches, narrows anonymous accounts to shortlist
- 232. Seven tech giants sign Trump pledge to curb data‑center power cost spikes
- 233. Microsoft's Phi-4 Reasoning Vision 15B offers low‑latency, compact AI
- 234. LangSmith CLI adds three portable skills for coding agents in the repo
- 235. Secret meeting sees 94% approve even least‑popular AI resistance stance
- 236. AI data centers move to Arctic edge, boosting Nordic rural economies
- 237. Microsoft's OPCD cuts system prompts while preserving AI performance
- 238. Wall Street shows persistent AI anxiety, sparking frequent mini‑panics
- 239. Riley Walz, the ‘Jester of Silicon Valley,’ joins OpenAI’s OAI Labs team
- 240. AI enables scientists to integrate multiple cell measurements
- 241. Researchers argue building conscious AI could foster empathy, despite doubts
- 242. AI Researchers Resign, Bots Hire Humans, Anthropic Targeted, Evie Party
- 243. Researchers embed mask token in LLM weights to achieve 3× faster inference
- 244. Run:ai on 64 GPUs serves 10,200 users, matching native scheduler
- 245. Google unveils Gemini 3.1 Pro, hits 94.3% GPQA Diamond and coding Elo 2
- 246. Google launches AI Professional Certificate to boost fluency for workers
- 247. Google.org launches USD 30 M AI for Government Innovation Impact Challenge
- 248. Google urges full‑stack, collaborative security to fight bad actors at MSC 2026
- 249. SurrealDB 3.0 stores agent memory, business logic, and multimodal data in one DB
- 250. Anthropic-Pentagon AI feud escalates as You.com co-founders Socher, McCann cited
- 251. AI's new physics discovery; Spotify devs wrote no code this year, CEO says
- 252. Study Finds Stigma Causes Shame for Some in AI Relationships
- 253. Google's upgrade teaches zero-shot selection, embeddings, QA workflows
- 254. MLOps Workflow Normalizes and Enriches Occupational Wage Data from Excel
- 255. Full‑stack resilience: protecting democracies from digital threats to subsea cables
- 256. Anthropic aims to curb costs as it launches USD 50B of data centers in NY, Texas
- 257. Qwen-Image-2.0 renders calligraphy with near‑perfect text, ranks behind Nano Banana Pro
- 258. EU AI Lacks Models and Compute; Germany Urged to Lead Coalition
- 259. New benchmark finds AI still hallucinates despite citing legitimate sources
- 260. No firm admits AI replacing New York workers; Amazon cites AI for 30,000 layoffs
- 261. AI Proposed to Supplant Nuclear Treaties, Raising Cheating Concerns
- 262. Study finds GPT‑4o updates trigger real mourning as users personify model
- 263. Deepseek‑R1 and QwQ‑3 exhibit competing personalities that improve reasoning
- 264. Google's PaperBanana uses five AI agents to auto-generate diagrams, missing icons
- 265. Team embeds compressed docs index in AGENTS.md to guide AI coding agents
- 266. Waymo launches Waymo World Model using DeepMind's Genie 3 for unseen scenarios
- 267. AI Social Network Moltbook Leaks Real Human Data, Raising Security Concerns
- 268. Recommendation engine lifts click-through 10%; efficiency needed for deployment
- 269. TTT-Discover uses inference-time RL to double GPU kernel speed vs experts
- 270. OpenClaw AI skill extensions flagged as security nightmare by OpenSourceMalware
- 271. Anthropic teams with Allen Institute and HHMI to boost transparent scientific AI
- 272. Infiltrator reports AI agents on Moltbook ignore pleas, share odd links
- 273. Musk merges SpaceX with xAI and X, cites new AI‑compute satellite plan
- 274. Game Arena launches chess benchmark to test AI strategic reasoning
- 275. Testing Google’s Auto Browse AI in Chrome: the results fell short
- 276. Tely AI auto‑creates and publishes website answers, delivering high‑quality leads
- 277. AI models using internal debate spot errors and boost accuracy on complex tasks
- 278. Vibe Coding’s 7 Plans Start at USD 3/Month, Provide Prompt Capacity
- 279. 7 Scikit-learn Tricks: Embed Preprocessing Pipelines in Hyperparameter Tuning
- 280. AI Toy Leaks 50,000 Kids' Chat Logs to Any Gmail User, Privacy Breach
- 281. Airtable Superagent provides full execution visibility, cites data semantics over model
- 282. AI scans 100 M Hubble cutouts in 2.5 days, flags 1,400 odd objects
- 283. Google DeepMind staff request physical safety from ICE agents in offices
- 284. Animators and AI Researchers Build ‘Dear Upstairs Neighbors’ Despite Unique Style
- 285. Microsoft's Maia 200 AI chip, with 100B+ transistors, rivals Amazon, Google
- 286. Researchers breach all AI defenses; Walmart CISO warns of agentic AI risks
- 287. Rust meets Python: Enhancing the NumPy‑pandas‑scikit‑learn‑PyTorch workflow
- 288. AI Foundry by Tredence to Host Builders Forum Feb 7, 2026 in Chennai
- 289. AI video hits high bar; new tools for consistency and customization at Davos
- 290. Trust Drives C‑suite Adoption and Scaling of Agentic AI, Research Finds
- 291. Chinese-born AI scholars in US forge ties, deepening US-China collaboration
- 292. Adobe adds Firefly‑Premiere AI video tools, announces USD 10M Sundance grants
- 293. Anthropic, DeepMind, Node.js Leaders Say AI Will Replace Most Coding in a Year
- 294. Hyperparameter Tuning Reaches 0.9617 Accuracy in 64.59 Seconds
- 295. RLVR lifts sampling efficiency, not reasoning; base models hold trajectories
- 296. OpenAI Safety Lead Moves to Anthropic's AI Risk Research Team
- 297. AI Researchers Reveal Token Warehousing Strategy to Cut GPU Computational Waste
- 298. AI Tool Detects Dangerous Blood Cells Doctors Might Overlook
- 299. IndiaAI Mission Launches 62 AI and Data Labs Across Uttar Pradesh
- 300. Stanford AI Detects Hidden Disease Signals in Large-Scale Sleep Data