Category
Research & Benchmarks
176 articles in this category • Page 2 of 2
- 101. 70% of Creatives Fear Stigma as AI Drives Majority of Their Ideas – Anthropic
- 102. Student AI models can inherit bias and harmful traits from teacher models
- 103. Googler details meta‑prompt technique that guides Gemini to craft Veo videos
- 104. TPOT evolves ML pipelines via genetic algorithms in four steps
- 105. AI2 releases Olmo 3.1 32B Think, up 5+ points on AIME and 4+ on ZebraLogic
- 106. Researchers find complex AI persona tactics hurt meaning in development
- 107. AI Ends Build‑vs‑Buy Debate, Focus Shifts to Real Business Impact
- 108. Audio Dataset Valuable for Listening Models, Tackles Noise, Accents, Timing
- 109. GPT-5.2 leads FrontierScience test, but falters on real research tasks
- 110. Google highlights AI-driven chip, infrastructure and robotics advances in 2025
- 111. Dell and NVIDIA Host AI Developer Meetup in Hyderabad to Discuss Solutions
- 112. New framework lets agentic AI tools adapt to fill main agent knowledge gaps
- 113. Nested Learning's Continuum Memory System Redefines AI Memory for 2026
- 114. DeepSeek's architectural fix improves large‑scale reasoning, follows GRPO work
- 115. Notion’s simplified AI agent feature feels indispensable, says engineer
- 116. Docker Trick: Deterministic OS Packages in One Layer to Prevent ML Failures
- 117. Interactive AI Agent Uses OpenAI Function Schemas for Rapid ML Tasks
- 118. SAP deploys 95%‑accurate AI to redefine consultant role by 2030
- 119. CIOs drive AI experiments by embedding ready-to-use features into everyday tools
- 120. General Agentic Memory uses dual-agent design, beats RAG on benchmarks
- 121. Qwen3-4B-Instruct-2507: 4B‑parameter model boosts Raspberry Pi AI
- 122. AMD announces Ryzen AI 400 at CES, resembles AI 300 in laptops
- 123. MIT study probes memorization risk of clinical AI with de‑identified data
- 124. Analysis overhauls AI Index; GPT-5.2 beats professionals on 70.9% of tasks
- 125. Test-Time Training adds dual‑memory to Transformers, keeping inference cheap
- 126. Tredence hosts AI Foundry workshop in Chennai for AI system designers
- 127. New Magnetic Nanoparticle Approach Merges Heating and Healing for Bone Cancer
- 128. Vibe Coding Remains Early Stage, Real-World Reliability Still Distant
- 129. Dell says AI‑focused PCs confuse consumers, who show little interest
- 130. Replit CEO says using more tokens yields higher‑quality inputs, then tests apps
- 131. Hanns Christoph Nägerl’s team finds quantum heating defies classical intuition
- 132. Google AI Studio lets users trace inputs, outputs and API usage in logs
- 133. Gemini 3 Pro tops trust, ethics, safety at 69% vs 16% for Gemini 2.5
- 134. Google's Ironwood TPU to be generally available on Cloud in weeks
- 135. Google AI agents: consistency, context, short‑term session history, long‑term memory
- 136. WeatherNext 2 data now in Earth Engine, BigQuery; Vertex AI early access opens
- 137. Gemini Deep Research agent posts top results on HLE, DeepSearchQA, leads BrowseComp
- 138. Google's FACTS benchmark shows 70% factuality ceiling across four tests
- 139. Google, MIT study finds multi‑agent AI often loses context in sequential tasks
- 140. OpenAI Begins Developing Its Own AI Chips to Power Models
- 141. OpenAI unveils Aardvark, an agentic researcher that hunts bugs like a human
- 142. IndQA Targets India's Billion Non‑English Users, 2nd‑Largest ChatGPT Market
- 143. OpenAI Won’t Seek Government Backstop for Infrastructure, CFO Friar Says
- 144. OpenAI finds sparse models aid debugging, may boost mechanistic interpretability
- 145. OpenAGI agent says it beats OpenAI and Anthropic; study deems over‑optimistic
- 146. OpenAI trials “Confessions” tool that makes models generate self‑audit reports
- 147. GPT-5.2 Thinking emerges as collaborative AI for end‑to‑end web builds
- 148. Anthropic faces pressure as CEO Dario Amodei backs AI regulation
- 149. LangSmith Fetch lets Claude Code, Cursor agents debug from terminal
- 150. NVIDIA Blackwell Tops AI Performance Charts with Optimized Hardware and Software
- 151. NVIDIA Blackwell Tops New AI Benchmark for Performance and Efficiency
- 152. NVIDIA Outlines AI Infrastructure Advances in Networking and Compute
- 153. Oracle's OCI Zettascale10 Offers Multi‑Gigawatt AI Power with NVIDIA GPUs
- 154. Reviews suggest Nvidia DGX Spark mini‑DGX copies DGX design, unveiled by Huang
- 155. NVIDIA Blackwell Wins All MLPerf Training v5.1 Benchmarks with FP4 Accuracy
- 156. NVIDIA cuts prices on Jetson edge‑AI developer kits for holiday shoppers
- 157. NVIDIA open-sources NeMo Data Designer for synthetic AI datasets at NeurIPS
- 158. NVIDIA offers up to USD 60,000 fellowships to PhD students for model collaboration
- 159. OpenUSD and NVIDIA Halos Enhance Robotaxi Safety with Synthetic Data, SimReady
- 160. Dell and NVIDIA host AI developer meetup in Bengaluru on deployment trade‑offs
- 161. Meta's Omnilingual ASR hits sub‑10% error on 78% of 1,600 languages
- 162. AI models stop 87% of attacks but only 8% of attempts; Qwen3‑32B hits 86.18%
- 163. Physicist Steve Hsu releases paper on AI‑assisted physics using GPT‑5 idea
- 164. AI agents claim sources verified despite dead links; 14 error types logged
- 165. Bright Data API Delivers Seamless AI/ML Integration and Anti‑Bot Protection
- 166. Corporate AI agents favor simple workflows; 41.5% accept minute‑range latency
- 167. CognitiveLab unveils NetraEmbed, 150% accuracy gain, adds ColNetraEmbed
- 168. Model distillation cuts latency 2‑3× and lowers costs by double‑digit percentages
- 169. U.S. Leads Pax Silica Initiative Launched at Summit to Secure Silicon Supply
- 170. Experts say data centers' water use is less risky than public perceives
- 171. Pangram 3.0 AI detector reports 99.98% accuracy, adds four usage tiers
- 172. YouTube channel serves AI concepts in under‑minute clips for fast learning
- 173. Fastweb and Vodafone use LangGraph LLM Compiler to automate customer requests
- 174. LeCun and Hassabis dispute meaning of ‘general intelligence’
- 175. Fusion reactors could produce dark‑sector particles via neutron emissions
- 176. Opera Neon: AI‑native browser that researches, compares prices, codes