📂 Category
Research & Benchmarks
176 articles in this category • Page 1 of 2
- 1. Google AI Advisors Let Users Probe Performance with Conversational “Why” Queries
- 2. Consensus uses GPT-5 and Responses API to speed scientific research
- 3. Developers say Sora, unlike Vine/TikTok, is not about people in social media
- 4. Study finds reasoning LLMs are more efficient but not more capable
- 5. Bengaluru Hosts The Best Firm Summit 2026 for HR and AI Leaders
- 6. Google expands AI partnership with Tel Aviv University, infrastructure for Gemma
- 7. Study: 77% of data engineers face heavier workloads despite AI tools
- 8. X limits Grok image tool to paid users; 1 obscene request/min, 102 in 5 mins
- 9. Run ML Notebooks on Databricks: Spark‑Powered, Scalable Experiment Platform
- 10. Stanford AI Detects Hidden Disease Signals in Large-Scale Sleep Data
- 11. Indian language ID proves tough; authors release baseline ML models
- 12. Arxiv tightens moderation as AI‑generated CS review papers swell
- 13. OpenAI researcher details new AI model using general RL, no code interpreters
- 14. Use Temporal Patterns: Plot Timestamps to Spot Seasonality, Trends, Shifts
- 15. Google plans space‑based TPUs for solar‑powered data centres, preprint says
- 16. DuckDB Outpaces SQLite and Pandas in Benchmark of 1M-Row Data Tasks
- 17. New Study: Smaller Training Data Can Boost AI's Problem-Solving Skills
- 18. NotebookLM Turns Complex Spreadsheets into Presentation Insights
- 19. Google DeepMind hires ex‑Boston Dynamics CTO to create Gemini AI for any robot
- 20. 7 Pandas Techniques for Efficient Large Dataset Management
- 21. Anthropic finds strict anti-hacking prompts increase AI sabotage and lying
- 22. Forest Listeners lets users explore Amazon and Atlantic forests to find species
- 23. Authors retract brain-mapping paper after reviewers flag fabricated citations
- 24. Microsoft's Fara-7B AI agent, rival to GPT‑4o, runs on PC, logs 145k tasks
- 25. AI for Math Initiative finds structures showing problems harder for computers
- 26. German Commons opens pipeline to free AI datasets from copyright limbo
- 27. TPUs Designed for Deep Learning Can Outperform GPUs in Many Workloads
- 28. ARC benchmark declines as labs tune AI to optimize its specific logic
- 29. Google's self-modifying model needs extra engineering, smarter compute for complex training
- 30. Denario AI research assistant writes papers and self‑reviews them
- 31. Adobe's Frame Forward AI removes subject from first frame, fills background
- 32. IBM and AICTE launch AI Lab in New Delhi with SkillsBuild's 1,000 AI courses
- 33. NxtGen’s M for Coding becomes AI autopilot, funded by Rs 10,300 cr IndiaAI Mission
- 34. Pinterest launches AI shopping assistant Thursday, suggesting looks
- 35. Runway's Gen-4.5 text-to-video AI claims unprecedented physical accuracy
- 36. OpenAI Codex CLI Works with ChatGPT Plan and VS Code Extension
- 37. AI Code Benchmarks Fail the "Vibe Check," Says New DeepMind Study
- 38. Human-aligned AI models show greater robustness and reliability, study finds
- 39. Meta's SPICE framework beats baselines, boosts math and general reasoning
- 40. RECAP tool shows Claude 3.7 reproduces ~3,000 words from The Hobbit and Harry Potter
- 41. NotebookLM adds full 1 million‑token Gemini context window, boosts processing
- 42. Ex‑Microsoft Chair warns AI will cut entry‑level jobs at Bengaluru summit
- 43. Google offers free AI tools to university students across EMEA
- 44. Beyond the Hype: AI's Real-World Progress in Fall 2025
- 45. MineWorld: An Open-Source AI Model That Learns From Minecraft
- 46. AI-Generated Consumer Simulations Could Replace Traditional Surveys
- 47. Red Hat unveils AI 3, hybrid cloud‑native platform for enterprise inference
- 48. Schwacke says brain‑based science can power sustainable AI future
- 49. AI trained on two books mimics famous authors, beats human imitators
- 50. Study finds 1 in 10 U.S. newspaper pieces partly AI-written, often undisclosed
- 51. Databricks study: AI judges need people focus, not just tech development
- 52. Experts advise locating new US data centers outside water‑stressed California
- 53. LearnLM tutoring boosts student problem‑solving by 5.5 percentage points
- 54. AI vision pioneer aims to extend models from data to space understanding
- 55. ElevenLabs' Scribe v2 delivers real‑time, negative‑latency transcription
- 56. Set Seed in XLMiner: Use Integer 12345, 42, 2024 for Consistent Partitions
- 57. Upwork study finds AI agents outperform alone when paired with humans
- 58. RDMA Cuts CPU Use in S3-Compatible Storage, Boosting AI Performance
- 59. Researchers push Context Engineering 2.0 as AI moves from Era 2.0 to 3.0
- 60. DeepEyesV2 Beats Larger Open‑Source Models by Leveraging Search Tools
- 61. Stereogum persists amid streaming, AI and dwindling ad revenue as ads dry up
- 62. ServiceNow uses LangSmith, knowledge graph and MCP to orchestrate agents
- 63. MIT Energy Initiative conference highlights storage research priorities
- 64. M‑GRPO Boosts Coordination in Multi‑Agent Training Over Single‑Agent GRPO
- 65. CrewAI Introduces Function-Based Guardrails for Rule‑Based Output Constraints
- 66. CrowdStrike's Stein finds DeepSeek‑R1 adds 50% more bugs on Chinese prompts
- 67. DOE orders cloud, labs, and network integration for AI Genesis mission in 90 days
- 68. Cecilia Heyes labels language a 'cognitive gadget' for precise social learning
- 69. Digital Connexion to Invest USD 11 Billion in Andhra Pradesh AI Data Centres
- 70. Karpathy says AI‑homework crackdown failed, urges in‑class grading shift
- 71. India's unique AI edge lies in development-to-deployment synergy, expert says
- 72. Build an AI Study Planner Agent That Automates Tasks Using APIs
- 73. Indian Prodigy's AI "Supermemory" Attracts Top Tech Investors
- 74. Open ASR Leaderboard Tests 60+ Speech Recognition Models for Accuracy and Speed
- 75. Gemma model reveals cancer therapy pathway; Yale releases C2S-Scale 27B
- 76. Infosys, Cognizant, Accenture, LTIMindtree invest $1.5B in Oracle Data Platform
- 77. Alibaba's AgentEvolver lifts tool-use accuracy ~30% via auto‑generated tasks
- 78. Data Science Interviews Test Translating Vague Business Questions into Analysis
- 79. Wipro partners with IISc and FSID for AI and quantum research collaboration
- 80. Linear MCP Gives AI Premium Access to Manage Projects via Natural Language
- 81. Adobe's Corrective AI changes voice‑over emotions, swaps music with Adobe Stock
- 82. Meta researchers find signatures in LLM traces signal reasoning correctness
- 83. 5 Data Science Projects: Practical Pandas EDA for Absolute Beginners
- 84. 97% Can't Distinguish AI Music; 71% Surprised, 51% Uncomfortable
- 85. 98% of market researchers use AI; 40% report errors, 29% rely on AI support
- 86. SerpApi Converts Live Search Results into Structured API Data for ML Pipelines
- 87. NeurIPS 2025: Top 4 Papers Highlight Shift From Bigger Models to Limits
- 88. AI Solves 30-Year-Old Math Problem, Showcasing Perplexity's Patent Search Tool
- 89. Pharma Cautious as AI Promises Faster Drug Discovery and Smarter Trials
- 90. Space Data Centers: Companies Harness Sunlight, Cooling, No Permits for AI
- 91. DeepSeek OCR Fast but Fails Complex Forms; Choose Proven Architecture
- 92. Google's Veo-3 fakes surgical videos; 1.78 handling, 1.64 tissue, lowest logic
- 93. DeepMind AI agent explores new games, explains its actions better than SIMA 1
- 94. OpenAI forms ‘OpenAI for Science’ team to speed physics, math discoveries
- 95. ComputeEval 2025.2 expands to 232 CUDA challenges, upping LLM test difficulty
- 96. Amazon launches beta AI translation for self‑published Kindle books
- 97. Counter-Strike Sets New Benchmark for Vibe Coding, Says Ex‑Mixpanel CEO
- 98. Harvard Data Course Runs 66 Weeks, Costs USD 1,332.90 (~Rs 1.18 Lakh)
- 99. Anthropic puts Claude in the interviewer's chair for AI testing
- 100. Harbor Framework Enables Sandbox Agent Execution on Docker, Modal, Daytona