LLMs & Generative AI News Archive - Page 2 of 10
951 articles in this category
- 101. Anthropic says new Claude model aims for honesty, avoids unsupported claims
- 102. Soro chatbot built on Gemma 3, trained on 1.9B Tajik tokens from web and PDFs
- 103. How Ollama’s Context Length Setting Impacts Local Model Memory
- 104. NVIDIA releases NvRTX 5.7.4 with DLSS 4.5 support for UE5.7.4
- 105. How to Run Multiple Claude Code Sessions in Parallel Without Confusion
- 106. POLAR builds multimodal knowledge graph for semantic and episodic memory
- 107. MEMO trains a memory model on new knowledge with two roles, no LLM changes
- 108. GEM framework casts LLM data curation as hyperspherical variational problem
- 109. Experienced users supervise Claude only when it deviates, not step‑by‑step
- 110. Deploy Agents to Audit Complex Docs and Run Light Evaluations
- 111. Parameter-Efficient Multi-Class Scheduling for Multimodal Anomaly Detection
- 112. Study formalises LLM reasoning redundancy as truncatable steps in correct traces
- 113. Direct and Surrogate Verification Encode Transformer Circuits into SMT Solvers
- 114. AWS Agent Toolkit Shows Invocation, Success, UserError, SystemError Stats
- 115. AMD Ryzen AI Max+ runs 122B‑parameter models locally with 128 GB UMA
- 116. Semantic Search Model Assigns Class Labels and Confidence Scores to Critiques
- 117. Hotz warns AI coding agents could be costly despite 10x productivity boost
- 118. Accurate source citations boost AI answer quality, study finds
- 119. FuRA uses spectral preconditioning with full‑rank SVD for efficient fine‑tuning
- 120. Positional copying dominates answer readout in 1‑3B LMs on GSM8K
- 121. StepFun launches StepAudio 2.5 Realtime, evaluated via mobile app raters
- 122. Guide Shows How Python Connects to Existing AI Models via Custom Requests
- 123. Anthropic may keep supplying Claude to NSA despite Pentagon risk flag
- 124. Claude Code auto‑creates AI scaling algorithms; new control allocates compute
- 125. SuperClaude workflow ranks security issues, details attack vectors, gives fixes
- 126. Anthropic: Claude Mythos Preview finds ~3,900 high‑severity open‑source bugs
- 127. Meta launches Forum: Reddit‑style advice within Facebook groups, AI‑assisted
- 128. SOLAR introduced as self‑optimizing autonomous agent for continual learning
- 129. VSAS‑Bench Introduces Standardized Real‑Time Evaluation for Visual Assistants
- 130. F_Call_Analysis_Planner forwards Parent_Instruction to generate Selection_Rule
- 131. OSCToM uses RL to generate adversarial scenarios testing high-order Theory of Mind
- 132. Alibaba's Qwen3.7-Max runs 35 hrs, self‑monitors reward‑hacking, supports Claude Code
- 133. Gemini 3.5 Flash Shows Fast Responses in Free Account Tests
- 134. Pricing Change Alters Complaint Language, Skews Classifier Accuracy
- 135. Claude skill helps data scientists spot 5‑6 PM weekday usage spikes in 2026
- 136. Deepseek launches Deepseek Code to compete with Claude Code and OpenAI's Codex
- 137. Robotics may get a ChatGPT moment with massive human‑generated training data
- 138. Microservice Architecture Unites OCR, Classification, and LLM Pipelines
- 139. Proposal Calls for Data Probes to Study Impact of Training Data on LLMs
- 140. Isotonic calibration gets O(n⁻¹/³) sample complexity, cost‑optimal LLM routing
- 141. Basis Spline Decoupling Enables Compression of Transformer Models
- 142. LLM Retrieves Median 2020 Inflation Expectation, Drowning Prompt Guidance
- 143. QuickReduce FP4 delivers ~4.1× speedup over RCCL at TP=4 for large messages
- 144. Study Uses SHARP and New Error Framework to Assess PHRs in Health AI
- 145. Google's Gemini 3.5 Flash, pricier, adds 11 Omniscience points; hallucinations 61%
- 146. Alibaba launches Qwen3.5‑LiveTranslate‑Flash: 60‑language translation in 2.8 s
- 147. I/O 2026 unveils Gemini Omni for universal creation, Gemini 3.5 Flash debut
- 148. EKS Hosts Multistage Multimodal Recommender; DLRM Personalizes Rankings
- 149. Gemini 3.5 Flash Enhances Web UI, Graphics and AI Studio Animations
- 150. Fresh Web Data Grounds LLMs, Highlighting RAG's Production Limits
- 151. ANNEAL lets neuro‑symbolic agents patch knowledge graphs without weight changes
- 152. Claude Cowork: Guide to Turning Q1 Sales Data into a Structured Word Report
- 153. Activation steering reveals latent bias in LLMs, reinjection restores decisions
- 154. 95% of task‑specific generative AI pilots never reach production
- 155. 5 Practical Uses of Local Language Models Highlight Code‑First Approach
- 156. SkillSmith extracts fine-grained boundaries so agents run only needed components
- 157. Quantized LLMs Show Emerging Bias, Masking Gradual Degradation
- 158. AgentStop cuts GPU power, heat and battery drain by ending AI agents early
- 159. Vercel Labs launches Zero, a systems language for AI agents to read and ship
- 160. Enterprise‑grade AI platform merges chatbot, voice, video; customizable via APIs
- 161. NightCafe Remains Long‑Running, Community‑Focused AI Art Platform
- 162. Claude Mythos USD 36,428 for 122 exploit episodes; GPT‑5.5 USD 3,075 for 123
- 163. Use Automated Dashboards and Weekly Review Cadence for GenAI Interviews
- 164. OpenAI partners with Malta to offer ChatGPT Plus to every citizen
- 165. Tool Highlights Time‑Consuming Stalls and Faulty Calls in Claude Code
- 166. Zyphra launches ZAYA1-8B Diffusion Preview, a MoE model with 7.7× speedup
- 167. Claude targets agent control plane while Microsoft stays enterprise default
- 168. Invisible orchestration raises collective dissociation (g = 0.975, p = .001)
- 169. Automatic alerts trigger when LLM accuracy falls or latency spikes
- 170. Poetiq’s Meta‑System Improves LLMs on LiveCodeBench for Reasoning, Retrieval
- 171. ChatGPT traffic falls to 54% as Gemini climbs to 26.7% in a year
- 172. Inference Systems, Not Models, Emerge as the Next AI Bottleneck
- 173. Alibaba's Qwen-Image-2.0 doubles compression, slashes steps to 4 with Qwen3.5-9B
- 174. B2B Document Extractor Rebuilt: Rule-Based vs. LLM Using pytesseract OCR
- 175. Anthropic adds Claude plugins for CoCounsel, DocuSign, Everlaw, Box, Harvey
- 176. Rubrics-as-Reward seeks explicit criteria; scalable rubrics remain elusive
- 177. Avoid TensorRT Slowdowns or Build Failures by Adding Plugin Extensions
- 178. Audit matrix flags token rotation via npm postinstall hook in Claude Code
- 179. BalCapRL adds length-based reward masking, boosting LLaVA-1.5-7B and Qwen2.5-VL
- 180. SFT and RL Reweight Pretrained Distributions via Demonstration and Reward Signals
- 181. Spatial priming beats semantic prompting in chart data extraction study
- 182. GraphDC Uses Divide‑and‑Conquer Agents to Scale Graph Reasoning
- 183. RateQuant reveals mixed-precision KV cache pitfall: β decay rates span 3.6‑5.3
- 184. Top 10 2026 LLM Papers Highlight Pass@k Efficiency for Reasoning Models
- 185. Generative AI fuels industrial-scale record 2025 data breaches, ITRC reports
- 186. Strain drives exponential error growth; vorticity only linear impact
- 187. LKV learns head-wise budgets and token selection for LLM KV cache eviction
- 188. LLM Summarizers Omit Identification, Distinguish Observed vs Inferred Claims
- 189. NVIDIA's Star Elastic bundles 30B, 23B, 12B models; 23B hits 85.63 on AIME-2025
- 190. Understanding 'Compute': The Core Power Driving Modern AI Models
- 191. Fields Medalist: ChatGPT 5.5 Pro produced PhD-level math proof in under an hour
- 192. Key Topics for LLM Engineers: Using Instruction Data to Align Models
- 193. Semantic memory query retrieves Friday deployment approval for user-123
- 194. OpenAI launches Realtime‑Translate for 70+ languages and Realtime‑Whisper transcription
- 195. Google's Chrome 4GB on-device AI model unchanged, but explanation lacking
- 196. RVPO boosts HealthBench score to 0.261, beating GDPO’s 0.215 at 14B (p < 0.001)
- 197. Adding Tools and Memory Expands AI Agent Threat Surface, Study Finds
- 198. Grammar-Constrained Decoding Adds 495 Bash Tasks, Some Regressions Seen
- 199. SAT ensures improvement and plug‑and‑play upgrades in multi‑LLM training
- 200. Transfer Learning Boosts Efficiency of Physics-Informed Neural Networks