Estonian research institute evaluates AI models' susceptibility to Russian propaganda, highlighting cybersecurity and disinfo

Editorial illustration for Estonian institute benchmarks AI models' vulnerability to Russian propaganda

Estonian institute benchmarks AI models' vulnerability...

By AI Daily Post Edited by Brian Petersen, Editor-in-Chief

June 16, 2026 • Updated: July 15, 2026 • 3 min read

The Institute of the Estonian Language just graded sixty AI models on a critical new subject: Kremlin propaganda. Their benchmark, published today, hits models from global giants and open-source projects with 75 questions in Estonian, Russian, and English. Each query tests one of 14 common Russian narratives, phrased neutrally, with bias, or with outright manipulation.

Anthropic's Claude models claimed the top spots, followed by Nvidia's Nemotron 3 and Alibaba's Qwen 3.6 Plus. Mistral's models, including the newest Medium 3.5, landed in the bottom third.

How easily can Russian propaganda fool AI models? A new benchmark finds out - THE DECODER

Every answer got a score from 1 to 5—a 1 means the model parroted the line. To ensure those scores were sound, the institute used a calibrated Claude Opus 4.5 for evaluation, a method then validated by analysts at the Estonian watchdog Propastop. The full rankings and detailed methodology are now live on the institute's site.

Common Questions Answered

How many AI models did the Institute of the Estonian Language evaluate in their propaganda benchmark?

The Institute of the Estonian Language benchmarked sixty AI models from both global giants and open-source projects. This comprehensive evaluation tested each model's vulnerability to Kremlin propaganda across multiple languages and narrative types.

What languages and types of Russian narratives were included in the Estonian institute's propaganda benchmark?

The benchmark consisted of 75 questions presented in Estonian, Russian, and English, with each query testing one of 14 common Russian narratives. These narratives were phrased in three different ways: neutrally, with bias, and with outright manipulation to comprehensively assess model vulnerabilities.

How did the Institute of the Estonian Language score AI model responses to propaganda questions?

Each answer received a score from 1 to 5, where a score of 1 indicated the model simply repeated the propaganda line without critical analysis. To ensure scoring accuracy, the institute used a calibrated Claude Opus 4.5 for evaluation, with results then validated by analysts at the Estonian watchdog organization Propastop.

Where can researchers access the full rankings and methodology from the Estonian propaganda benchmark study?

The full rankings and detailed methodology from the Institute of the Estonian Language's propaganda benchmark are now publicly available on the institute's website. This transparency allows researchers and organizations to understand how different AI models performed and the specific evaluation criteria used.

Ship an AI product this weekend — no engineers required.

Structured, in-depth lessons on the exact no-code tools — not scattered tutorials.

The exact platforms, taught in depth
Build real, working projects
Our honest review + a reader discount

Read the review →

Estonian institute benchmarks AI models' vulnerability...

Common Questions Answered

How many AI models did the Institute of the Estonian Language evaluate in their propaganda benchmark?

What languages and types of Russian narratives were included in the Estonian institute's propaganda benchmark?

How did the Institute of the Estonian Language score AI model responses to propaganda questions?

Where can researchers access the full rankings and methodology from the Estonian propaganda benchmark study?

Further Reading

Ship an AI product this weekend — no engineers required.

Latest News

Deepseek's New AI Model Matches GPT-5.6 at 60% Lower Cost

Users Blast AI Assistant as 'Dead-End Relationship' Ad

Anthropic says Claude AI hacked companies during safety test

Anthropic says its AI models breached three companies in security tests

Anthropic Says Configuration Error Let Claude Access Open Internet

Nous Research Ships Three Hermes Agent Integration Paths for Block's Nostr Workspace

PolyAI's Dialog-RSN-1 Fuses Speech Recognition and Response

Google's Gemini Robotics 2.0 Aims for Improved Dexterity

LangSmith's LLM Gateway embeds governance into agent runtime

Google DeepMind's Gemini AI now controls entire humanoid robots

Related Reading

ChatGPT's 'Nerdy' tweak rewards goblin metaphors in answers, study finds

Google tests visual 'magazine-style' UI for Gemini 3 Pro users

AI Engineers Face Rising Costs, Need New Strategies for Efficiency

Study quantifies AI agent trust formation, breakage, recovery in survival game

UP‑NRPA Allows Dynamic Customization of Dialogue Strategies Without Offline RL

Common Questions Answered

How many AI models did the Institute of the Estonian Language evaluate in their propaganda benchmark?

What languages and types of Russian narratives were included in the Estonian institute's propaganda benchmark?

How did the Institute of the Estonian Language score AI model responses to propaganda questions?

Where can researchers access the full rankings and methodology from the Estonian propaganda benchmark study?

Further Reading

Ship an AI product this weekend — no engineers required.

Latest News

Deepseek's New AI Model Matches GPT-5.6 at 60% Lower Cost

Users Blast AI Assistant as 'Dead-End Relationship' Ad

Anthropic says Claude AI hacked companies during safety test

Anthropic says its AI models breached three companies in security tests

Anthropic Says Configuration Error Let Claude Access Open Internet

Nous Research Ships Three Hermes Agent Integration Paths for Block's Nostr Workspace

PolyAI's Dialog-RSN-1 Fuses Speech Recognition and Response

Google's Gemini Robotics 2.0 Aims for Improved Dexterity

LangSmith's LLM Gateway embeds governance into agent runtime

Google DeepMind's Gemini AI now controls entire humanoid robots