AI’s Misinformation Crisis: Chatbots Fail More Frequently in Russian, Chinese and Spanish
A recent audit by NewsGuard reveals that leading AI chatbots perform significantly worse at providing accurate information in non-English languages, particularly Russian, Chinese, and Spanish. The study highlights systemic weaknesses in AI language models, which repeat disinformation at an alarming rate, especially in regions with controlled media environments and limited fact-checking resources.
NewsGuard’s AI Misinformation Audit
NewsGuard’s January 2025 Multilingual AI Misinformation Monitor analyzed responses from ten of the most widely used AI chatbots, including OpenAI’s ChatGPT-4o, Microsoft’s Copilot, Google’s Gemini 2.0, Meta AI, and Anthropic’s Claude. The study tested chatbot responses in seven languages: English, French, German, Italian, Spanish, Russian, and Chinese. In total, the audit examined 2,100 responses (10 chatbots × 30 misinformation-driven prompts × 7 languages), evaluating how effectively these models identified and debunked false claims.
The results paint a concerning picture: chatbots exhibited failure rates exceeding 50% in Russian and Chinese, with Spanish close behind at 48%. In contrast, French had the lowest failure rate at 34.33%, a figure that still leaves substantial room for improvement.
Systemic AI Failures in Multilingual Contexts
The study defined chatbot failures as responses that either contained false information or failed to provide an answer. The failure rates in different languages were:
Russian: 55% failure rate (35% false information, 20% non-response)
Chinese: 51.33% failure rate (33.33% false information, 18% non-response)
Spanish: 48% failure rate (27% false information, 21% non-response)
English: 43% failure rate (23% false information, 20% non-response)
German: 43.33% failure rate (21.66% false information, 21.66% non-response)
Italian: 38.67% failure rate (25% false information, 13.67% non-response)
French: 34.33% failure rate (20% false information, 14.33% non-response)
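For readers who want to work with these figures directly, the following minimal Python sketch (not part of the NewsGuard report) tabulates the shares listed above and recomputes each language’s total failure rate, plus a simple unweighted average. The roughly 300-responses-per-language count is inferred from the 2,100-response, seven-language design, and small rounding differences (for example, in the German breakdown) are expected.

```python
# Per-language results from NewsGuard's January 2025 audit, as listed above.
# Values are percentage shares of roughly 300 responses per language
# (10 chatbots x 30 prompts), inferred from the 2,100-response total.
results = {
    "Russian": {"false_info": 35.00, "non_response": 20.00},
    "Chinese": {"false_info": 33.33, "non_response": 18.00},
    "Spanish": {"false_info": 27.00, "non_response": 21.00},
    "English": {"false_info": 23.00, "non_response": 20.00},
    "German":  {"false_info": 21.66, "non_response": 21.66},
    "Italian": {"false_info": 25.00, "non_response": 13.67},
    "French":  {"false_info": 20.00, "non_response": 14.33},
}

# Recompute each total failure rate as false information + non-response.
for language, shares in results.items():
    total = shares["false_info"] + shares["non_response"]
    print(f"{language:<8} failure rate: {total:.2f}%")

# Unweighted average across the seven languages tested.
average = sum(s["false_info"] + s["non_response"] for s in results.values()) / len(results)
print(f"Unweighted average failure rate: {average:.2f}%")
```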
These results indicate that AI chatbots are more prone to misinformation in languages where fact-checking infrastructures are weaker or where state-controlled media sources dominate information ecosystems.
Understanding the Disinformation Loop
A critical issue underlying these failures is AI models’ reliance on lower-quality sources in non-English languages. For instance:
In Russian, chatbots frequently referenced Kremlin-controlled media, repeating state propaganda narratives.
In Chinese, chatbots cited government-approved sources, reflecting Chinese state censorship and selective information control.
In Spanish, chatbots often repeated disinformation spread by Russian-affiliated Spanish-language media outlets, leading to false narratives gaining credibility.
One of the most widely repeated false claims across all languages was that a Danish military pilot named Jepp Hansen was killed in a missile strike in Ukraine—a claim widely circulated by Russian and pro-Kremlin sources but refuted by Danish authorities. Chatbots, particularly in Russian, Chinese, and Spanish, failed to counteract this misinformation, often citing unreliable sources or failing to provide corrections.
The Role of Search Engine Manipulation and AI Hallucinations
Another factor exacerbating misinformation spread by AI chatbots is search engine manipulation by foreign actors. Malicious entities use SEO (Search Engine Optimization) tactics to push state-sponsored narratives to the top of search results, making AI models more likely to reference these sources. The study found that AI chatbots with integrated web search functions were particularly vulnerable to repeating misinformation sourced from these manipulated results.
Microsoft’s research on data voids (gaps in credible reporting on a given topic) helps explain this pattern: when reliable sources are scarce, AI models default to repeating the most readily available content, which often includes state propaganda and misinformation.
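To see why this matters in practice, consider a minimal, hypothetical sketch of a source-reliability filter sitting between a chatbot’s web search step and its answer generation. The domains, scores, and threshold below are invented for illustration and are not drawn from the NewsGuard audit or any vendor’s system; the point is simply that without some such step, whatever ranks highest in a manipulated search, propaganda included, flows straight into the model’s context.

```python
# Hypothetical reliability filter for a search-augmented chatbot.
# All domains and scores below are invented placeholders, not NewsGuard ratings.
RELIABILITY_SCORES = {
    "example-fact-checker.org": 0.9,  # stand-in for an independent, fact-checked outlet
    "example-state-media.ru": 0.1,    # stand-in for a state-controlled outlet
}
DEFAULT_SCORE = 0.5  # unknown domains, i.e. the "data void" case
MIN_SCORE = 0.6      # below this, a result is not passed to the model

def filter_search_results(search_results: list[dict]) -> list[dict]:
    """Keep only results whose domain meets the reliability threshold."""
    return [
        result for result in search_results
        if RELIABILITY_SCORES.get(result["domain"], DEFAULT_SCORE) >= MIN_SCORE
    ]

retrieved = [
    {"domain": "example-state-media.ru", "title": "Claim about a Danish pilot killed in Ukraine"},
    {"domain": "example-fact-checker.org", "title": "Danish authorities refute pilot claim"},
]

# Only the higher-reliability source survives; with no filter at all,
# the SEO-boosted result would reach the model unchecked.
print(filter_search_results(retrieved))
```

Real systems would need far richer signals than a static domain list, but the contrast with the unfiltered case mirrors the vulnerability the audit describes.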
AI as a Misinformation Amplifier?
These findings raise serious concerns about the ethical and societal implications of AI-driven misinformation. As chatbots become more widely used for information retrieval, users in non-English-speaking regions are disproportionately exposed to unreliable narratives. The following risks emerge:
Political Manipulation: AI chatbots may inadvertently serve as tools for state-sponsored disinformation campaigns.
Erosion of Public Trust: Users who rely on chatbots in high-failure-rate languages may place unwarranted confidence in AI-generated responses, leading to misinformed decisions.
Media Freedom Challenges: In countries with restricted press freedom, AI chatbots could further reinforce government-controlled narratives, suppressing independent journalism.
Strengthening AI’s Multilingual Reliability
To mitigate these risks, AI developers and policymakers must take proactive steps:
Expand High-Quality Data Sources: AI models need greater integration with fact-checked, independent journalism in non-English languages.
Improve AI Guardrails: Companies must enhance misinformation detection and bias correction algorithms to prevent reliance on propaganda sources.
Fact-Checking Partnerships: Collaborations with multilingual fact-checking organizations can help improve real-time AI responses.
Increase Transparency: AI developers should provide detailed citations for chatbot responses so users can verify information (a brief sketch of this idea follows the list).
Regulate AI Search Functions: Governments and AI companies must address search engine manipulation to prevent chatbots from reinforcing SEO-driven disinformation.
Leverage French as a Translation Baseline: Given that French had the lowest failure rate, AI models could be designed to first process fact-checked French content and translate it into other languages to enhance factual accuracy across multilingual contexts.
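As a concrete illustration of the transparency recommendation above, the sketch below shows one way an answer object could carry the sources it relied on. The class names, fields, and URL are hypothetical and do not reflect any vendor’s actual API.

```python
# Hypothetical sketch of citation-carrying chatbot responses.
# Structure, names, and the URL are illustrative only.
from dataclasses import dataclass, field

@dataclass
class Citation:
    url: str
    publisher: str
    claim: str  # the specific claim this source addresses

@dataclass
class ChatbotAnswer:
    text: str
    citations: list[Citation] = field(default_factory=list)

answer = ChatbotAnswer(
    text="Danish authorities have not confirmed the claim about the pilot.",
    citations=[
        Citation(
            url="https://example.org/fact-check",  # placeholder URL
            publisher="Example Fact-Checking Desk",
            claim="Report that a Danish pilot was killed in Ukraine",
        )
    ],
)

# Surfacing the sources lets readers verify the answer themselves.
for citation in answer.citations:
    print(f"{citation.publisher}: {citation.url}")
```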
Source: NewsGuard