Skip to main content
Reporter points at laptop showing ChatGPT API output tagged with Wikipedia and German site icons, a web UI screen.

Editorial illustration for ChatGPT API Cites Wikipedia More, Favors Obscure German Sources Over Web Interface

ChatGPT API Reveals Unique Citation Patterns Across Sources

ChatGPT API favors Wikipedia and obscure German sites, unlike web UI

Updated: 3 min read

When artificial intelligence speaks, not all sources are created equal. A new analysis of ChatGPT's API reveals surprising citation patterns that differ dramatically from its public web interface.

The research uncovers an intriguing digital bibliographic puzzle. While most users interact with ChatGPT through its familiar web portal, the underlying API appears to have a distinctly different approach to sourcing information.

Some unexpected players are emerging in the AI's reference ecosystem. Obscure regional publications and encyclopedic resources are getting unexpected algorithmic spotlight time, suggesting that behind-the-scenes AI behavior might look very different from what users typically experience.

These findings raise critical questions about how large language models select and prioritize information sources. Plus, they hint at the complex, often opaque mechanisms driving AI knowledge retrieval.

The data points to a fascinating research landscape where technical nuance can reveal profound insights about how artificial intelligence actually processes and references information. What happens when we peek behind the algorithmic curtain?

By comparison, the API often points to encyclopedic sources like Wikipedia--almost 15 percent of API citations--and to little-known local outlets that have limited reach in Germany. Deutsche-handwerks-zeitung.de, for example, appears far more often in API results. Compared to the Reuters Digital News Report 2025, the web interface's source list overlaps 45.5 percent with top German media, while the API overlaps just 27.3 percent.

Public broadcasters also get more exposure on the web interface (34.6 percent of sources) than through the API (12.2 percent). Broader source requests can surface fringe and unreliable sites When users asked ChatGPT for a wider range of sources, the system listed 1.4 times as many sites through the API and 1.9 times as many through the web interface, compared to standard queries. But more variety didn't always mean better information.

According to the study, these requests led to more citations of politically biased or propaganda outlets, like news-pravda.com, which has reported ties to the Russian government. The system also sometimes linked to fake or nonexistent domains, like news-site1.com, or to lookalike sites that generate AI-written "news." Even though some sources were polarizing, the average political leaning of outlets cited by ChatGPT was close to the national average.

The ChatGPT API reveals intriguing citation patterns that diverge sharply from its web interface. Its sourcing strategy appears more encyclopedic, with Wikipedia capturing nearly 15 percent of API citations, a notable shift from typical web interactions.

Surprisingly, the API gravitates toward obscure German sources like Deutsche-handwerks-zeitung.de, suggesting a different information retrieval approach. When compared against mainstream media benchmarks, the API's source selection looks markedly different.

The data hints at underlying algorithmic variations between the web interface and API. While the web version aligns more closely with top German media outlets, overlapping 45.5 percent, the API shows just 27.3 percent correlation.

These differences raise questions about how information is sourced and prioritized across different ChatGPT platforms. The API's tendency to reference less-prominent sources could reflect nuanced backend processing that remains opaque to users.

For now, the divergent citation behaviors suggest that not all AI interactions are created equal. Researchers and users might want to pay closer attention to the sources driving their specific ChatGPT experience.

Common Questions Answered

How do citation patterns differ between ChatGPT's web interface and API?

The ChatGPT API shows a markedly different citation approach compared to its web interface, with nearly 15% of API citations coming from Wikipedia. The API also tends to reference more obscure sources, particularly local German outlets like Deutsche-handwerks-zeitung.de, while the web interface has greater overlap with mainstream media sources.

What percentage of sources overlap between the ChatGPT API and top German media?

According to the research, the ChatGPT API has only a 27.3% overlap with top German media sources, which is significantly lower than the web interface's 45.5% overlap. This suggests the API uses a more diverse and potentially less mainstream set of information sources.

Why are the citation differences between the ChatGPT API and web interface significant?

The divergent citation patterns reveal that AI information retrieval is not uniform across different platforms, with the API showing a more encyclopedic and locally-focused sourcing strategy. These differences highlight the complexity of AI information gathering and the potential variability in how AI systems access and prioritize information sources.