AI Disclosure & Provenance

Master AI disclosure and provenance end to end. 60 deep dives across 360 lessons covering foundations (transparency vs disclosure vs provenance, audiences, history, failure modes), content provenance standards (C2PA / Content Credentials, IPTC AI tags, JPEG Trust, PAI synthetic media framework, production implementation), watermarking (text, image, audio, video, robustness, evaluation), AI-content detection (text, image, audio, limits, program operations), model & system disclosure (model cards, system cards, datasheets, RAI cards, frontier-model norms, transparency reports, comparability), training-data provenance (lineage, dataset cards, opt-out registries, TDM ethics, copyright, contamination checking), AI disclosure law & regulation (EU AI Act Article 50 / 53, US state stack, election & political ads, platform labelling, FTC guidance, copyright mandates, cross-border), UX (labels, uncertainty communication, chatbot identity, accessibility), operations & engineering (provenance pipeline, versioning, audit trail, vendor requirements, incident response), and industry / standards / future (W3C / ISO / IEEE / NIST / ITU, C2PA / CAI, newsroom policies, deepfake counter-tech, research frontier).

60Topics
360Lessons
10Categories
100%Free

AI disclosure and provenance is the discipline of making it possible for users, regulators, and downstream systems to know, with appropriate confidence, what an AI system is, what it was trained on, what it just produced, and what they should rely on. It sits at the intersection of content provenance standards (C2PA / Content Credentials, IPTC, JPEG Trust), watermarking research (text, image, audio, video), AI-content detection (and its honest limits), model and system disclosure (model cards, system cards, datasheets, RAI cards, transparency reports), training-data provenance (opt-out registries, TDM exceptions, copyright), and the regulatory machinery that turns the whole stack into a legal obligation (EU AI Act Article 50 / Article 53, US state laws, FTC guidance, platform AI-content labelling rules).

This track is written for the practitioners doing this work day to day: provenance engineers shipping C2PA pipelines, ML engineers integrating watermarks, T&S leads building AI-content labelling, RAI leads authoring system cards, legal / policy partners interpreting disclosure law, newsroom standards leads, and program leaders standing up cross-functional disclosure programs. Every topic explains the underlying discipline (drawing on the C2PA spec, IPTC and JPEG Trust standards, NIST AI RMF, EU AI Act, US state laws, frontier-lab system cards, the canonical research literature on watermarking and detection), the practical methodology that operationalises it, and the failure modes where disclosure work quietly fails to inform anyone. The aim is that a reader can stand up a credible AI disclosure and provenance function, integrate it with engineering and governance, and defend it to regulators, journalists, oversight boards, and the users the system actually informs.

All Topics

60 AI disclosure & provenance topics organized into 10 categories. Each has 6 detailed lessons with frameworks, methodologies, and operational patterns.

Disclosure & Provenance Foundations

Content Provenance Standards

Watermarking

💢

Watermarking Overview

Map the watermarking landscape. Learn the families (text, image, audio, video), invisible vs visible, statistical vs cryptographic, and the watermark-vs-provenance complementarity.

6 Lessons
📝

LLM Text Watermarking

Reason about LLM text watermarking. Learn green-list / red-list approaches (Kirchenbauer et al.), SynthID-Text, the robustness-vs-quality trade-off, evaluation, and deployment realities.

6 Lessons
📷

Image Watermarking

Reason about image watermarking. Learn DCT / DWT-domain methods, neural watermarks (SynthID-Image, Stable Signature), survivability under crop / compression / re-photo, and the disclosure UX.

6 Lessons
🔊

Audio Watermarking

Reason about audio watermarking. Learn echo-hiding / spread-spectrum methods, AudioSeal-style neural watermarks, robustness to compression and re-recording, and voice-clone disclosure.

6 Lessons
🎬

Video Watermarking

Reason about video watermarking. Learn frame-level, temporal, and per-clip approaches, the social-platform re-encoding gauntlet, deepfake disclosure, and live-stream constraints.

6 Lessons
🔐

Watermark Robustness & Attacks

Reason about watermark robustness. Learn the canonical attack categories (paraphrase, regenerate, transform, ensemble), formal robustness research, and the layered-defence pattern.

6 Lessons
📊

Watermark Evaluation

Evaluate watermarks credibly. Learn detection metrics (TPR at low FPR), robustness eval, the WAVES / VeriBench-style suites, slice eval, and the link to standards work at NIST and ISO.

6 Lessons

AI-Generated Content Detection

Model & System Disclosure

📚

Model Cards (Mitchell et al.)

Author model cards properly. Learn the canonical Mitchell et al. structure, the Hugging Face / Google Cloud variants, intended-use sections, eval reporting, and the maintenance discipline.

6 Lessons
📚

System Cards

Author system cards (frontier-lab style). Learn the canonical sections, capability vs deployment context split, safety-eval reporting, residual-risk disclosure, and the cross-version comparability ritual.

6 Lessons
📝

Datasheets for Datasets

Author datasheets for datasets (Gebru et al.). Learn the canonical structure, motivation / composition / collection / preprocessing sections, intended use, and the maintenance discipline.

6 Lessons
📋

Responsible AI / Use Cards

Author responsible-AI cards alongside model and system cards. Learn the format, the intended-audience split (developer vs admin vs user), and the integration with deployment guides.

6 Lessons
🌟

Frontier Model Disclosure Norms

Read frontier-model disclosure norms. Learn the canonical artefacts (system card, RSP / scaling policy, evaluation reports, AISI access), the comparability question, and the regulator linkage.

6 Lessons
📊

AI Transparency Reports

Publish AI transparency reports. Learn the canonical metric set, methodology disclosure, comparability discipline, the DSA-aligned report shape, and the audience-specific publication pattern.

6 Lessons
📝

Disclosure Comparability Across Vendors

Compare AI disclosures across vendors. Learn the comparability problem, the eval-set disclosure question, methodology fingerprinting, the Stanford CRFM Foundation Model Transparency Index, and procurement use cases.

6 Lessons

Training Data Provenance

AI Disclosure Law & Regulation

🇪

EU AI Act Article 50 Disclosure

Engineer for EU AI Act Article 50. Learn the chatbot-disclosure duty, deepfake-labelling duty, GPAI disclosure under Article 53, the machine-readable requirement, and enforcement timelines.

6 Lessons
🇺

US State AI Disclosure Laws

Navigate the US state AI disclosure patchwork. Learn California (AB 2013, AB 2655, AB 2839), Illinois, Texas, Colorado, NYC AEDT, and the federal-action stance from the FTC and EOs.

6 Lessons
📢

Election & Political Ad Disclosure

Engineer election and political-ad AI disclosure. Learn the Tech Accord on AI & Elections commitments, FEC / FCC stances, state laws, platform policies, and the labelling pattern.

6 Lessons
📢

Platform AI-Content Labeling Rules

Read platform AI-content labelling rules. Learn the Meta / TikTok / Google-YouTube / X policies, the labelling-vs-removal distinction, edge cases (parody, art, journalism), and the enforcement gaps.

6 Lessons
🏪

FTC AI Disclosure Guidance

Work with FTC AI disclosure guidance. Learn Section 5 unfair / deceptive framing, the AI Comply enforcement actions, endorsement / testimonial rules, and the consumer-disclosure expectations.

6 Lessons
📚

Copyright & Training-Data Disclosure Mandates

Comply with copyright and training-data disclosure mandates. Learn EU AI Act Art. 53 training-data summaries, US Copyright Office guidance, JP/UK developments, and the disclosure-pipeline implication.

6 Lessons
🌍

Cross-Border Disclosure Compliance

Run cross-border disclosure compliance. Learn the obligation map (EU, US-state, UK, China, Japan, Korea, Brazil, India), the highest-common-denominator strategy, and per-region overrides.

6 Lessons

UX & User-Facing Disclosure

Operations & Engineering

Industry, Standards & Future