WebClassifAI
Try API for free
Published on

IAB Content Taxonomy 2.0 → 2.1: SCD Extension — Migration Notes

Authors
  • avatar
    Name
    Tudor Dan Gabriel
    Twitter

If you work in ad-tech, publisher operations, or any part of the digital supply chain, you already know the IAB Content Taxonomy is the canonical way to describe page content. Version 2.1 was released in October 2020, almost three years after v2, but this version doesn't overhaul the category tree the way 2.0 did — instead, it introduces a subtle but important privacy-focused signal: the SCD (Special Category Data) extension.

This article explains exactly what changed from 2.0 → 2.1, why the SCD extension matters, the categories affected, and practical migration and compliance guidance for publishers, DSPs, and data processors.

TL;DR — the essential change

  • No categories were added, removed, or had tier levels changed between 2.0 and 2.1.
  • The primary change is a new column/field called Extension in the taxonomy which, for many nodes, now contains the flag SCD.
  • SCD = Special Category Data — a privacy-oriented label signaling that a taxonomy node could reasonably be used to derive sensitive inferences about users if associated with identifiers or stored longitudinally. (Interactive Advertising Bureau, 2021)

What is the SCD extension and why it exists

The SCD extension is not saying the content itself is 'sensitive' in the legal sense; the taxonomy still just describes page content. The risk comes when content category signals are linked to individual IDs over time and used to build behavioral profiles that reveal attributes like political beliefs, religion, health conditions, or other personal characteristics. Put another way:

  • The taxonomy describes content.
  • When the same taxonomy node is repeatedly tied to an identifier, it becomes a signal about that person.
  • That aggregated signal can surface sensitive attributes and be used for discriminatory or high‑risk purposes.

To reduce this risk, 2.1 introduces the SCD flag on nodes that—by virtue of topic—have higher likelihood to be used to infer sensitive characteristics. The flag is a straightforward, machine-actionable signal the entire ad supply chain can use to adjust handling, storage, retention, and access policies.


Full list of nodes updated with the SCD extension

The following taxonomy nodes in 2.1 were updated to include the SCD extension. No other tier or category changes were made — these nodes simply carry the new privacy flag.

Unique IDNameExtension
136Special EducationSCD
188DatingSCD
189DivorceSCD
199Special Needs KidsSCD
200Single LifeSCD
211Alcoholic BeveragesSCD
214World CuisinesSCD
224Children's HealthSCD
231Weight LossSCD
237Smoking CessationSCD
287Diseases and ConditionsSCD
288AllergiesSCD
289Ear, Nose and Throat ConditionsSCD
290Endocrine and Metabolic DiseasesSCD
291Hormonal DisordersSCD
292MenopauseSCD
293Thyroid DisordersSCD
294Eye and Vision ConditionsSCD
295Foot HealthSCD
296Heart and Cardiovascular DiseasesSCD
297Infectious DiseasesSCD
298InjuriesSCD
299First AidSCD
300Lung and Respiratory HealthSCD
301Mental HealthSCD
302Reproductive HealthSCD
303Birth ControlSCD
304InfertilitySCD
305PregnancySCD
306Blood DisordersSCD
307Sexual HealthSCD
308Sexual ConditionsSCD
309Skin and DermatologySCD
310Sleep DisordersSCD
311Substance AbuseSCD
312Bone and Joint ConditionsSCD
313Brain and Nervous System DisordersSCD
314CancerSCD
315Cold and FluSCD
316Dental HealthSCD
317DiabetesSCD
318Digestive DisordersSCD
319Medical TestsSCD
320Pharmaceutical DrugsSCD
321SurgerySCD
322VaccinesSCD
323Cosmetic Medical ServicesSCD
387ElectionsSCD
388Political IssuesSCD
410Personal InvestingSCD
453Religion & SpiritualitySCD
454AgnosticismSCD
455SpiritualitySCD
456AstrologySCD
457AtheismSCD
458BuddhismSCD
459ChristianitySCD
460HinduismSCD
461IslamSCD
462JudaismSCD
463SikhismSCD
465Biological SciencesSCD
468GeneticsSCD
549BodybuildingSCD

Practical impact by stakeholder

Publishers / Content Platforms

  • Labeling: Continue to surface IAB taxonomy nodes as you do today, but when a page maps to a node with SCD, treat that classification as a higher‑risk signal. Surface only what you need to downstream partners.
  • Storage: Consider reducing retention or using ephemeral identifiers (e.g., session-level rather than persistent IDs) when storing SCD-tagged content events.

DSPs / Advertisers

  • Targeting: Avoid building audiences directly from SCD nodes or combining SCD signals with persistent IDs without legal basis and robust controls.
  • Bidding: Use SCD as a contextual safety indicator; reduce or block bids for certain sensitive categories if your policy or regulation requires it.

Data Processors / CDPs / Clean Rooms

  • Aggregation & Pseudonymization: Aggregate SCD-tagged signals and avoid deanonymization vectors.
  • Access Controls: Enforce strict RBAC and auditing on datasets containing SCD flags.

Compliance & Legal teams

  • Review whether any SCD categories trigger additional legal requirements in your jurisdiction (e.g., health data, political opinions). The taxonomy is a signal — local laws determine obligations.

  1. Update ingestion to accept the new Extension field from the taxonomy JSON. Treat SCD as a first‑class attribute.

  2. Flag at source. When classifying content, persist the taxonomy node plus its Extension metadata (do not infer SCD later — use the source-provided flag).

  3. Minimize linkage. Avoid tying SCD-labeled events to long-lived user identifiers unless absolutely necessary and legally permitted.

  4. Limit retention. Apply shorter retention windows and stronger deletion rules for records containing SCD nodes.

  5. Access & logging. Ensure only authorized systems/teams can query SCD-tagged data and that access is audited.

  6. Segmentation controls. Where possible, restrict the creation of targeted segments using SCD nodes. Prefer contextual (page-level) targeting over user-level segments built from SCD events.

  7. Document policy. Publish an internal playbook describing how SCD nodes are handled, and include it in vendor and partner onboarding.


Key takeaways

  • 2.1 is a privacy-first refinement. There are no structural category changes from 2.0, but many nodes now include an SCD extension that signals elevated privacy risk.
  • Action is required. Technical systems, retention policies, and segmentation logic should be updated so SCD-tagged nodes are handled with additional safeguards.
  • Not all SCD means 'block'. The flag is an operational signal — it doesn't automatically prohibit contextual use. It does, however, require careful thought about whether to link, store, or use the signal for user-level targeting.

Final thought

IAB Content Taxonomy 2.1 keeps the familiar classification tree from 2.0 while giving the ecosystem a simple, machine-readable way to surface privacy-sensitive topics. Treat the SCD extension as a low-friction tool for reducing privacy risk across the supply chain: update your pipelines, tighten retention and access, and favor context over user-level profiling when SCD is present.

References:

Interactive Advertising Bureau. (2021). Content Taxonomy 3.0 Implementation Guide. https://iabtechlab.com/wp-content/uploads/2021/09/Implementation-Guide-Content-Taxonomy-3-0-pc-Sept2021.pdf