Deepfake Detector Robustness Testing

danb21/social-media-robustness-sdxl-instantid · Datasets at Hugging Face

An evaluation benchmark released under CC BY-NC 4.0 for research evaluation only, not for training detection models or commercial use. Optional: tell me what you work on, and opt in below if you want a heads-up when datasets like this drop. I plan the next dataset around what people actually need.\n","classNames":"hf-sanitized hf-sanitized-Ab7YwAptoZL5_q4eSzBqU"},"customHeading":"Request access to the Social Media Robustness Benchmark","gated":"auto","isLoggedIn":false,"repoId":"danb21/social-media-robustness-sdxl-instantid","repoType":"dataset","requiresPaidPlan":false}"> Request access to the Social Media Robustness Benchmark This repository is publicly accessible, but you have to accept the conditions to access its files and content. An evaluation benchmark released under CC BY-NC 4.0 for research evaluation only, not for training detection models or commercial use. Optional: tell me what you work on, and opt in below if you want a heads-up when datasets like this drop. I plan the next dataset around what people actually need.

Social Media Robustness Benchmark: SDXL+InstantID Synthetic Face Detection

Version: v1.0.0 · License: CC BY-NC 4.0 (research evaluation only)

Detector accuracy on clean lab test sets does not predict in-the-wild performance. Social platforms re-encode every uploaded image: platform-specific JPEG, resize, chroma subsampling, metadata stripped. This benchmark lets detector authors and procurers measure robustness under documented, paired, demographically-balanced conditions instead of blunt lab proxies.

A companion blog post and white paper cover the methodology, statistics, and findings in full. This card describes what the dataset is, how it is built at a high level, and how to use it.

1. Details

Field Value

Name Social Media Robustness Benchmark: SDXL+InstantID Synthetic Face Detection

Version v1.0.0

Base corpus 2,400 images (1,200 real, 1,200 generated), sampled from danb21/synthetic-face-sdxl-instantid-bench at a deterministic per-cell quota

Perturbations 12 single-axis lab variants (JPEG, resize, noise, blur) + 4 platform-pipeline approximations (Instagram, Facebook, TikTok, X)

Total rows 40,574 image rows across 17 configurations (2,400 clean + 28,800 lab + 9,374 platform)

Cell axis 6 skin tones × 2 genders × 2 labels, 100 per cell at the clean baseline

Pairing Every configuration evaluates the same images (paired by media_id)

Blog post Read the write-up (companion post; published before the paper)

Paper In preparation; methodology and results reported there

Maintainer Daniel Babalola, danielbabalola@alumni.upenn.edu

2. What This Dataset Is For

Use it to:

Compute paired AUC deltas, AUC(clean) − AUC(perturbation), per detector per condition.

Measure per-cell robustness (skin tone × gender) under each perturbation.

Compare detector architectures under matched conditions.

It is not training data . It is small by design (2,400 base images), paired by construction (every condition evaluates the same images), and the platform pipelines are calibrated approximations of each platform's mean re-encode behaviour, not pixel-faithful platform reproductions (see Limitations).

3. Structure

Configurations

Config Layer Description

clean clean 2,400 unperturbed base images; the reference for every paired delta

layer1_jpeg_q{30,50,70,80,95} lab JPEG re-encode at the named quality factor

layer1_resize_{0.5,0.75} lab Bicubic downsample then upsample back

layer1_noise_{5,10} lab Additive Gaussian noise (variance 5 / 10)

layer1_blur_{1,2,4} lab Gaussian blur (sigma 1 / 2 / 4)

layer2_ig_pipeline platform Instagram (JPEG ~92, max edge 1440, 4:2:0, EXIF stripped)

layer2_fb_pipeline platform Facebook (JPEG ~93, max edge 1920, 4:2:0, EXIF stripped)

layer2_tt_pipeline platform TikTok (JPEG ~80, max edge 1920, 4:2:0)

layer2_x_pipeline platform X (JPEG ~93, max edge 1920, 4:2:0, EXIF stripped)

All configurations share the same column schema. Key columns: image, label (real/generated), cell_skin_tone, cell_gender, media_id (stable across configurations for pairing), perturbation_slug, perturbation_layer, and the measured-encoding fields.

Balance

The clean baseline and all 12 lab configurations are uniform at 100 images per cell per label (6 skin tones × 2 genders × real/generated). The 4 platform configurations re-crop the laundered output back to a 256×256 face crop for comparability; all real-side cells retain 100/100, while a small fraction of synthetic-face crops do not survive re-detection (~2,342–2,344 rows per platform config). That concentration is itself a finding, analyzed in the white paper.

4. How It Is Built (high level)

Source. A deterministic per-cell subset of the v1 Synthetic Face Detection Benchmark: 1,200 real Pexels frames and 1,200 SDXL+InstantID outputs, uniform at 100 per cell per...

Deepfake Detector Robustness Testing

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

It's Not Just X. It's Y

Amazon, Facebook, FBI have access to a private intelligence-sharing network

Show HN: GoPeek – open links in live mini browser windows without new tabs

Agent Memory: An Anatomy