← back to bubbles

About

Extraction is a bubble visualization of every notable crypto exploit, hack, and major collapse since 2014. Each bubble is one incident: size = USD lost, color = attack vector, position = year (or month when you drill into a year via the scrubber).

As of today: 1,189 incidents, 20142026, 34 Lazarus / DPRK (23 confirmed · 10 forensics · 1 rumored), 68 bridge hacks, 110 repeat exploits, 11 collapses.

Data sources

Spine + 6 enrichment layers. Each record is tagged with all sources that contributed to it.

Vector taxonomy

We classify incidents by attack mechanism, not target type. Ronin Bridge and Harmony are tagged key-leak (validator key compromise) — not bridge— because that's what actually happened. The bridge label captures contract-level cross-chain exploits like Wormhole and Nomad.

Methodology

  1. Fetch + normalize. npm run sync-data pulls DefiLlama → merges SlowMist + manual lists via fuzzy dedup (Levenshtein name + ±1 month + ±15% amount tolerance, with substring match preferred and generic-word filtering to prevent false positives like “X Bridge” ↔ “Y Bridge”).
  2. Vector inference.Priority chain: DefiLlama bridgeHack flag → Rugpull classification → technique-regex match → classification fallback. Donation Attack on lending markets correctly maps to oracle (it's share-to-asset ratio manipulation); Cloudflare Key Compromised maps to key-leak; Frontend / Domain Hijack maps to phishing.
  3. Repeat-exploit detection. Records grouped by parentProtocolId (or normalized name fallback) and sorted by date — first occurrence is isRepeat: false, subsequent get the prior incident IDs listed in the modal. Catches Compound forks (Sonne, Hundred), Curve pools, Balancer pools, Venus Core Pool repeats, etc.
  4. Data-quality sanity.Implausible-amount auto-drop (anything >$10B is almost certainly a token-count vs USD bug from SlowMist). Verified specific corrections via Tavily-driven cross-check: TokenStore Jun 2019 was $1B in SlowMist (token count) → $160M per Quadriga Initiative report; OneCoin $440M $4.5B per SEC + EU prosecutors; Finiko, Solar Techno Alliance, ArbiStar, Wirecard etc — see data/discovery-corrections.json.
  5. Schema invariants asserted by scripts/verify-data.ts: required fields, unique ids, ≥25 Lazarus matches, ≥85% classification coverage in DefiLlama spine, ≥5 repeat-flagged. Top-15 by amount printed for human review.

Known limitations

UI features

Credits

Built with Next.js + d3-force + Tailwind CSS. Data and analysis sourced from DefiLlama, SlowMist, Chainalysis, FBI / US Treasury / DOJ / SEC, TRM Labs, Elliptic, Halborn, Certik, Cyfrin, eth-labels.com, Tavily, and the UN Panel of Experts. Non-commercial — exists to make crypto-theft history easier to scan at a glance, not to attribute liability or replace forensic investigation.

Built by @nikolayxyz