The Framework Detected Structural Severity Years Before Each Failure
We took federal data from SEC filings, FDIC call reports, NTSB investigations, Congressional testimony, and bankruptcy court records. We mapped it through the Four Frequencies framework. Then we asked one question: did the structural severity scores escalate before the crisis, or only after?
In all six cases, the answer was the same. The framework read escalating structural conditions years before the failure event, using only publicly available data. The lead time ranged from 4.8 years (SVB) to 15 years (drug shortages). In every case, standard metrics and supervisory assessments showed no comparable signal during the same period.
Six Cases. Six Sectors. Six Distinct Failure Architectures.
Silicon Valley Bank
Connected Crisis: All four frequencies above 0.90 at failure
SVB put 82% of its securities into bonds it could not sell, removed the hedges that protected them, and held 94% uninsured deposits that could leave instantly. The CRO seat was empty for eight months. The framework read all four conditions escalating simultaneously, reaching Critical while the bank's supervisory rating was still Satisfactory.
U.S. Generic Drug Shortage
Chronic Erosion: 25-year structural degradation with no single trigger event
What hospitals pay for drugs kept going up. What manufacturers get paid to make them barely moved. That pricing squeeze has been continuous since 2001, grinding down generic manufacturing margins until producers started exiting the market. The framework picked up the signal three years before shortage counts spiked above 200.
Boeing 737 MAX
Cascading Crisis: Permission leads, then each frequency amplifies the next
Boeing was allowed to certify its own safety, and the scope of what it certified kept growing. MCAS went from a minor system to a flight control that could push the nose down based on a single sensor with no pilot override. The framework tracked the cascade: weakened oversight led to safety gaps, which got locked into concentrated design risk, while internal metrics showed everything on track.
East Palestine Derailment
Chronic + Acute Revelation: Peak severity preceded the derailment
Norfolk Southern cut 33% of its train crews in three years. The operating ratio improved. Wall Street celebrated. But every input that drives operating ratio improvement also degrades safety capacity. The framework's composite peaked in 2022 at 0.70. The derailment happened in 2023. The structural condition was at its worst before anyone outside the industry was paying attention.
CrowdStrike Global Outage
Instantaneous Cascade: 78 minutes from update to 8.5 million blue screens
CrowdStrike got more fragile by getting more successful. Every new Fortune 500 customer (80 to 298 over five years) expanded the blast radius of a single deployment failure. The testing gaps were there the whole time, but they were invisible until they mattered. The composite severity rose every single year, driven entirely by market penetration, not degradation.
WeWork
Narrative Implosion: The S-1 filing collapsed a $47B valuation narrative
In 2016, WeWork lost $430 million on $436 million in revenue and carried a $16.9 billion valuation. The framework scored that gap at 0.99 from year one. After the founder left, governance reformed, and the Permission signal dropped. But the company still went bankrupt because $47.2 billion in 15-year leases cannot be fixed by board reform alone.
How the Backtest Works
Each case follows the same process. We identify the federal data sources that map to each of the four frequencies. Revenue and asset concentration map to Thinness. Staffing patterns and capability metrics map to Absence. Governance structures and regulatory enforcement map to Permission. The gap between reported metrics and actual conditions maps to Management.
We normalize every metric to a 0-to-1 severity scale with fixed bands that do not change between cases: Elevated (0.25), Moderate (0.40), High (0.55), Severe (0.70), Critical (0.80). The composite severity is a weighted average where the keystone frequency (the one that drives the failure architecture) receives the highest weight. Each case documents why a specific frequency is the keystone and states explicit conditions that would disprove the classification.
The data provenance is classified into two tiers. FEDERAL-VERIFIED means the number comes from a filing or database that an organization submitted under legal obligation to a federal agency. DOCUMENTED-VERIFIED means the number comes from official company communications or industry analysis. Between 80% and 97% of the data across all six cases is FEDERAL-VERIFIED.
Sensitivity Analysis
The most common methodological challenge to any scoring framework is that the analyst chose the normalization ranges and weights to produce the desired result. We tested this directly.
We shifted all normalization ranges by plus and minus 20% and ran 25 parameter combinations per case (5 normalization shifts times 5 weight perturbations). The temporal lead finding holds at 100% across all combinations. The trajectory shape holds at 100%. The severity band at the crisis point holds at 100%. The findings are not artifacts of where we set the boundaries.
These backtests validate the framework against federal data. The same structural vocabulary applies to organizations that are still operating.
The backtests are retrospective. The framework also works prospectively.