AWS October 2025 Global Outage: 17 Million Reports, Banking Disruption, and DORA's Concentration Thesis Proven
InfrastructureCritical Third-Party ICT Provider (CTPP)October 20, 2025 (15-24 hours)

AWS October 2025 Global Outage: 17 Million Reports, Banking Disruption, and DORA's Concentration Thesis Proven

A malfunctioning internal subsystem in AWS northern Virginia triggered one of the largest internet outages on record, suspending trading on Coinbase and locking customers out of Lloyds and Bank of Scotland.

Published

Key Metrics

User Reports

17 million (970% spike)

was: Normal baseline

Largest single-provider outage reports in 2025

Geographic Scope

60+ countries

was: N/A

Every inhabited continent affected

Outage Duration

15-24 hours

was: N/A

Critical services 15h, residual effects 24h

Trading Impact

Coinbase: full trading suspension

was: Normal operations

Robinhood: trade execution failures

Banking Impact

Lloyds/BoS customers locked out

was: Normal access

Capital One: transaction delays

The Situation

The Concentration Thesis Validated

The AWS October 2025 outage arrived at a moment of acute regulatory significance. DORA had been applicable since January 17, 2025. The European Supervisory Authorities were in the final stages of designating Critical Third-Party ICT Providers under Art. 31 — a designation that AWS would receive less than one month later, on November 18, 2025. The outage was, in effect, a live demonstration of the systemic risk that the CTPP designation framework was designed to address.

The concentration data was damning. AWS's market position in cloud infrastructure services meant that a single failure in a single region could simultaneously affect financial institutions across dozens of countries. The 60-country impact radius demonstrated that cloud concentration risk is not a theoretical concern — it is a measurable, observable phenomenon with real-world consequences for financial stability.

Several structural factors amplified the impact:

US-East-1 as systemic infrastructure. AWS's northern Virginia region hosts a disproportionate share of cloud workloads, including many services that were architected to use US-East-1 as a primary or sole region. Some AWS services themselves had control plane dependencies on US-East-1, meaning that even workloads running in other regions could be affected by a US-East-1 outage. This architectural pattern — where a single geographic zone serves as a de facto global control plane — creates systemic concentration risk that individual financial institutions cannot mitigate through their own architectural choices alone.

Cross-sector correlation. The outage did not affect financial services in isolation. E-commerce platforms, media streaming services, communications tools, logistics systems, and government services all experienced disruptions simultaneously. For financial institutions, this meant that business continuity assumptions about the availability of supporting infrastructure — communications, supply chain, customer contact channels — were simultaneously invalidated.

Recovery dependency loops. Some institutions reported that their incident response procedures themselves depended on AWS-hosted tools — monitoring dashboards, communication platforms, documentation systems. When the provider hosting the incident response tooling is the same provider that caused the incident, the response capability is compromised at the moment it is most needed.

Regulatory reporting complexity. For EU-regulated financial entities, the outage triggered DORA's incident reporting requirements. Every affected institution needed to classify the event (Art. 17), determine whether it qualified as a major incident (Art. 18), and initiate the notification process to its national competent authority (Art. 19). The challenge: dozens of institutions across multiple jurisdictions, all experiencing the same root cause, all reporting independently to different NCAs, with limited visibility into the provider's own timeline and root cause analysis.

The Challenge

The Scale of Disruption

On October 20, 2025, Amazon Web Services experienced what would become one of its most significant outages in recent history. The failure originated in AWS's US-East-1 region — the northern Virginia data center complex that serves as the backbone of a disproportionate share of global internet infrastructure. According to AWS's post-incident summary and corroborated by independent monitoring services, the root cause was traced to a malfunctioning internal subsystem responsible for monitoring network load balancers. When this subsystem entered a failure state, it triggered a cascade that propagated far beyond its intended blast radius.

The scale was unprecedented for a single-provider event in 2025. Downdetector, the independent outage monitoring service, recorded approximately 17 million user reports across affected services — a 970% spike above normal baseline reporting levels. Reports originated from more than 60 countries, spanning every inhabited continent. The outage duration extended to approximately 15 hours for the most severely affected services, with some downstream systems experiencing degraded performance for up to 24 hours.

Financial services institutions were among the most visibly affected sectors. Coinbase, the largest US cryptocurrency exchange by trading volume, publicly confirmed that it had suspended all cryptocurrency trading for the duration of the outage — an action with direct financial consequences for its users given the 24/7 nature of cryptocurrency markets. Robinhood, another major retail trading platform, confirmed that users experienced difficulties executing trades, with some users unable to access their portfolios entirely. Lloyds Banking Group and Bank of Scotland — both among the UK's largest retail banks — reported that customers were locked out of online and mobile banking services.

Capital One, a major US bank with a well-documented cloud-first strategy built predominantly on AWS infrastructure, experienced transaction processing delays that affected its customer-facing services. The Capital One case is particularly instructive because the bank had been widely cited in industry publications as a model for cloud-native banking architecture — demonstrating that cloud-first does not mean cloud-resilient without deliberate multi-region and multi-provider design.

The outage's timing — during active trading hours in both European and US markets — amplified the financial services impact. Trading platforms that could not execute orders during market hours exposed their customers to market risk. Banks unable to process payments disrupted the settlement chains that underpin the financial system. The compounding effect of simultaneous failures across multiple institutions created systemic stress that no individual institution's business continuity plan had been designed to address.

The Approach

DORA's Framework Applied

The AWS October 2025 outage provides the most comprehensive real-world test of DORA's third-party risk and concentration risk provisions since the regulation became applicable. Every major DORA pillar was tested simultaneously.

Pillar IV: Third-Party Risk Management (Art. 28-44)

Art. 28(3) — Register of Information: DORA requires financial entities to maintain a register of all ICT third-party service arrangements. The October outage demonstrated why this register is essential: institutions needed to rapidly assess their exposure to AWS services, determine which critical functions were affected, and communicate the impact scope to regulators. Institutions that had completed their Register of Information submission in April 2025 had a structured data foundation for this assessment. Those that had not were operating blind.

Art. 29 — Concentration Risk Assessment: Art. 29(2) requires financial entities to assess concentration risk arising from ICT third-party arrangements, including at sub-consolidation and group level. The 60-country impact of a single AWS region failure is the canonical example of concentration risk at the systemic level. The assessment question is not whether individual institutions use AWS — many do, legitimately and with good architectural reasons — but whether the aggregate sector-level dependency on a single provider creates systemic fragility.

Art. 28(8) — Exit Strategies: Financial entities must define and implement exit strategies for critical ICT third-party service providers. The October outage tested whether institutions had actionable alternatives to AWS services — not in the abstract, but in the moment. Could an institution, during a 15-hour AWS outage, failover critical workloads to an alternative provider or to on-premise infrastructure? For most institutions, the honest answer was no.

Art. 31 — CTPP Designation: The October outage occurred one month before AWS was formally designated as a CTPP. The timing could not have been more relevant. The designation, announced November 18, 2025, subjected AWS to direct ESA oversight — including the power to conduct inspections, require reporting, and issue recommendations. The October outage provided the empirical evidence that validated the designation decision.

Pillar II: Incident Management and Reporting (Art. 17-23)

Art. 17-18 — Classification: Every affected financial entity needed to classify the AWS outage within its own incident management framework. Under DORA's classification criteria — client impact, duration, geographic spread, economic impact — this event would qualify as a major ICT-related incident for any institution whose critical functions were disrupted. The challenge was that the root cause was external and the impact assessment depended on information from AWS that was not immediately available.

Art. 19 — Notification Timelines: DORA requires initial notification within 4 hours of classification as major, with an intermediate report within 72 hours and a final report within one month. For an event affecting dozens of EU-regulated institutions simultaneously, this created a coordination challenge: multiple institutions reporting the same root cause to different NCAs, with each institution having different visibility into the impact and different recovery timelines.

Pillar I: ICT Risk Management (Art. 5-16)

Art. 11 — Business Continuity: The outage tested whether financial entities' business continuity plans covered the scenario of a prolonged failure of their primary cloud infrastructure provider. Plans that assumed cloud provider uptime as a given — rather than explicitly modeling provider failure — were exposed as incomplete.

Art. 9 — Protection and Prevention: DORA requires financial entities to "minimize the impact of ICT risk" through resilient systems design. Multi-region deployment, multi-provider strategies, and graceful degradation capabilities are the technical implementations of this requirement. The October outage separated institutions that had invested in genuine resilience from those that had relied on the provider's own redundancy guarantees.

The Results

Impact Assessment and Industry Consequences

The financial and operational consequences of the AWS October 2025 outage were substantial, though the full economic impact remains difficult to quantify given the breadth of affected services and the global scale of the disruption.

Quantified Impact

Duration and scale: The primary outage lasted approximately 15 hours, with residual effects persisting for up to 24 hours for some services. The 17 million Downdetector reports from 60+ countries represent the largest documented user-visible impact from a single cloud provider outage in 2025.

Financial services disruption: Coinbase's suspension of all cryptocurrency trading during a 15-hour window exposed its users to significant market risk — cryptocurrency markets operate 24/7, and the inability to trade during a volatile period represented a direct financial harm. Robinhood's trade execution failures similarly affected users during US market hours. For traditional banking customers at Lloyds and Bank of Scotland, the inability to access accounts or process transactions disrupted daily financial activities.

Estimated economic impact: Industry analysts estimated the total economic impact of the outage in the billions of dollars, encompassing lost trading revenue, failed transactions, e-commerce disruption, and business productivity losses. The Financial Executives International organization noted that the event reinforced the case for CIO/CFO-level engagement with cloud concentration risk management.

Regulatory Consequences

The timing of the outage — one month before the formal CTPP designation — made it a pivotal moment for DORA implementation. Several consequences followed:

Validation of CTPP designation: The ESAs' November 18, 2025 designation of AWS as a Critical Third-Party ICT Provider was informed by, among other factors, the empirical evidence of systemic impact from the October outage. The designation subjected AWS to direct oversight under DORA Art. 32-44, including the power of the Lead Overseer to conduct general investigations, on-site inspections, and issue recommendations and compliance orders.

Accelerated concentration risk assessments: EU financial institutions were required to reassess their cloud concentration risk profiles in light of the outage. Institutions that had identified AWS as a critical ICT third-party provider in their Register of Information submissions needed to demonstrate that their concentration risk assessments (Art. 29) had been updated to reflect the demonstrated impact of an AWS regional failure.

Incident reporting precedent: The outage established a practical precedent for how DORA's incident reporting requirements function during a provider-level event. The challenge of dozens of institutions reporting the same root cause to different NCAs highlighted the need for coordinated reporting mechanisms — an area where the ESAs subsequently issued guidance.

Structural Lessons

1. Multi-region is necessary but not sufficient. Institutions that had deployed workloads across multiple AWS regions fared better than those concentrated in US-East-1, but multi-region deployment within a single provider does not eliminate provider-level risk. Some AWS control plane services had cross-region dependencies that affected even diversified deployments.

2. Exit strategies must be operationally tested. Art. 28(8) exit strategies that exist only as documentation are insufficient. The October outage demonstrated that the ability to failover to an alternative provider or to on-premise infrastructure must be periodically validated through live testing — not just tabletop exercises.

3. Incident response tooling must not share the failure domain. Institutions whose monitoring, alerting, and communication tools were hosted on the same cloud provider that experienced the outage found their incident response capabilities degraded precisely when they were most needed. DORA Art. 11 business continuity planning should explicitly address this dependency.

4. The 4-hour reporting window is tight for provider-level events. When the root cause is external and the provider's own post-mortem is not yet available, classifying the severity and scope of a cloud provider outage within 4 hours is operationally challenging. Institutions need pre-defined classification criteria for provider outage scenarios.

5. Concentration risk is systemic, not institutional. No individual institution's risk management framework can fully mitigate the risk of a provider outage affecting the entire financial sector simultaneously. This is precisely why DORA created the CTPP oversight framework — to address risk at the systemic level that cannot be managed at the entity level alone.

Lessons Learned

  1. 1DORA Art. 29 concentration risk assessment was empirically validated: a single AWS region failure in northern Virginia affected financial services across 60+ countries, demonstrating that cloud concentration risk is systemic, not theoretical.
  2. 2DORA Art. 31 CTPP designation framework was proven necessary: the October outage occurred one month before AWS was designated as a CTPP, providing the empirical case for direct ESA oversight of critical cloud providers.
  3. 3DORA Art. 28(8) exit strategies must be operationally validated, not just documented. During a 15-hour outage, institutions without tested failover capabilities had no actionable alternative.
  4. 4DORA Art. 11 business continuity plans must explicitly model prolonged cloud provider failure, not assume provider availability. Multi-region within a single provider is necessary but not sufficient for resilience.
  5. 5DORA Art. 19 incident reporting timelines are operationally challenging during provider-level events where the root cause is external and the provider post-mortem is not immediately available. Pre-defined classification criteria for provider outage scenarios are essential.
  6. 6Incident response tooling must not share the failure domain with the provider being responded to. DORA Art. 11 continuity planning should explicitly address this dependency loop.
cloudAWSconcentration-riskArt-29Art-31CTPPglobaltradingbanking

Disclaimer:This case study is based on anonymized data from real-world DORA compliance programmes. Names, specific figures, and identifying details have been changed to protect confidentiality. The outcomes described are specific to the institution's context and may not be directly replicable.

Facing similar challenges?

See how Valendir can help your institution achieve and maintain DORA compliance with deterministic workflows, immutable evidence, and continuous assurance.