Two Cloudflare outages in 17 days; Raising serious alarms

Cloudflare outages 2025
Image courtesy of EM’s FP | Altered and designed by Team GBN
By Esahaque Eswaramangalam, ‎
  • Follow author on

This article argues that after two closely timed Cloudflare outages, it is no longer sensible to postpone the more complicated conversation about the crisis they expose, the questions they raise, and the direction of the solutions now coming into view.

On March 21, November 18, and December 5, 2025, Cloudflare outages shook large parts of the internet, and the fact that two of those failures happened just 17 days apart has only sharpened concerns. Most people barely notice Cloudflare until something breaks, but the company sits at the backbone of the modern web. When it stumbles, the impact is immediate and global.

Those back-to-back outages exposed real strain inside an infrastructure provider that the online world quietly depends on. The incidents have revived a long-avoided question in the industry: how much risk is created when a small group of companies carries such a large share of global traffic? And what happens when even one of them falters?

A Breakdown of the November 18 Outage

Setting aside the outage that occurred eight months earlier in March, let’s look back at the two major disruptions that happened in quick succession.

The November 18 outage was the larger and more damaging of the two. It affected roughly 20 to 30 percent of global internet services, making it one of the most disruptive incidents in recent memory. Major platforms: X, ChatGPT, Spotify, Canva, e-commerce operations, SaaS tools, and even some government portals, slowed down drastically or went completely dark.

Cloudflare later confirmed a technical misstep inside its Bot Management system. A permissions error created duplicate entries in a “feature file,” which doubled its size. Once this oversized file was pushed across Cloudflare’s global network, proxy servers began failing almost simultaneously. The issue spread quickly and unpredictably.

The outage began at 11:20 UTC and lasted nearly six hours. DNS delays and edge restarts meant the after-effects lingered in some regions long after the fix was deployed.

One of the more striking parts of the outage was the failure of the reporting tools themselves. DownDetector struggled, and users trying to report disruptions discovered that X wasn’t loading. To many, it felt as though the internet had stopped responding.

The timing added further curious attention. Just days earlier, Cloudflare CEO Matthew Prince had sharply criticised Google for allegedly using publishers’ content to train AI models. Although there is no evidence linking the outage to those remarks, the proximity heightened public interest.

Cloudflare outages 2025
Image courtesy of EM’s FP | Altered and designed by Team GBN

December 5: Shorter Duration

The second outage, today on December 5, was much shorter but still concerning. Arriving only 17 days after the November incident, it revived questions about Cloudflare’s reliability and the fragility of the systems built around it.

This time, the disruption stemmed from issues with Cloudflare’s Dashboard and related APIs. Around 09:00 UTC, users across regions began reporting login failures and slow or broken application loads. In India, trading platforms were among the first hit. Zerodha, Groww, Angel One and others faced login and data feed issues during active trading hours.

Globally, services such as Canva, Zoom, LinkedIn, and several SaaS platforms reported intermittent problems. Cloudflare later explained that the outage was unrelated to cyberattacks and was linked to logging adjustments made in response to a third-party vulnerability.

Although functionality was restored in roughly half an hour, the episode underlined a question that now sits at the centre of both incidents: How much risk is acceptable when a single company carries this much of the internet’s operational load?

Why These Outages Matter More Than Usual

The two incidents highlight deeper structural issues that the internet community has not yet addressed, which include:

1) Too Much Dependence on Too Few Companies

The internet looks distributed, but much of its traffic flows through a small group of infrastructure providers. Cloudflare alone handles CDN, DNS, DDoS protection, Zero Trust access, and edge compute for millions of sites.
When one part fails, entire ecosystems feel the impact.

2) Failures Spill Across Unrelated Sectors

The November outage didn’t just break social media; it hit banking portals, trading platforms, aviation sites, online learning systems, and government services. A single configuration error briefly reshaped the digital experience of millions.

3) Failover Did Not Protect Major Platforms

One of the most worrying takeaways is how many large platforms had no effective backup route. Multi CDN and DNS failover systems, which should have activated automatically, simply didn’t.

This raises blunt questions:

  • Why were major AI platforms unable to reroute?
  • How did streaming and communication services fall over so quickly?
  • Have enterprises become too comfortable leaning entirely on Cloudflare?

The outages exposed architectural weaknesses far beyond Cloudflare itself.

4) Real Financial Costs

Outages freeze payments, disrupt trading, break customer support channels, and halt business operations. Even 20 or 30 minutes of downtime can translate into millions of dollars in lost activity.

5) A Governance Gap

Cloudflare is a private company, yet its failures ripple into government systems, regulated industries, and essential services.

These forces uncomfortable questions:

  • Who holds infrastructure providers accountable?
  • Should CDN and DNS providers be treated as critical infrastructure?

These are not theoretical discussions anymore.

Now Faces Hard Questions

Cloudflare outages 2025
Image courtesy of EM’s FP | Altered and designed by Team GBN

Having two outages in less than three weeks forces policymakers, regulators, and infrastructure leaders to revisit assumptions about resilience. Key questions include:

  • Should governments mandate multi-CDN setups for essential services?
  • Are companies too dependent on Cloudflare for DNS, routing, and Zero Trust?
  • Do we need clearer outage-reporting standards for infrastructure providers?
  • Can one private company realistically support core systems for entire economies?
  • Should internet infrastructure be regulated closer to sectors like aviation or energy?

The answers will shape how future digital systems are designed.

Alternatives and the Push Toward Redundancy

Cloudflare remains one of the most advanced companies in cybersecurity and content delivery, but these outages have pushed enterprises to accelerate diversification.

Alternatives include:

  • Akamai
  • Fastly
  • AWS CloudFront
  • Google Cloud CDN
  • Microsoft Azure Front Door
  • Imperva
  • F5

Many now use models that combine:

  • multi-CDN routing
  • active DNS failover
  • hybrid delivery and security layers

Not to replace Cloudflare; but to avoid single points of failure.

A Broader Warning for the Digital Future

Cloudflare outages 2025
Image courtesy of EM’s FP | Altered and designed by Team GBN

Cloudflare’s November 18 outage, which took down major platforms including X and ChatGPT, came less than three weeks before the December disruption. Together, they expose a structural weakness: the speed and convenience of the modern internet rely heavily on a small number of intermediaries.

As more banking, commerce, logistics, public services, and AI systems move into real-time online environments, the stakes of any failure grow. What used to be a technical concern is now part of economic continuity and national-level resilience planning.

Despite this, internet infrastructure companies remain lightly regulated compared to sectors like energy or finance, even though a single misconfiguration can disrupt global traffic.

The back-to-back outages sharpen the conversation around what standards should apply to firms carrying this kind of responsibility. Chaos testing, stronger failure-mode design, circuit-breaker mechanisms to prevent cascading outages, and open post-incident analysis are becoming baseline expectations rather than optional improvements.

Whether governments, enterprises, and infrastructure providers act on these signals will determine how stable the next decade of digital life will be.

Informative | AI and Unauthentic Content: The Rise of the ‘Authentic Flag’

YOU MAY LIKE