CUBIC's Idle Optimization Causes QUIC Death Spiral at Cloudf

CUBIC's Idle Optimization Causes QUIC Death Spiral at Cloudflare

A Linux kernel idle optimization ported to Cloudflare's QUIC implementation caused CUBIC's congestion window to lock at minimum, creating a self-perpetuating recovery trap. The bug, triggered only when cwnd collapses to two packets, caused 61% of test connections to fail to complete a download within 10 seconds.

3 min readMay 13, 2026

CUBIC's Idle Optimization Causes QUIC Death Spiral at Cloudflare

Cloudflare engineers traced a 61% test failure rate to a subtle bug in their CUBIC congestion controller implementation. The bug, ported from a 2017 Linux kernel fix, causes the congestion window (cwnd) to lock at its 2700-byte minimum when the connection enters a specific recovery oscillation.

The Test That Failed 61% of the Time

The integration test simulated a 10MB HTTP/3 download over localhost with 10ms RTT. For the first two seconds, 30% random packet loss was injected. After two seconds, loss stopped entirely. With Reno, the test passed 100% of the time. With CUBIC, 61 out of 100 runs failed to complete within the 10-second timeout—even though the download should finish in 4-5 seconds.

The Oscillation: 999 State Transitions in 6.7 Seconds

Instrumentation revealed that after the loss phase ended, CUBIC's cwnd stayed pinned at the minimum floor of 2700 bytes (two full-sized packets). The congestion state machine oscillated between recovery and congestion avoidance 999 times over 6.7 seconds—one transition every ~14ms, matching the connection's RTT.

Root Cause: Porting a Kernel Fix to User-Space QUIC

The bug originates from a 2017 Linux kernel change (commit by Eric Dumazet, Yuchung Cheng, Neal Cardwell) that adjusted CUBIC's epoch_start after idle periods. The kernel fix shifted the epoch forward by the idle duration to prevent cwnd inflation. When Cloudflare ported this to quiche in 2020, they used the on_packet_sent() callback to detect idle:

// cubic.rs — on_packet_sent() (simplified)
fn on_packet_sent(&amp;mut self, bytes_in_flight: usize, now: Instant, ...) {
    if bytes_in_flight == 0 {
        let delta = now - self.last_sent_time;
        self.congestion_recovery_start_time += delta;
    }
    self.last_sent_time = now;
}

This logic has a flaw: congestion_recovery_start_time is normally set during ACK processing, not at send time. Adding the idle delta at send time can push the recovery start time into the future, causing CUBIC to misinterpret the connection state.

The Self-Perpetuating Trap

The bug triggers only when three conditions align:

A real loss event sets the recovery boundary
The connection is in congestion avoidance
cwnd has collapsed to the two-packet minimum

At minimum cwnd, every ACK cycle drives bytes_in_flight to zero. The on_packet_sent() check then advances congestion_recovery_start_time by the full RTT each time, creating a feedback loop: the recovery state never ends, so cwnd never grows.

The Fix: A One-Line Change

The Cloudflare team fixed the bug by resetting congestion_recovery_start_time only when the connection has actually been idle long enough to warrant a reset, rather than on every send when bytes_in_flight == 0. The exact fix is described as an elegant near-one-line change that breaks the oscillation cycle.

Why This Matters for QUIC Implementations

This bug highlights the dangers of porting kernel-level TCP optimizations to user-space QUIC stacks. The kernel has CA_EVENT_TX_START callbacks that QUIC lacks, forcing implementers to approximate idle detection with bytes_in_flight == 0 checks. As Cloudflare notes, outside the minimum-cwnd regime the bug is invisible—it only surfaces under heavy loss scenarios that drive cwnd to its floor.

Lessons for Developers

Test congestion controllers at minimum cwnd, not just steady state
Be wary of porting kernel TCP optimizations to user-space QUIC without adapting the idle detection mechanism
Instrument state transitions: the 999 oscillations were invisible without qlog visualization

Cloudflare has merged the fix into quiche. The full analysis is available on their blog.

Editor's Take

I've been burned by porting kernel TCP code to user-space before. The assumption that `bytes_in_flight == 0` is equivalent to idle is seductive but wrong in QUIC's ACK-clocked world. Cloudflare's fix is elegant, but the real takeaway is to distrust any kernel optimization that relies on callbacks you don't have. I'd love to see the Linux kernel community add a dedicated QUIC congestion control interface to avoid these translation errors.

— DevDigest Editorial

Key Takeaways

•Always test congestion controllers at minimum cwnd with realistic loss patterns.
•When porting kernel TCP code to QUIC, audit every idle detection mechanism—`bytes_in_flight == 0` is not equivalent to kernel's `CA_EVENT_TX_START`.
•Instrument state transitions and cwnd history; visualization revealed the 14ms oscillation that was invisible in logs.

Why It Matters

If you're building or maintaining a QUIC implementation, this bug shows how a seemingly benign port of a kernel optimization can create a catastrophic failure mode. Testing congestion controllers only in steady state is insufficient—you must drive them to minimum cwnd and verify recovery.

#bug#cloudflare#QUIC#linux-kernel#CUBIC

Get the weekly digest

Every Sunday - top tech stories, industry breakthroughs, and developer tools delivered to your inbox.

No spam, unsubscribe anytime.

CUBIC's Idle Optimization Causes QUIC Death Spiral at Cloudflare

CUBIC's Idle Optimization Causes QUIC Death Spiral at Cloudflare

The Test That Failed 61% of the Time

The Oscillation: 999 State Transitions in 6.7 Seconds

Root Cause: Porting a Kernel Fix to User-Space QUIC

The Self-Perpetuating Trap

The Fix: A One-Line Change

Why This Matters for QUIC Implementations

Lessons for Developers

Editor's Take

Key Takeaways

Why It Matters

Get the weekly digest

You might also like

The Vi Family: A Comprehensive Guide to 50 Years of Vi Clones

SAP embeds n8n as orchestration layer for Joule AI agents

Googlebook Replaces Chromebook: Android 17 with Gemini Cursor AI

Building an Offline AI That Remembers: SQLite, Metrics, Proactive Alerts

SAP embeds n8n as orchestration layer for Joule AI agents

SAP Autonomous Enterprise: 200 AI Agents, Anthropic Claude, Stock Down 41%