Resolved
System backlogs have been caught up and the system is stable with increased capacity. The root cause has been determined, and several performance and reliability improvements will be rolled out this week as follow-ups.
Monitoring
All affected users should be in a stable state while the system catches up on executions that were delayed during the migration off the affected queue shard. As things stabilize, we'll migrate these users to a new dedicated queue shard with additional capacity.
Identified
All users were migrated off of the ss2 shard an hour ago to ensure all functions are processed and to mitigate further issues. All users should now be on stable queue shards.
Identified
Our latest attempt to add more replicas to our queue did not succeed, so queue workers have been brought back online. We are now working to shift all accounts off of the ss2 queue temporarily to stabilize it before re-distributing accounts.
Identified
We are actively working to stabilize the ss2 queue. During this process, function execution on this shard may be reduced for 10-20 minutes.
Monitoring
We have scaled up workers for our ss2 queue shard to work through the backlog while we also put system hardening measures in place.
Monitoring
We've fixed the ss2 queue shard, and function execution should begin to resume around 17:58 UTC.
Identified
We have identified the cause of the issue. We're actively working on implementing a fix to restore the ss2 queue shard.
Investigating
We are actively investigating an issue with one of our queue shards (ss2) that handles function execution for a subset of our customers. We will provide further updates as we identify the cause and resolve the issue.