Ongoing issue with Azure AKS Clusters
Resolved·Partial outage

System should be fully recovered and node additional is operational. Resolution confirmed.

Tue, Feb 3, 2026, 06:54 AM
(1 week ago)
·
Affected components
Node Autoscaler
EU Node Autoscaler
Updates

Resolved

System should be fully recovered and node additional is operational. Resolution confirmed.

Tue, Feb 3, 2026, 06:54 AM

Monitoring

Update: Azure has released a fix and is rolling it out across all impacted regions. On the CAST AI side, we are observing node operations completing successfully, with nodes being added and provisioning as expected.

We will provide another update once Azure confirms full resolution.

Reference: https://azure.status.microsoft/en-us/status

Mon, Feb 2, 2026, 11:05 PM(7 hours earlier)

Identified

Update: Azure has reported issues with virtual machine service management across multiple regions following a recent configuration change. We’re actively monitoring the situation and will continue to provide updates as more information becomes available.

Reference: https://azure.status.microsoft/en-us/status

Mon, Feb 2, 2026, 09:39 PM(1 hour earlier)

Investigating

Update: Our investigation indicates this may be related to an Azure service outage impacting node creation and addition. We are awaiting further updates from Azure. As a precaution, we recommend disabling node deletion policies and scheduled rebalancing for critical clusters to avoid triggering downscaling. We’ll continue to share updates as more information becomes available.

Reference: https://downdetector.com/status/windows-azure/

Mon, Feb 2, 2026, 09:16 PM(23 minutes earlier)

Investigating

We’re currently observing an increased rate of node add failures and long add node operations, affecting Azure AKS clusters. Our engineering team is actively investigating and will provide updates as more information becomes available.

Mon, Feb 2, 2026, 08:15 PM(1 hour earlier)
Powered by