Service Health
Incident affecting Google Cloud Composer
Creation and Upgrades are failing for some Environments while using Cloud Composer 2
Incident began at 2024-04-16 02:20 and ended at 2024-04-17 03:40 (all times are US/Pacific).
Previously affected location(s)
Taiwan (asia-east1)Hong Kong (asia-east2)Tokyo (asia-northeast1)Osaka (asia-northeast2)Seoul (asia-northeast3)Mumbai (asia-south1)Singapore (asia-southeast1)Jakarta (asia-southeast2)Sydney (australia-southeast1)Melbourne (australia-southeast2)Warsaw (europe-central2)Finland (europe-north1)Belgium (europe-west1)London (europe-west2)Frankfurt (europe-west3)Netherlands (europe-west4)Zurich (europe-west6)Paris (europe-west9)Montréal (northamerica-northeast1)Toronto (northamerica-northeast2)São Paulo (southamerica-east1)Iowa (us-central1)South Carolina (us-east1)Northern Virginia (us-east4)Columbus (us-east5)Oregon (us-west1)Los Angeles (us-west2)Salt Lake City (us-west3)Las Vegas (us-west4)
Date | Time | Description | |
---|---|---|---|
| 19 Apr 2024 | 10:21 PDT | Incident ReportSummaryBetween 16 and 17 April 2024, Cloud Composer users experienced an elevated failure rate when creating, resizing or upgrading to newer versions of Cloud Composer 2 Environments with “Private IP” configuration for a duration of 1 day, 1 hour and 20 minutes. Existing Private IP environments continued to operate normally if they were not upgraded or resized. To our Cloud Composer customers whose businesses were impacted during this disruption: we sincerely apologize. This is not the level of quality and reliability we strive to offer you, and we are taking immediate steps to improve the platform’s performance and availability. Root CauseThe root cause was due to an issue introduced by a recent change to the latest stable Container Operating System (COS) image used by Cloud Composer in one of its workloads. The new version of the COS image (M113) moved from iptables-legacy to iptables-nft package as the default, which impacted Konlet (system executing containers)’s handling of iptables to break. Remediation and PreventionGoogle engineers were alerted to the outage via our monitoring tools on 16 April at 09:17 US/Pacific and immediately started an investigation. Once the nature and scope of the issue became clear, Google engineers reverted the recently introduced rollout. Google is committed preventing a repeat of this issue in the future and is completing the following actions:
Detailed Description of ImpactBetween 16 April 2024 from 02:20 to 17 April, 03:40 US/Pacific impacted customers might have experienced issues with:
During the outage customers were asked to refrain from performing upgrade operations until mitigation has been confirmed. Additional Information for Customers:
If you are one of the customers that experienced upgrade failure during the incident and still continue to have issues with further upgrades, please reach out to Google Cloud Support using https://cloud.google.com/support for assistance with recovery. |
| 17 Apr 2024 | 09:39 PDT | Mini Incident ReportWe apologize for the inconvenience this service disruption/outage may have caused. We would like to provide some information about this incident below. Please note, this information is based on our best knowledge at the time of posting and is subject to change as our investigation continues. If you have experienced impact outside of what is listed below, please reach out to Google Cloud Support using https://cloud.google.com/support. (All Times US/Pacific) Incident Start: 16 April 2024 02:20 Incident End: 17 April 2024 03:40 Duration: 1 day 1 hours, 20 minutes Affected Services and Features: Google Cloud Composer Regions/Zones: Global Description: Google Cloud Composer users experienced an elevated failure rate when creating, resizing, or upgrading to newer versions of Cloud Composer 2 Environments with “Private IP” configuration. This was due to an inadvertent issue introduced by a recent change to the latest stable Container Operating System (COS) image used by Cloud Composer in one of its workloads. Existing Private IP environments continued to operate normally if they were not upgraded or resized. Google engineers executed a rollback of the change to mitigate the issue on 17 April 2024 at 03:40 US/Pacific. Google will complete a full Incident Report in the following days that will provide a detailed root cause. Customer Impact:
Additional details:
If you are one of the customers that experienced upgrade failure during the incident and still continue to have issues with further upgrades, please reach out to Google Cloud Support using https://cloud.google.com/support for assistance with recovery. Alternatively, you can recreate these environments. |
| 17 Apr 2024 | 04:41 PDT | The issue with Google Cloud Composer has been resolved for all affected users as of Wednesday, 2024-04-17 04:09 US/Pacific. Users are now able to create new Composer Environment and upgrade existing ones. Some failed upgrades during the duration of the incident may have been automatically recovered. If you're still experiencing issues, please contact us via a customer support case for our repair procedure. We thank you for your patience while we worked on resolving the issue. |
| 17 Apr 2024 | 03:46 PDT | Summary: Creation and Upgrades are failing for some Environments while using Cloud Composer 2 Description: Mitigation work is currently underway by our engineering team. We do not have an ETA for mitigation at this point. We will provide more information by Wednesday, 2024-04-17 05:30 US/Pacific. Diagnosis: Impacted customers may experience issues with creating new Composer Environments and upgrades for existing Composer Environments. Existing Composer 2 Environments, if not modified, should function correctly. Workaround: None at this time. |
| 17 Apr 2024 | 01:45 PDT | Summary: Creation and Upgrades are failing for some Environments while using Cloud Composer 2 Description: Customers might experience issues with creating or upgrading to newer versions of Cloud Composer 2 Environments.The problem exists for the “Private IP” Composer Environment. We have identified the root cause of the above issue, and are working on a fix. Users are requested to refrain from performing upgrade operations until mitigation has been confirmed. We will provide more information by Wednesday, 2024-04-17 04:00 US/Pacific. Diagnosis: Impacted customers may experience issues with creating new Composer Environments and upgrades for existing Composer Environments. Existing Composer 2 Environments, if not modified, should function correctly. Workaround: None at this time. |
- All times are US/Pacific