Google Cloud Status Dashboard
This page provides status information on the services that are part of Google Cloud Platform. Check back here to view the current status of the services listed below. If you are experiencing an issue not listed here, please contact Support. Learn more about what's posted on the dashboard in this FAQ. For additional information on these services, please visit cloud.google.com.
Google Kubernetes Engine Incident #19012
We are investigating an issue with Google Kubernetes Engine where some nodes in recently upgraded clusters (see affected versions) may be experiencing elevated numbers of kernel panics
Incident began at 2019-11-04 11:46 and ended at 2019-11-13 15:38 (all times are US/Pacific).
Date | Time | Description | |
---|---|---|---|
Nov 13, 2019 | 15:38 | The issue with Google Kubernetes Engine clusters with node pools experiencing an elevated number of kernel panics has been resolved in a new release of GKE available as of Wednesday, 2019-11-11 16:00 US/Pacific. The fix is contained in the following versions of GKE which is currently rolling out to node pools with auto upgrade enabled [1]. This should complete by Friday, 2019-11-15. Any customer on manual updates will need to manually upgrade their nodes to the following versions.: 1.13.11-gke.14 Please note that this fix has downgraded the version of CoS to cos-73-11647-293-0 [2] as a temporary mitigation, we expect the next release of GKE to have an upgraded kernel and fix for the panics seen in the below releases.. Affected versions were: 1.13.11-gke.9, 1.14.7-gke.14, 1.13.12-gke.1, 1.14.8-gke.1, 1.13.11-gke.11, 1.13.12-gke.2, 1.14.7-gke.17, 1.14.8-gke.2, 1.13.12-gke.3, 1.14.8-gke.6, 1.13.11-gke.12, 1.13.12-gke.4, and 1.14.8-gke.7 We thank you for your patience while we've worked on resolving the issue. [1] - https://cloud.google.com/kubernetes-engine/versioning-and-upgrades#rollout_schedule [2] - https://cloud.google.com/container-optimized-os/docs/release-notes#cos-73-11647-293-0 |
|
The issue with Google Kubernetes Engine clusters with node pools experiencing an elevated number of kernel panics has been resolved in a new release of GKE available as of Wednesday, 2019-11-11 16:00 US/Pacific. The fix is contained in the following versions of GKE which is currently rolling out to node pools with auto upgrade enabled [1]. This should complete by Friday, 2019-11-15. Any customer on manual updates will need to manually upgrade their nodes to the following versions.: 1.13.11-gke.14 Please note that this fix has downgraded the version of CoS to cos-73-11647-293-0 [2] as a temporary mitigation, we expect the next release of GKE to have an upgraded kernel and fix for the panics seen in the below releases.. Affected versions were: 1.13.11-gke.9, 1.14.7-gke.14, 1.13.12-gke.1, 1.14.8-gke.1, 1.13.11-gke.11, 1.13.12-gke.2, 1.14.7-gke.17, 1.14.8-gke.2, 1.13.12-gke.3, 1.14.8-gke.6, 1.13.11-gke.12, 1.13.12-gke.4, and 1.14.8-gke.7 We thank you for your patience while we've worked on resolving the issue. [1] - https://cloud.google.com/kubernetes-engine/versioning-and-upgrades#rollout_schedule [2] - https://cloud.google.com/container-optimized-os/docs/release-notes#cos-73-11647-293-0 |
|||
Nov 11, 2019 | 18:53 | Description: The following fixed versions are now available and should fix the kernel panic issue: 1.13.11-gke.14, 1.13.12-gke.8, 1.14.7-gke.23 and 1.14.8-gke.12. Mitigation work is currently underway by our engineering team to roll out the fixed versions to clusters configured with node auto-update, and is expected to be complete by Wednesday, 2019-11-13. Clusters not configured with node auto-update can be manually upgraded. At this time the following versions are still affected: 1.13.11-gke.9, 1.14.7-gke.14, 1.13.12-gke.1, 1.14.8-gke.1, 1.13.11-gke.11, 1.13.12-gke.2, 1.14.7-gke.17, 1.14.8-gke.2, 1.13.12-gke.3, 1.14.8-gke.6, 1.13.11-gke.12, 1.13.12-gke.4, and 1.14.8-gke.7. We will provide more information as it becomes available or by Wednesday, 2019-11-13 17:00 US/Pacific at the latest. Diagnosis: Affected users may notice elevated levels of kernel panics on nodes running one of the affected versions listed above. Workaround: Users seeing this issue can upgrade to a fixed release. |
|
Description: The following fixed versions are now available and should fix the kernel panic issue: 1.13.11-gke.14, 1.13.12-gke.8, 1.14.7-gke.23 and 1.14.8-gke.12. Mitigation work is currently underway by our engineering team to roll out the fixed versions to clusters configured with node auto-update, and is expected to be complete by Wednesday, 2019-11-13. Clusters not configured with node auto-update can be manually upgraded. At this time the following versions are still affected: 1.13.11-gke.9, 1.14.7-gke.14, 1.13.12-gke.1, 1.14.8-gke.1, 1.13.11-gke.11, 1.13.12-gke.2, 1.14.7-gke.17, 1.14.8-gke.2, 1.13.12-gke.3, 1.14.8-gke.6, 1.13.11-gke.12, 1.13.12-gke.4, and 1.14.8-gke.7. We will provide more information as it becomes available or by Wednesday, 2019-11-13 17:00 US/Pacific at the latest. Diagnosis: Affected users may notice elevated levels of kernel panics on nodes running one of the affected versions listed above. Workaround: Users seeing this issue can upgrade to a fixed release. |
|||
Nov 08, 2019 | 17:10 | Description: This issue was downgraded to an orange category Service Disruption as the number of projects actually affected is very low. Mitigation work is currently underway by our engineering team and is expected to be complete by Wednesday, 2019-11-13. At this time the following versions are still affected: 1.13.11-gke.9, 1.14.7-gke.14, 1.13.12-gke.1, 1.14.8-gke.1, 1.13.11-gke.11, 1.13.12-gke.2, 1.14.7-gke.17, 1.14.8-gke.2, 1.13.12-gke.3, 1.14.8-gke.6, 1.13.11-gke.12, 1.13.12-gke.4, and 1.14.8-gke.7. We will provide more information as it becomes available or by Wednesday, 2019-11-13 17:00 US/Pacific at the latest. Diagnosis: Affected users may notice elevated levels of kernel panics on nodes upgraded to one of the affected versions listed above. Workaround: Users seeing this issue can downgrade to a previous release (not listed in the affected versions above). Users on a Release Channel affected by this issue should reach out to support for assistance with downgrading their nodes. |
|
Description: This issue was downgraded to an orange category Service Disruption as the number of projects actually affected is very low. Mitigation work is currently underway by our engineering team and is expected to be complete by Wednesday, 2019-11-13. At this time the following versions are still affected: 1.13.11-gke.9, 1.14.7-gke.14, 1.13.12-gke.1, 1.14.8-gke.1, 1.13.11-gke.11, 1.13.12-gke.2, 1.14.7-gke.17, 1.14.8-gke.2, 1.13.12-gke.3, 1.14.8-gke.6, 1.13.11-gke.12, 1.13.12-gke.4, and 1.14.8-gke.7. We will provide more information as it becomes available or by Wednesday, 2019-11-13 17:00 US/Pacific at the latest. Diagnosis: Affected users may notice elevated levels of kernel panics on nodes upgraded to one of the affected versions listed above. Workaround: Users seeing this issue can downgrade to a previous release (not listed in the affected versions above). Users on a Release Channel affected by this issue should reach out to support for assistance with downgrading their nodes. |
|||
Nov 07, 2019 | 14:49 | Description: This issue was downgraded to an orange category Service Disruption as the number of projects actually affected is very low. Mitigation work is currently underway by our engineering team and is expected to completed by early next week. At this time the following versions are still affected: 1.13.11-gke.9, 1.14.7-gke.14, 1.13.12-gke.1, 1.14.8-gke.1, 1.13.11-gke.11, 1.13.12-gke.2, 1.14.7-gke.17, 1.14.8-gke.2, 1.13.12-gke.3, 1.14.8-gke.6, 1.13.11-gke.12, 1.13.12-gke.4, and 1.14.8-gke.7. We will provide more information as it becomes available or by Friday, 2019-11-08 17:00 US/Pacific at the latest. Diagnosis: Affected users may notice elevated levels of kernel panics on nodes upgraded to one of the affected versions listed above. Workaround: Users seeing this issue can downgrade to a previous release (not listed in the affected versions above). Users on a Release Channel affected by this issue should reach out to support for assistance with downgrading their nodes. |
|
Description: This issue was downgraded to an orange category Service Disruption as the number of projects actually affected is very low. Mitigation work is currently underway by our engineering team and is expected to completed by early next week. At this time the following versions are still affected: 1.13.11-gke.9, 1.14.7-gke.14, 1.13.12-gke.1, 1.14.8-gke.1, 1.13.11-gke.11, 1.13.12-gke.2, 1.14.7-gke.17, 1.14.8-gke.2, 1.13.12-gke.3, 1.14.8-gke.6, 1.13.11-gke.12, 1.13.12-gke.4, and 1.14.8-gke.7. We will provide more information as it becomes available or by Friday, 2019-11-08 17:00 US/Pacific at the latest. Diagnosis: Affected users may notice elevated levels of kernel panics on nodes upgraded to one of the affected versions listed above. Workaround: Users seeing this issue can downgrade to a previous release (not listed in the affected versions above). Users on a Release Channel affected by this issue should reach out to support for assistance with downgrading their nodes. |
|||
Nov 07, 2019 | 08:49 | Description: Mitigation work is currently underway by our engineering team. The mitigation is expected to complete by early next week. At this time the following versions are still affected: 1.13.11-gke.9, 1.14.7-gke.14, 1.13.12-gke.1, 1.14.8-gke.1, 1.13.11-gke.11, 1.13.12-gke.2, 1.14.7-gke.17, 1.14.8-gke.2, 1.13.12-gke.3, 1.14.8-gke.6, 1.13.11-gke.12, 1.13.12-gke.4, and 1.14.8-gke.7. We will provide more information by Monday, 2019-11-11 15:00 US/Pacific. Diagnosis: Kernel panics following an upgrade in GKE version. Workaround: Users seeing this issue may go to a previous patch release (not listed in the affected versions above). |
|
Description: Mitigation work is currently underway by our engineering team. The mitigation is expected to complete by early next week. At this time the following versions are still affected: 1.13.11-gke.9, 1.14.7-gke.14, 1.13.12-gke.1, 1.14.8-gke.1, 1.13.11-gke.11, 1.13.12-gke.2, 1.14.7-gke.17, 1.14.8-gke.2, 1.13.12-gke.3, 1.14.8-gke.6, 1.13.11-gke.12, 1.13.12-gke.4, and 1.14.8-gke.7. We will provide more information by Monday, 2019-11-11 15:00 US/Pacific. Diagnosis: Kernel panics following an upgrade in GKE version. Workaround: Users seeing this issue may go to a previous patch release (not listed in the affected versions above). |