Service Health

This page provides status information on the services that are part of Google Cloud. Check back here to view the current status of the services listed below. If you are experiencing an issue not listed here, please contact Support. Learn more about what's posted on the dashboard in this FAQ. For additional information on these services, please visit https://cloud.google.com/.

All incidents reported for Vertex AI Training

2024

Summary Date Duration
Vertex AI custom training jobs failing if using more than 2GB ephemeral storage 16 Aug 2024
4 hours, 40 minutes
Cloud TPU Service Activation is impacted. 7 Aug 2024
3 hours, 51 minutes
Multiple Google Cloud Products are experiencing issues in us-west1 14 Feb 2024
3 hours, 7 minutes

2023

Summary Date Duration
We are investigating a potential issue with Vertex AI Online Prediction, Vertex AI Training. 25 Sep 2023
4 minutes
Vertex AI training jobs are experiencing issues where jobs may take longer than usualĀ  22 Jul 2023
7 hours, 48 minutes
Cloud AI Platform and Vertex AI Training elevated error rates for GPU jobs in us-central1, us-east1, and europe-west3 3 Mar 2023
1 day, 1 hour

2021

Summary Date Duration
Global: Jobs failing with internal error for GKE version 1.18 5 Oct 2021
2 hours, 30 minutes
Issue with multiple Google Cloud infrastructure components. 20 May 2021
10 hours, 45 minutes