Service Health

This page provides status information on the services that are part of Google Cloud. Check back here to view the current status of the services listed below. If you are experiencing an issue not listed here, please contact Support. Learn more about what's posted on the dashboard in this FAQ. For additional information on these services, please visit https://cloud.google.com/.

Incident affecting Vertex Gemini API

Vertex Gemini API serving the gemini-1.5-flash model may experience elevated latency or errors in multiregions/us

Incident began at 2024-12-11 17:15 and ended at 2024-12-13 06:03 (all times are US/Pacific).

Date Time Description
15 Dec 2024 23:21 PST

Mini Incident Report

We apologize for the inconvenience this service disruption/outage may have caused. We would like to provide some information about this incident below. Please note, this information is based on our best knowledge at the time of posting and is subject to change as our investigation continues. If you have experienced impact outside of what is listed below, please reach out to Google Cloud Support using https://cloud.google.com/support.

(All Times US/Pacific)

Incident Start: 11 December 2024 17:15

Incident End: 13 December 2024 06:03

Duration: 1 day, 12 hours, 48 minutes

Affected Services and Features:

Vertex Gemini API

Regions/Zones: multiregion-us

Description:

The Vertex Gemini API experienced elevated latency and errors for the gemini-1.5-flash-002 model in multiregion-us for a duration of 1 day, 12 hours, and 48 minutes. From preliminary analysis, the root cause of the issue was a significant increase in decode processing load stemming from a large spike in response content heavy traffic.

Google engineers mitigated the issue by increasing processing capacity and updating a resource allocation configuration to alleviate stress on processing.

Customer Impact:

Customers would have experienced increased latency or errors for the gemini-1.5-flash-002 model in multiregion-us.

13 Dec 2024 10:54 PST

The issue with Vertex Gemini API has been resolved for all affected users as of Friday, 2024-12-13 09:00 US/Pacific.

We thank you for your patience while we worked on resolving the issue.

13 Dec 2024 09:51 PST

Summary: Vertex Gemini API serving the gemini-1.5-flash model may experience elevated latency or errors in multiregions/us

Description: We are experiencing an issue with Vertex Gemini API serving the gemini-1.5-flash model beginning on Wednesday, 2024-12-11 18:00 US/Pacific.

Our engineering team has mitigated the issue and are showing signs of recovery. We will continue to monitor the service for stability.

We will provide an update by Friday, 2024-12-13 11:00 US/Pacific with current details.

Diagnosis: Impacted customers may experience elevated latency or errors.

Workaround: None at this time.

13 Dec 2024 06:12 PST

Summary: Vertex Gemini API serving the gemini-1.5-flash model may experience elevated latency or errors in multiregions/us

Description: We are experiencing an issue with Vertex Gemini API serving the gemini-1.5-flash model beginning on Wednesday, 2024-12-11 18:00 US/Pacific.

Our engineering team continues to investigate the issue.

We will provide an update by Friday, 2024-12-13 10:00 US/Pacific with current details.

We apologize to all who are affected by the disruption.

Diagnosis: Impacted customers may experience elevated latency or errors.

Workaround: None at this time.

13 Dec 2024 05:24 PST

Summary: Vertex Gemini API serving the gemini-1.5-flash model may experience elevated latency or errors in multiregions/us

Description: We are experiencing an issue with Vertex Gemini API serving the gemini-1.5-flash model beginning on Wednesday, 2024-12-11 18:00 US/Pacific.

Our engineering team continues to investigate the issue.

We will provide an update by Friday, 2024-12-13 10:00 US/Pacific with current details.

We apologize to all who are affected by the disruption.

Diagnosis: Impacted customers may experience elevated latency or errors.

Workaround: None at this time.

13 Dec 2024 05:01 PST

Summary: Vertex Gemini API serving the gemini-1.5-flash model may experience elevated latency or errors in multiregions/us

Description: We are experiencing an issue with Vertex Gemini API serving the gemini-1.5-flash model beginning on Wednesday, 2024-12-11 18:00 US/Pacific.

Our engineering team continues to investigate the issue.

We will provide an update by Friday, 2024-12-13 06:00 US/Pacific with current details.

We apologize to all who are affected by the disruption.

Diagnosis: Impacted customers may experience elevated latency or errors.

Workaround: None at this time.