The Dashboard was intermittently unavailable in the EU region from 15:30 UTC to 18:30 UTC due to an outage in the underlying data store that backs its search index.
A partition on the data store's shards increased latency on 20% of read and write operations, and the affected operations ultimately timed out. The Dashboard, in turn, failed to load whenever one of these operations reached its timeout.
15:30 UTC: Dashboard (EU) started having intermittent usage issues
16:30 UTC: We identified the root cause in the search index data store
17:45 UTC: Fix was applied to the search index data store
18:30 UTC: Incident was resolved
As an immediate remediation, the partition was manually restored, which returned read and write times to normal. Data that failed to be written during the incident was eventually made consistent automatically.
We are committed to preventing issues like this from having a similar impact in the future. Therefore, we will: