Onfido experienced an outage on Feb 24th, 14:14 UTC, for 9 minutes. Client API requests to upload documents and create checks returned an error response during this period. While this also impacted our SDK traffic, our logs indicate a very small number of user sessions failed to complete, as they were largely recovered by re-tries.
During preparations to rollout a planned database update, part of the change was inadvertently pushed to our production environment before it was ready. This was due to a misconfigured release pipeline.
14:14 UTC: A change to a database is inadvertently pushed to production
14:15 UTC: The team responsible for this upgrade is alerted to an increase in database errors
14:17 UTC: The problem is identified, and a fix is released
14:22 UTC: The fix is fully applied to impacted regions
The offending pipeline is being corrected and an investigation will be done to assess if any other pipeline suffers from the same misconfiguration. Additional controls will be established to avoid such misconfigurations in future.