Root Cause Analysis:
On May 5, 2025, starting at approximately 15:33 UTC, customers, primarily in Qlik Cloud - US, may have experienced login failures, difficulty opening apps, browsing in the Hub, or issues with reload and automation functionality.
This incident was triggered by a license validation check, which caused an unexpected increase in traffic. Upon reverting the change in the Qlik Cloud - US region, Qlik encountered a traffic-related untested race condition that significantly extended the duration of the impact and inhibited restoration of service. Other regions were briefly affected, but did not encounter this condition, resulting in rapid service restoration outside the U.S.
This was not a malicious event, and no customer data was compromised.
Resolution and Remediations:
The resolution involved restoring the previous version of the service, recycling impacted nodes, and scaling up capacity to stabilize operations.
To further enhance platform resiliency, Qlik is expanding validation coverage and optimizing the handling of cross-cluster license checks. Deployment testing is also being refined to increase visibility into potential edge-case behaviors. We appreciate your patience and trust as we continue improving the resilience and reliability of our platform.