We have invested today into adding monitoring that catches the edge case that we ran into last night.
Sep 6, 17:54 CEST
Everything is looking good at this moment. We will keep this incident in monitoring state until the end of the day (17:00 CEST).
Sep 6, 11:57 CEST
A fix has been implemented and we are monitoring the results.
Sep 6, 08:38 CEST
The problem is related to only a few instances hitting one of our production machines that was not responding. The load balancer should normally detect that, but didn't. We have removed the machine from the group and are investigating. In the meanwhile, we will add a new machine to the group to be back at full capacity.
Sep 6, 08:28 CEST
We are receiving reports from several instances that Atlas CRM is not loading.
Sep 6, 08:16 CEST