Friday June 10, 2016
Outage
Website and API outage
We have completed the last steps of our immediate remediation of this morning's incident in which web server capacity was exhausted. After the initial incident that impacted Slack's website and API was addressed the problem resurfaced to impact file upload and download; this, too, has been fixed. We are reviewing logs and metrics to ensure we do not experience similar problems again in the future.
12:39 PM PST
And now we believe the Slack website, API, and file upload/download are all back to functioning normally. Monitoring continues as our operations team checks our assumptions for all things related to web tier memory usage. As of now service is completely restored.
11:56 AM PST
The main Slack service and API are confirmed to be functioning well. However our fix has had an impact on serving file uploads and downloads which we are addressing now. They may be very slow or fail outright and we are implementing a fix for this problem immediately.
11:36 AM PST
We believe we've addressed the cause of the terrible Slack outage this morning. We're very sorry for the interruption to your days and we're taking steps now to address the problems uncovered during this incident. We will continue to monitor the situation to ensure our changes completely fix the problem before closing this incident.
11:24 AM PST
We are releasing the fix more widely and ensuring that all our customers get reconnected to Slack. We're continuing to monitor the situation.
11:11 AM PST
We are preparing and testing a potential fix for the problem that has caused Slack's website and API to be down this morning. Thank you for your patience.
11:00 AM PST
We're continuing to search for the source of resource exhaustion that's causing Slack to be unavailable to our customers.
10:40 AM PST
Slack's web application is failing more widely so as to include administrative web pages as well as API calls from Slack clients. We're continuing to work to restore service and allow disconnected users to connect to Slack once again.
10:21 AM PST
API failures continue and due to this many users are unable to connect to Slack. We're working to restore service as quickly as possible.
10:12 AM PST
Slack's web servers are being overwhelmed at the moment and we're working to restore full capacity and get everyone back to using Slack. API requests may respond in error and chat may behave quite slowly in the meantime.
10:06 AM PST
Status
Outage