## Overview

Our build network suffered partial outages several times in early to mid-September 2017. We know this isn’t the level of reliability you expect from us, so our team discussed the causes and the solutions (both those already in place and those planned), and we would like to share a summary of that conversation with you.

We believe in transparency and welcome any follow-up questions you might have via [our helpdesk](mailto:support@netlify.com).

## Timelines

The timelines for degraded service are reflected in our [status Twitter account](https://twitter.com/netlifystatus) as well as our [status page](https://netlifystatus.com).

The high-level overview is that on three separate days our build network was unable to begin or complete deploys for a period of time. Our web service and UI were generally not affected, except for a few minutes while we deployed backend fixes to mitigate the problems below.

In all three cases the proximate cause varied, but there are two root causes shared by all:

1.  A change to our background processing system caused some of our job handling to queue many requests.
2.  Running those requests in bulk after we repaired the proximate causes made part of our message queueing system fail.
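To illustrate the second root cause, here is a minimal sketch of one common way to replay a backlog without hitting a message queue all at once: publish in bounded batches and pause between them. This is not our production code. The function `drain_in_batches`, its parameters, and the `publish` callback are hypothetical names chosen for this example.

```python
import time
from collections import deque
from typing import Callable, Deque, Iterable


def drain_in_batches(
    backlog: Iterable[str],
    publish: Callable[[str], None],
    batch_size: int = 100,
    pause_seconds: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Replay queued jobs in fixed-size batches, pausing between batches.

    `publish` is a hypothetical callback that hands one job to the message
    queue. Returns the number of jobs published.
    """
    pending: Deque[str] = deque(backlog)
    published = 0
    while pending:
        # Publish at most batch_size jobs in this round.
        for _ in range(min(batch_size, len(pending))):
            publish(pending.popleft())
            published += 1
        # Only pause if there is more work left to replay.
        if pending:
            sleep(pause_seconds)
    return published
```

The batch size and pause would need tuning against what the queue can actually absorb; the point is only that the replay rate is capped rather than unbounded.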

Two of those incidents were additionally affected by another root cause: heavy traffic from spam users, people creating thousands of bogus sites on our service, which added extra load and took things from bad to worse.

We also feel our communication with Netlify users wasn’t timely, which we consider a major failure on our part. That, combined with prematurely declaring the situation resolved more than once (the job backup was solved, but the message queue was about to fail), shows we have a lot of room to improve our communication.

## Present

After the last incident, we deployed a permanent fix for the background processing system stalling to avoid future incidents. We have done some work on the message queueing system that has already improved reliability, but that is still a work in progress. (Read more in the “Future” section below.)

Additionally, we’ve improved the tooling around our status page backend (which is open source if you want to check it out: [https://github.com/netlify/netlify-statuskit](https://github.com/netlify/netlify-statuskit)), so that we can create and manage future incidents more easily and communicate them on our status page.

Finally, we’ve put a lot of safeguards in place to prevent abuse of our system by spammers. These range from rate limiting to better algorithms for detecting that type of user and preventing them from causing the same kinds of service degradation.
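For readers unfamiliar with rate limiting, a token bucket is one widely used approach. The sketch below is a generic illustration and not a description of our actual safeguards. The class `TokenBucket` and its parameters are hypothetical names for this example.

```python
import time
from typing import Callable


class TokenBucket:
    """Generic token-bucket rate limiter (illustrative only)."""

    def __init__(
        self,
        capacity: float,
        refill_per_second: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.capacity = capacity
        self.refill_per_second = refill_per_second
        self.clock = clock
        self.tokens = capacity          # start with a full bucket
        self.last_refill = clock()

    def allow(self, cost: float = 1.0) -> bool:
        """Return True and spend tokens if the request fits in the budget."""
        now = self.clock()
        elapsed = now - self.last_refill
        # Add tokens for the time that has passed, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_second)
        self.last_refill = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False
```

In practice one bucket would be kept per account or per IP address, so that a single abusive user exhausts only their own budget.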

## Future

We are always trying to improve. Right now our improvement priorities are our service, stability and customer experience. Both the communications delay and the build system issues fall into those categories, so we have additional work planned for the immediate future:

**Build System**

-   Segment our build queueing system more thoroughly to prevent the few (spammers) from impacting the many. The groundwork for this change is already in place, but it needs to be carried over to some other systems before it is in full effect.
-   Improve the redundancy in the message queueing system to be more resilient to load spikes and other single node or network partitioning issues.
-   Split our API requests (which drive uploads, builds, and our UI) and our web service requests to our backend into different pools, so that we can tune the balance dynamically and also prevent spikes in the API from affecting the web service at all.
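The queue segmentation idea in the first item above can be sketched as a per-account round-robin queue. This is a simplified illustration under our own assumptions, not the design we are shipping. `SegmentedBuildQueue`, `account_id`, and `build_id` are hypothetical names.

```python
from collections import OrderedDict, deque
from typing import Deque, Optional, Tuple


class SegmentedBuildQueue:
    """Round-robin across accounts so one account cannot starve the rest."""

    def __init__(self) -> None:
        # account_id -> that account's pending builds, in arrival order
        self._queues: "OrderedDict[str, Deque[str]]" = OrderedDict()

    def enqueue(self, account_id: str, build_id: str) -> None:
        self._queues.setdefault(account_id, deque()).append(build_id)

    def dequeue(self) -> Optional[Tuple[str, str]]:
        """Take one build from the account at the front of the rotation."""
        if not self._queues:
            return None
        account_id, builds = next(iter(self._queues.items()))
        build_id = builds.popleft()
        if builds:
            # Account still has work: send it to the back of the rotation.
            self._queues.move_to_end(account_id)
        else:
            del self._queues[account_id]
        return account_id, build_id
```

With a single shared queue, an account that submits thousands of builds delays everyone behind it. With this rotation, each waiting account gets one build served per turn regardless of how many builds any other account has queued.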

**Customer Communication**

-   Train additional team members to handle emergency communications so that anyone available can help us get messages to customers more quickly on Twitter and the status page.
-   Improve the Support [webpage](https://www.netlify.com/support/) to include status indicators.

* * *

We hope this gave you insight into why things went wrong as well as how we plan to address those issues. We value each and every Netlify supporter and would welcome any other feedback or questions via [our helpdesk](mailto:support@netlify.com).
