應用Kubernetes失敗案例彙總

ServiceMesher2019-02-24 04:59:01

我們都知道Kubernetes有眾多組件,是個相當複雜的系統。它的生態系統也在不斷髮展,而且正在增加更多的抽象層,如服務網格。。。但是到目前為止,我們都沒聽到多少關於應用 Kubernetes 的失敗案例來幫助我們規避使用這個複雜系統的風險。

下面列舉的是應用 Kubernetes 的各種失敗案例彙總,希望能夠幫助大家減少在應用 Kubernetes 時候遇到的未知風險,這些文章大多比較短,來自:https://github.com/hjacobs/kubernetes-failure-stories

  • Kubernetes Load Balancer Konfiguration – Vorsicht beim Drainen von Nodes (German) - DevOps Hof - blog post 2019

  • On Infrastructure at Scale: A Cascading Failure of Distributed Systems - Target - Medium post January 2019

  • Running Kubernetes in Production: A Million Ways to Crash Your Cluster - Zalando - DevOpsCon Munich 2018

  • Outages? Downtime? - Veracode - blog post 2018

  • NRE Labs Outage Post-Mortem - NRE Labs - blog post 2018

  • A Perfect DNS Storm - Toyota Connected - blog post 2018

  • Kubernetes and the Menace ELB, the tale of an outage - Turnitin - blog post 2018

  • Moving the Entire Stack to K8s Within a Year – Lessons Learned - ThredUP - DevOpsStage 2018

  • AirMap Platform Service Outage - AirMap - incident report 2018

  • Anatomy of a Production Kubernetes Outage - Monzo - KubeCon Europe 2018

  • 101 Ways to "Break and Recover" Kubernetes Cluster - Oath/Yahoo - KubeCon Europe 2018

  • 101 Ways to Crash Your Cluster - Nordstrom - KubeCon North America 2017

  • Major Outage: Current account payments may fail - Monzo - Monzo Community post 2017

  • Search and Reporting Outage - Universe - incident report 2017

  • Our First Kubernetes Outage - Saltside - blog post 2017

  • Our Failure Migrating to Kubernetes - Saltside - blog post 2017

  • SaleMove US System Issue - SaleMove - incident report 2017

點擊閱讀原文跳轉到 GitHub 上瀏覽查看上文鏈接。

相關閱讀推薦

Kubernetes與雲原生2018年終總結及新春展望

kubernetes 資源管理概述

如何從零開始編寫一個Kubernetes CRD

評估Kubernetes中的Serverless框架

如何將雲原生工作負載映射到Kubernetes中的控制器

加入 ServiceMesher 社區