CINXE.COM
Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes - Alibaba Cloud Community
<!DOCTYPE html> <html lang="en" class="sub-site-nav alicloud-header alicloud-footer"> <head> <meta charset="UTF-8"> <title>Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes - Alibaba Cloud Community</title> <link rel="shortcut icon" href="https://img.alicdn.com/tfs/TB1ugg7M9zqK1RjSZPxXXc4tVXa-32-32.png" /> <meta name="viewport" content="width=device-width, initial-scale=1.0"> <meta name="keywords" content="Container Service,Container Service for Kubernetes,cloud native,Apsara Conference,Kubernetes Service,Apsara Conference 2024,High Availability Stability,ACK One Fleet,ACK One GitOps,High-availability architecture" /> <meta name="description" content="This article shares the high availability and stability architecture of ACK and its best practices."> <meta name="csrf-param" content="yunqi_csrf"/> <meta name="csrf-token" content="BAHS4I3YUD"/> <meta name="data-spm" content="a2c65"> <meta name="aplus-rhost-v" content="sg.mmstat.com"> <meta name="aplus-rhost-g" content="sg.mmstat.com"> <meta http-equiv="X-UA-Compatible" content="ie=edge"> <link rel="stylesheet" type="text/css" href="//g.alicdn.com/??alicloud-components/alicloud-ui3/0.0.7/acUI.css,alicloud-components/acApp/0.0.3/app.css,alicloud-components/i18n/0.0.29/css/en-us/index.css,alicloud-components/iconfont/0.0.7/product-icon.css"> <link rel="stylesheet" type="text/css" href="//g.alicdn.com/aliyun-international/blog-assert/0.0.23/css/detail.css"> <link rel="stylesheet" type="text/css" href="//g.alicdn.com/aliyun-international/blog-assert/0.0.23/css/nav.css"> <link rel="stylesheet" type="text/css" href="//g.alicdn.com/aliyun-international/blog-assert/0.0.23/fonts/iconfont.css"> <link rel="stylesheet" type="text/css" href="https://g.alicdn.com/ali-mod/b-alicloud-v3-bottom/0.0.19/index.css"> <link rel="stylesheet" type="text/css" href="//g.alicdn.com/aliyun-international/blog-assert/0.0.23/fonts/iconfont.css"> <meta property="og:url" content="https://www.alibabacloud.com/blog/best-practices-for-high-availability-stability-of-alibaba-cloud-container-service-for-kubernetes_601779"> <meta property="og:site_name" content="Alibaba Cloud Community"> <meta property="og:title" content="Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes"> <meta property="og:description" content="This article shares the high availability and stability architecture of ACK and its best practices."> <meta property="og:image" content="https://yqintl.alicdn.com/be5f625925d4fa1696c3bcea8eedb3748ab3e8cd.png"> <meta property="og:image:type" content="image/png"> <meta property="twitter:creator" content="Alibaba Cloud Community"> <meta property="twitter:card" content="summary_large_image"> <meta property="twitter:title" content="Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes"> <meta property="twitter:description" content="This article shares the high availability and stability architecture of ACK and its best practices."> <meta property="twitter:image:src" content="https://yqintl.alicdn.com/be5f625925d4fa1696c3bcea8eedb3748ab3e8cd.png"> <script src="//g.alicdn.com/??alicloud-components/kloud/0.0.31/vendor/requirejs/require.js,alicloud-components/kloud/0.0.1/scripts/vendor/jquery/jquery.min.js,alicloud-components/common/scripts/layout.js,alicloud-components/alicloud-ui3/0.0.7/acUI.js"></script> <script src="//g.alicdn.com/aliyun-international/blog-assert/0.0.23/js/layout.js"></script> </head> <body data-spm="11461447"><script type="text/javascript"> (function (d) { var t=d.createElement("script");t.type="text/javascript";t.async=true;t.id="tb-beacon-aplus"; t.setAttribute("exparams","category=&userid=&aplus&yunid=&yunpk=&channel=&cps="); t.src="//g.alicdn.com/alilog/mlog/aplus_v2.js"; d.getElementsByTagName("head")[0].appendChild(t); })(document); </script> <div class="blog-nav"> <div class="container"> <div class="row"> <h1> Community </h1> <main class="blog-nav-center"> <a href="https://www.alibabacloud.com/blog/" class="bg"> Blog </a> <a href="https://resource.alibabacloud.com/event/index"> Events </a> <a href="https://resource.alibabacloud.com/webinar/index.htm"> Webinars </a> <a href="https://community.alibabacloud.com/tags/type_blog-tagid_28404/"> Tutorials </a> <a href="https://www.alibabacloud.com/forum"> Forum </a> </main> <ul class="blog-nav-right"> <li class="search"><input type="text" placeholder="Search" id="search"> <i class="search-btn k-iconfont icon-sousuo1"></i> <div class="close-box"><img data-original="https://img.alicdn.com/tfs/TB1BIBBsbPpK1RjSZFFXXa5PpXa-24-24.png" data-toggle="lazy-loading" class="off" /><img data-original="https://img.alicdn.com/tfs/TB1vrJ2shnaK1RjSZFBXXcW7VXa-24-24.png" data-toggle="lazy-loading" class="on" /></div> </li> </ul> <div class="blog-nav-right-m"> <i class="k-iconfont icon-sousuo1 show-search"></i> <i class="show-more"></i> </div> </div> <div class="blog-nav-main-m"> <ol> <li><a href="https://community.alibabacloud.com">Blog</a></li> <li> <a href="https://resource.alibabacloud.com/event/index"> Events </a> </li> <li> <a href="https://resource.alibabacloud.com/webinar/index.htm"> Webinars </a> </li> <li> <a href="https://www.alibabacloud.com/getting-started/projects"> Tutorials </a> </li> <li> <a href="https://www.alibabacloud.com/forum"> Forum </a> </li> </ol> <div class="btn-box"> <a href="https://account.alibabacloud.com/register/register.htm?from_type=yqclub&oauth_callback=https%3A%2F%2Fwww.alibabacloud.com%2Fblog%2F601779%3Fdo%3Dlogin" class="free" style="display: block;"> Create Account </a> <a href="https://account.alibabacloud.com/login/login.htm?from_type=yqclub&oauth_callback=https%3A%2F%2Fwww.alibabacloud.com%2Fblog%2F601779%3Fdo%3Dlogin" class="login" style="display: block;"> Log In </a> </div> </div> <div class="container blog-nav-search-m"> <div class="blog-nav-search-m-top"> <input type="text" placeholder="Search" class="int-search"> <button> <i class="k-iconfont icon-sousuo1"></i> </button> <span> 脳 </span> </div> </div> </div> </div> <div class="wrap container"> <div class="wrap-top"> <a href="https://community.alibabacloud.com">Community</a> <i class="icon icon-more"></i> <a href="https://www.alibabacloud.com/blog/">Blog</a> <i class="icon icon-more"></i> Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes </div> <div class="wrap-main"> <div class="col-md-8"> <div class="wrap-main-left"> <h1> Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes </h1> <aside> <main> <a href="https://community.alibabacloud.com/users/5883248061556614">Alibaba Container Service</a> <span>November 21, 2024</span> <span> <img src="https://img.alicdn.com/tfs/TB19L9AbXuWBuNjSspnXXX1NVXa-40-26.png" alt=""> 1,111 </span> <a href="#comment"> <i class="icon icon-pinglun"></i><b class="comments-num">0</b> </a> </main> <div> </div> </aside> <div class="wrap-main-left-abstract"> This article shares the high availability and stability architecture of ACK and its best practices. </div> <div class="wrap-main-left-article markdown-body"> <p><strong><em>Watch the replay of the <a href="https://www.alibabacloud.com/en/apsara-conference?spm=a2c65.11461447.0.0.44b93272FEyUa0&_p_lc=1" target="_blank">Apsara Conference 2024 at this link</a>!</em></strong></p> <p><em>This article is based on Liu Jiaxu's speech at the Apsara Conference 2024.</em></p> <h2>Introduction</h2> <p>As cloud-native technology continues to develop rapidly and be applied in depth in the enterprise IT field, the high-availability architecture in cloud-native scenarios is increasingly important for the availability, stability, and security of enterprise services. Through proper architecture design and technical support from the cloud platform, the cloud-native high-availability architecture can provide many advantages, including high availability, elastic scalability, simplified O&M management, and improved reliability and security, providing enterprises with a more reliable and efficient application runtime environment.</p> <p>As one of the core technologies of cloud-native, Kubernetes provides container orchestration and management capabilities, including infrastructure automation, elastic scalability, microservice architecture, and automated O&M. The high-availability architecture of Kubernetes applications is the cornerstone of cloud-native high availability. This article takes <a href="https://www.alibabacloud.com/product/kubernetes?spm=a3c0i.26967091.6791778070.435.6d983c14YEvHoD" target="_blank">Alibaba Cloud Container Service for Kubernetes</a> (ACK) as an example to describe the best practices for high-availability architecture and governance of applications based on ACK.</p> <h2>Error Cases and Pain Points of Kubernetes Clusters in High Availability Scenarios</h2> <p>High-availability architecture disaster recovery design is the cornerstone of the Kubernetes system stability and is of great significance in the production environment. </p> <p>Let's first take a look at the error cases and pain points of Kubernetes clusters in high availability scenarios and how ACK addresses these issues through architecture design, product capabilities, and best practices. </p> <p><img src="https://yqintl.alicdn.com/327cd2c06a1418a083967645078a588a5ce0b8e3.png" alt="_1" title="_1"></p> <h3>Case 1: Cluster nodes are deployed in a single availability zone and services are offline due to zone-level exceptions.</h3> <p>In the scenario where cluster nodes are deployed in a single zone, all nodes in the Kubernetes cluster are deployed in the same zone. A zone-level exception, such as network or hardware failure in a zone, may lead to service unavailability in the entire cluster.</p> <h3>Case 2: Cluster nodes are deployed in multiple zones, and service pods are not configured to be distributed by zone.</h3> <p>With the configuration of pod distribution rules, Kubernetes automatically distributes pods to multiple zones, ensuring that services in the cluster can still run when a zone fails. </p> <p>In the scenario where cluster nodes are deployed in multiple zones, if pods are not configured to be evenly distributed by zone, all or part of your business may be damaged and services may be offline due to the failure of a single zone.</p> <h3>Case 3: Health monitoring and alerting for cluster application availability and zone-level node availability is insufficient.</h3> <p>High-availability monitoring of applications at the Kubernetes level is crucial to your business. The monitoring functionality should trigger alerts or notify self-healing systems to perform repairs even when only partial damage occurs, significantly enhancing the rapid alerting capability by 1-5-10. The zone-level health node monitoring can detect the availability of underlying resources at the cluster level and trigger alerts.</p> <h3>Case 4: Application distribution, traffic control, and high-availability management in multiple clusters are complex.</h3> <p>Application distribution, security policies, traffic control, global monitoring, and job distribution of multiple clusters that are not managed by a centralized management platform will bring significant complexity. In terms of the product capabilities of ACK, you can use ACK One Fleet to uniformly manage multiple cluster instances, improving overall system availability. When problems occur in a single cluster, it can automatically switch to another cluster instance, ensuring stable system operation.</p> <h2>Single-cluster High-availability Architecture of ACK</h2> <p>After summarizing common misconfigurations and pain points in Kubernetes scenarios, let's take a look at the single-cluster high-availability architecture of ACK and how to deal with high-availability stability risks. </p> <p><img src="https://yqintl.alicdn.com/02efbe7a8bb2b2617cd64dd6de59254d70a17031.png" alt="_2" title="_2"></p> <p>Let's take a look at the single-cluster high-availability architecture of ACK. The left side of the picture shows the high-availability architecture diagram of the ACK cluster. In the upper part, you can view the resources in the ACK VPC, including the ACK meta cluster, which is the form of an ACK dedicated cluster, hosting the control plane components and managed components of an ACK managed cluster. The nodes of the ACK meta cluster and managed components that run as pods are distributed across multiple zones to implement disaster recovery with high availability. The lower part displays the resources in your VPC, including ECS, SLB, and ECI. </p> <p>An ACK-managed cluster consists of the control plane and the data plane. Control plane components run as pods in ACK meta clusters and are managed by using the KoK architecture. ACK manages the entire lifecycle of control plane components. Data plane resources are deployed in your VPC and ACK provides users with configurable high-availability capabilities and best practices.</p> <h3>The control plane implements zone-level and node-level high availability.</h3> <p>All control plane components implement high-availability distribution that aligns with the zone capabilities of Alibaba Cloud ECS. In regions with three zones, the SLA for the control plane of ACK Pro-managed clusters is 99.95%. In regions without three zones, the SLA for the control plane of ACK Pro-managed clusters is 99.5% (single-zone fault tolerance is not provided). </p> <p>Take APIServer as an example. Multiple replicas are deployed in high availability mode across zones and nodes. The failure of any zone does not affect service availability. In addition, it supports enhanced etcd partition governance capabilities, that is, APIServer automatically detects the health status of the backend etcd endpoints and automatically removes endpoints with No Leader exceptions. Even if a network partition exception occurs in etcd, ACK APIServer still serves normally. The overall control plane is based on the KoK architecture and automatically manages managed components in the form of pods, including automated forced cross-zone distribution, liveness health check, self-healing, adaptive replica scaling, upgrade management, and automatic migration of abnormal nodes.</p> <h3>The data plane allows customers to configure various high-availability policies and best practices.</h3> <p>In the data plane, ACK combines the native scheduling capabilities of Kubernetes, such as Topology Spread Constraints, and Alibaba Cloud service capabilities to support pod placement across different failure domains, including nodes, deployment sets, and availability zones, implementing various levels of high availability strategies. For application loads, you can use Kubernetes health check, self-healing, and PDB to improve the stability of application loads. Cloud resources such as load balancers, virtual machine nodes, and cloud disks support multi-zone high availability configurations in Kubernetes scenarios and the corresponding containerized configuration interface. The following section describes the best practices for data plane high availability.</p> <h2>Best Practices for Single-cluster High Availability - Node/ Zone High Availability</h2> <p><img src="https://yqintl.alicdn.com/e24fbeb494948fd1fd934ffed67db37211eb05f9.png" alt="_3" title="_3"></p> <p>The upper-left part is a diagram that illustrates the distribution scheduling of pods across nodes, deployment sets, and availability zones, and their disaster recovery capabilities. Pods should be distributed by node and zone as much as possible. If needed, they can be more strictly distributed across nodes within deployment sets.</p> <h3>Business pods distributed across nodes</h3> <p>Configure a node-based anti-affinity scheduling policy for pods to distribute pods by node, achieving high availability at the node level.</p> <h3>Business pods distributed across deployment set nodes</h3> <p>Configure a deployment set node-based anti-affinity scheduling policy for pods to distribute physical servers, achieving high availability at the physical server level. </p> <p>The deployment set is a policy used to control the distribution of ECS instances. It spreads ECS instances across different physical servers to prevent multiple ECS instances from breaking down when one physical server is down. You can specify a deployment set for a node pool to ensure that ECS instances added to the node pool are spread across different physical servers.</p> <h3>Business pods distributed across multiple zones</h3> <p>Configure a zone-based anti-affinity scheduling policy for pods to evenly distribute pods by zone, achieving high availability at the zone level.</p> <h2>Best Practices for Single-cluster High Availability - Workload High Availability</h2> <p><img src="https://yqintl.alicdn.com/3340667e675b76e8289bd9bfd55c8935014b53cb.png" alt="_4" title="_4"></p> <p>Based on the features of Kubernetes, you can refer to the following best practices to enhance the availability of application loads.</p> <h3>Configure pod topology spread constraints</h3> <p>Topology spread constraints allow <strong>pods to be evenly spread across nodes and zones</strong> to improve the availability and stability of applications. </p> <p>This feature is applicable to workloads such as Deployment, StatefuSet, DaemonSet, Job, and CronJob.</p> <h3>Configure pod anti-affinity</h3> <p>Pod anti-affinity is used to <strong>schedule pods to different nodes</strong> to improve the high availability and fault isolation of applications.</p> <h3>Configure pod disruption budgets</h3> <p>Pod disruption budgets allow you to define the minimum number of replicated pods that must be retained on a node. When a node is under maintenance or faulty, the cluster ensures that at least the specified number of pods are still running on the node. <strong>You can also configure pod disruption budgets to avoid the issue that an excessive number of replicated pods are concurrently terminated. Pod disruption budgets are suitable for scenarios where multiple replicated pods are deployed to process business traffic.</strong></p> <h3>Configure pod health check and self-healing</h3> <p>You can use the following probes to monitor and manage the status and availability of containers, including liveness probes, readiness probes, and startup probes.</p> <p>Liveness: Liveness probes are used to <strong>determine when to restart a container.</strong> </p> <p>Readiness: Readiness probes are used to determine <strong>whether a container is ready to receive traffic.</strong> </p> <p>Startup: Startup probes are used to <strong>determine when to start a container</strong>.</p> <h2>Best Practices for Single-cluster High Availability -</h2> <p>High Availability Configuration of Container Registry Enterprise Edition</p> <p><img src="https://yqintl.alicdn.com/b3ae50e23d45ea27f6f55c525489bb3467a5b828.png" alt="_5" title="_5"></p> <p>The high availability configuration of Container Registry Enterprise Edition includes two best practices: zone disaster recovery and cross-region disaster recovery.</p> <h3>Zone disaster recovery: Use Container Registry Enterprise Edition and zone-redundant storage (ZRS) OSS buckets</h3> <p>The production environment uses <strong>Container Registry Enterprise Edition</strong> instead of <strong>Container Registry Personal Edition because the former supports capabilities such as high availability and security scanning</strong>. </p> <p>For <strong>Container Registry Enterprise Edition</strong>, in regions where OSS supports zone-redundant storage, when you create an instance, <strong>OSS buckets that support zone-redundant storage are created by default to achieve high availability across zones</strong>. If OSS newly supports zone-redundant storage in a region, you can <strong>convert the bucket to zone-redundant storage in the OSS console</strong> to implement zone-redundant storage for the Container Registry.</p> <h3>Cross-region disaster recovery: Use the multi-region Container Registry Enterprise Edition to configure geo-disaster recovery</h3> <p>Container Registry Enterprise Edition is activated in at least two different regions. You can push container images to multiple Container Registry Enterprise Edition instances in different regions at the same time to implement geo-disaster recovery. </p> <p>The process is as follows:</p> <ol> <li>Configure the same <strong>custom endpoint</strong> for the instances in different regions and use the custom endpoint to pull container images in the cluster.</li> <li>Configure <strong>image synchronization rules</strong> for instances in different regions to ensure that core business images exist on instances in different regions.</li> <li>Configure <strong>access control lists (ACLs)</strong> for the instances.</li> <li>Modify the <strong>domain name resolution setting</strong> to implement geo-disaster recovery.</li> </ol> <h2>Best Practices for Single-cluster High Availability - Cloud Resource High Availability and Kubernetes Configuration Interface</h2> <p><img src="https://yqintl.alicdn.com/6096687880314896a7ddb9223384e337b2b42da8.png" alt="_6" title="_6"></p> <p>The high availability capabilities of Alibaba Cloud products provide a Kubernetes configuration interface so that users can flexibly configure high availability capabilities as needed. </p> <p>Take the load balancer high availability configuration as an example. Let's look at how the high-availability configuration of cloud products is revealed through the container interface. Load balancer cloud products, including CLB, NLB, and ALB, support cross-zone disaster recovery. You can use the <strong>Kubernetes Service Annotation</strong> to <strong>specify primary and secondary zones for a CLB instance and specify multiple zones for an NLB instance</strong>. You can <strong>use ALBConfig to specify multiple zones for an ALB instance</strong>. Note that zones should be the same as the zones of ECS nodes in a node pool. This reduces cross-zone data transfer and enhances network access performance. You can search for the containerized configuration methods of cloud products on the Alibaba Cloud official website.</p> <h2>Best Practices for Single-cluster High Availability - Monitoring and Alerting of Application Availability and Node Availability in a Zone</h2> <p><img src="https://yqintl.alicdn.com/7a24e444471a73e2cfcdfd7ccb34ee26c8829b83.png" alt="_7" title="_7"></p> <h3>Configure monitoring and alerting for application load replica unavailability</h3> <p>Based on metrics related to workload replicas in kube-state-metrics:</p> <p>kube_deployment_status_replicas_unavailable<br>kube_deployment_status_replicas<br>kube_daemonset_status_number_unavailable<br>kube_statefulset_status_replicas<br>kube_statefulset_status_replicas_available </p> <p>and so on</p> <p>the number of unavailable replicas and the total number of replicas of the Deployment, StatefulSet, or DaemonSet of application load are aggregated and analyzed. </p> <p>Based on these metrics, you can <strong>check whether an application has unavailable replicas and the percentage of unavailable replicas to the total number of replicas</strong>, to implement monitoring and alerting for <strong>services being partially or completely affected. ACK enables integration by default.</strong> </p> <p>The following example shows the content of a Prometheus alert:</p> <p><img src="https://yqintl.alicdn.com/6f3fdaa7d865fcba3530279c220c53b2b318edb0.png" alt="1" title="1"></p> <h3>Monitoring and alerting for the percentage of unhealthy nodes in a cluster zone</h3> <p>The kube-controller-manager components of Kubernetes count the number of unhealthy nodes, the percentage of healthy nodes, and the total number of nodes in a zone. You can configure related alerts. </p> <p>The following example shows the content of a Prometheus alert:</p> <p><img src="https://yqintl.alicdn.com/b8b635ca16d2ef49b83d58111f0bbefb0e5aaecd.png" alt="2" title="2"></p> <h2>Best Practices for Multi-cluster High Availability - Multi-cluster Management Through ACK One Fleet</h2> <p>With the wide adoption of Kubernetes clusters, enterprises may need to run and manage multiple Kubernetes clusters. This brings challenges such as how to manage multiple clusters, how to use a unified external ingress to access the clusters, and how to schedule resources for the clusters. The Fleet instances of Distributed Cloud Container Platform for Kubernetes (ACK One) are managed by Container Service for Kubernetes (ACK). You can use the Fleet instances to manage Kubernetes clusters that are deployed in different environments in a centralized manner. Fleet instances create a consistent experience in cloud-native application management for enterprises. </p> <p><img src="https://yqintl.alicdn.com/92225c680b14b208e7c301869ee4b61337146912.png" alt="_8" title="_8"></p> <p>This diagram introduces the functions of application distribution, traffic control, security policy, global monitoring, component management, and cluster management for multiple clusters through the unified control plane of ACK One Fleet, efficiently managing public cloud and IDC clusters. </p> <p>Cluster disaster recovery is achieved through multiple zones and multiple clusters. Application disaster recovery is achieved through ACK One GitOps multi-cluster application deployment. Traffic disaster recovery is achieved through ACK One multi-cluster gateway and global ingress in the same city.</p> <h2>Summary</h2> <p><img src="https://yqintl.alicdn.com/316e521eaac3a67ce35716e3d8cf5b278f731863.png" alt="_9" title="_9"></p> <p>High-availability architecture design and best practices in cloud-native scenarios are critical to the availability, stability, and security of enterprise services. They can effectively improve application availability and user experience, and provide fault isolation and fault tolerance capabilities. </p> <p>This article shares the high availability and stability architecture of ACK and its best practices. These practices are based on the high availability capabilities and experience of tens of thousands of ACK Pro clusters on the control plane and data plane across the network. They have been verified and trained in a large-scale online environment with rich scenarios. It is hoped that these experiences can provide reference and help for enterprises with related requirements. At present, ACK's high-availability stability architecture, product capabilities, and best practices have become the cornerstone of ACK cluster stability. ACK will continue to provide customers with cloud-native products and services that are continuously optimized and upgraded in security, stability, performance, and cost.</p> </div> <div class="wrap-main-left-bar"> <span><a href="https://community.alibabacloud.com/tags/type_blog-tagid_23980/">Container Service</a></span> <span><a href="https://community.alibabacloud.com/tags/type_blog-tagid_27753/">Container Service for Kubernetes</a></span> <span><a href="https://community.alibabacloud.com/tags/type_blog-tagid_28580/">cloud native</a></span> <span><a href="https://community.alibabacloud.com/tags/type_blog-tagid_29801/">Apsara Conference</a></span> <span><a href="https://community.alibabacloud.com/tags/type_blog-tagid_31261/">Kubernetes Service</a></span> <span><a href="https://community.alibabacloud.com/tags/type_blog-tagid_37913/">Apsara Conference 2024</a></span> <span><a href="https://community.alibabacloud.com/tags/type_blog-tagid_38294/">High Availability Stability</a></span> <span><a href="https://community.alibabacloud.com/tags/type_blog-tagid_38295/">ACK One Fleet</a></span> <span><a href="https://community.alibabacloud.com/tags/type_blog-tagid_38296/">ACK One GitOps</a></span> <span><a href="https://community.alibabacloud.com/tags/type_blog-tagid_38297/">High-availability architecture</a></span> </div> <div class="wrap-main-left-action"> <main> <a href="#comment"> <i class="icon icon-pinglun"></i> 0 </a> <span class="action-zan" data-islogin="false" data-id="601779" data-already="false" rel="nofollow"> <i class="icon icon-zan"></i> <b>1</b> </span> <span class="action-love" data-islogin="false" data-id="601779" data-already="false" rel="nofollow"> <i class="icon icon-love"></i> <b>0</b> </span> </main> <div> <b>Share on</b> <a href="javascript:;" class="sharer" data-sharer="linkedin" data-url="" title="Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes"> <i class="icon icon-linkedin1"></i> </a> <a href="javascript:;" class="sharer" data-sharer="facebook" data-url="" title="Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes"> <i class="icon icon-lianshu1"></i> </a> <a href="javascript:;" class="sharer" data-sharer="twitter" data-url="" title="Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes"> <i class="icon icon-twitter1"></i> </a> </div> </div> <div class="wrap-main-left-read"> <main> <h2> Read previous post: </h2> <p> <a href="/blog/implementation-of-alibaba-cloud-distributed-cloud-container-platform-for-kubernetes-ack-one_601778"> Implementation of Alibaba Cloud Distributed Cloud Container Platform for Kubernetes (ACK One) </a> </p> </main> <main> <h2> Read next post: </h2> <p> <a href="/blog/use-alibaba-cloud-asm-llmproxy-plug-in-to-ensure-user-data-security-for-large-models_601805"> Use Alibaba Cloud ASM LLMProxy Plug-in to Ensure User Data Security for Large Models </a> </p> </main> </div> <div class="wrap-main-right-user wrap-main-right-user-mobile"> <dl> <dt> <a href="https://community.alibabacloud.com/users/5883248061556614"> <img src="https://yqintl.alicdn.com/img_1449da4b60932ee3e836593f830ed0ae.png?x-oss-process=image/resize,m_fixed,h_64,w_64" alt=""> </a> </dt> <dd> <h1> <a href="https://community.alibabacloud.com/users/5883248061556614"> Alibaba Container Service </a> </h1> <p> 181 posts | 32 followers </p> <a href="#" class="follow-btn" data-islogin="false" data-uid="5883248061556614" data-isfollowed="false" id="follow-btn" rel="nofollow">Follow</a> </dd> </dl> </div> <h3> You may also like </h3> <ul class="wrap-main-left-list"> <li> <span></span> <a href="/blog/cloud-native-best-practices-for-container-technology-implementation_596411"> Cloud-Native: Best Practices for Container Technology Implementation </a> <p> Alibaba Clouder - July 15, 2020 </p> </li> <li> <span></span> <a href="/blog/high-availability-and-performance-best-practices-for-deploying-dify-based-on-ack_601874"> High Availability and Performance: Best Practices for Deploying Dify based on ACK </a> <p> Alibaba Container Service - December 19, 2024 </p> </li> <li> <span></span> <a href="/blog/evolution-of-o%26m-system-in-the-cloud-native-era_598676"> Evolution of O&M System in the Cloud-Native Era </a> <p> Alibaba Cloud Community - March 8, 2022 </p> </li> <li> <span></span> <a href="/blog/oplg-best-observability-practices-of-new-generation-cloud-native_599179"> OPLG: Best Observability Practices of New Generation Cloud-Native </a> <p> Alibaba Cloud Native Community - July 26, 2022 </p> </li> <li> <span></span> <a href="/blog/optimization-on-alibaba-cloud-native-etcd-cluster-management-and-control_598018"> Optimization on Alibaba Cloud-Native Etcd, Cluster Management, and Control </a> <p> Alibaba Developer - August 19, 2021 </p> </li> <li> <span></span> <a href="/blog/interpreting-data-acceleration-in-the-era-of-large-models-balancing-performance-stability-and-consistency_601711"> Interpreting Data Acceleration in the Era of Large Models: Balancing Performance, Stability, and Consistency </a> <p> Alibaba Container Service - October 30, 2024 </p> </li> </ul> <h3 id="comment"> Comments </h3> <div class="wrap-main-left-comments"> <span class="hidden" id="pageCount" data-pageCount="0"></span> </div> <div class="page parent-page"></div> <div class="write-comments"> <textarea name="" id="" cols="30" rows="10" placeholder="Write your comment..."></textarea> <div class="write-comments-btn"> <button class="btn btn-primary add-parent-comment">Post</button> </div> </div> </div> <div class="wrap-main-iconBox"> <a href="javascript:;" class="bg sharer" data-sharer="linkedin" data-url="https://www.alibabacloud.com/blog/best-practices-for-high-availability-stability-of-alibaba-cloud-container-service-for-kubernetes_601779" title="Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes"> <i class="icon icon-linkedin1"></i> </a> <a href="javascript:;" class="sharer" data-sharer="facebook" data-url="https://www.alibabacloud.com/blog/best-practices-for-high-availability-stability-of-alibaba-cloud-container-service-for-kubernetes_601779" title="Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes"> <i class="icon icon-lianshu1"></i> </a> <a href="javascript:;" class="sharer" data-sharer="twitter" data-url="https://www.alibabacloud.com/blog/best-practices-for-high-availability-stability-of-alibaba-cloud-container-service-for-kubernetes_601779" title="Best Practices for High Availability Stability of Alibaba Cloud Container Service for Kubernetes"> <i class="icon icon-twitter1"></i> </a> </div> </div> <div class="wrap-main-right col-md-4"> <div class="wrap-main-right-user wrap-main-right-user-pc"> <dl> <dt> <a href="https://community.alibabacloud.com/users/5883248061556614"> <img src="https://yqintl.alicdn.com/img_1449da4b60932ee3e836593f830ed0ae.png?x-oss-process=image/resize,m_fixed,h_64,w_64" alt=""> </a> </dt> <dd> <h1> <a href="https://community.alibabacloud.com/users/5883248061556614"> Alibaba Container Service </a> </h1> <p> 181 posts | <span class="followers-num">32</span> followers </p> <a href="#" class="follow-btn" data-islogin="false" data-uid="5883248061556614" data-isfollowed="false" id="follow-btn" rel="nofollow">Follow</a> </dd> </dl> </div> <div class="wrap-main-right-box"> <h1> Related Products </h1> <ul> <li> <h2> <a href="https://community.alibabacloud.com/go/1/214"> <img src="https://yqintl.alicdn.com/img_8bd74827341a73ecfaf3ba88d5274f38.png" alt=""> Container Service for Kubernetes </a> </h2> <p> Alibaba Cloud Container Service for Kubernetes is a fully managed cloud container management service that supports native Kubernetes and integrates with other Alibaba Cloud products. </p> <a href="https://community.alibabacloud.com/go/1/214" class="btn btn-default"> Learn More </a> </li> <li> <h2> <a href="https://community.alibabacloud.com/go/1/441"> <img src="https://yqintl.alicdn.com/img_8bd74827341a73ecfaf3ba88d5274f38.png" alt=""> ACK One </a> </h2> <p> Provides a control plane to allow users to manage Kubernetes clusters that run based on different infrastructure resources </p> <a href="https://community.alibabacloud.com/go/1/441" class="btn btn-default"> Learn More </a> </li> <li> <h2> <a href="https://community.alibabacloud.com/go/1/322"> <img src="https://yqintl.alicdn.com/img_e4c192f553354bf37ab226b1f9259a62.png" alt=""> Cloud-Native Applications Management Solution </a> </h2> <p> Accelerate and secure the development, deployment, and management of containerized applications cost-effectively. </p> <a href="https://community.alibabacloud.com/go/1/322" class="btn btn-default"> Learn More </a> </li> <li> <h2> <a href="https://community.alibabacloud.com/go/1/220"> <img src="https://yqintl.alicdn.com/img_360bce4ebc844b7613136d246047337a.png" alt=""> Container Registry </a> </h2> <p> A secure image hosting platform providing containerized image lifecycle management </p> <a href="https://community.alibabacloud.com/go/1/220" class="btn btn-default"> Learn More </a> </li> </ul> </div> <div class="wrap-main-right-list"> <div> <p> <b> More Posts </b> <span> by Alibaba Container Service </span> </p> <main> <span> <a href="https://community.alibabacloud.com/users/5883248061556614/article">See All</a> </span> <i class="icon icon-more"></i> </main> </div> <ul> <li> <a href="/blog/best-practices-for-kubernetes-migration-flexible-management-of-resource-backup-for-application-recovery_601996">Best Practices for Kubernetes Migration: Flexible Management of Resource Backup for Application Recovery</a> </li> <li> <a href="/blog/backup-center-helps-enterprises-migrate-kubernetes-container-service-platforms-across-clouds_601979">Backup Center Helps Enterprises Migrate Kubernetes Container Service Platforms Across Clouds</a> </li> <li> <a href="/blog/end-to-end-canary-release-based-on-asm-for-bidirectional-communication-applications-built-with-websocket_601933">End-to-end Canary Release Based on ASM for Bidirectional Communication Applications Built with WebSocket</a> </li> <li> <a href="/blog/use-kmesh-as-the-data-plane-for-alibaba-cloud-service-mesh-asm-in-sidecarless-mode_601926">Use Kmesh as the Data Plane for Alibaba Cloud Service Mesh (ASM) in Sidecarless Mode</a> </li> <li> <a href="/blog/application-distribution-capability-by-ack-one-efficient-multi-cluster-application-management_601899">Application Distribution Capability by ACK One: Efficient Multi-cluster Application Management</a> </li> <li> <a href="/blog/alibaba-cloud-ack-one-auto-scaling-of-cloud-node-pools-cpugpu-in-registered-clusters_601898">Alibaba Cloud ACK One: Auto Scaling of Cloud Node Pools (CPU/GPU) in Registered Clusters</a> </li> <li> <a href="/blog/ack-one-gitops-simplified-multi-cluster-gitops-application-management-with-applicationset-ui_601875">ACK One GitOps: Simplified Multi-cluster GitOps Application Management with ApplicationSet UI</a> </li> <li> <a href="/blog/high-availability-and-performance-best-practices-for-deploying-dify-based-on-ack_601874">High Availability and Performance: Best Practices for Deploying Dify based on ACK</a> </li> <li> <a href="/blog/argo-workflows-3-6-key-new-features-in-cloud-native-orchestration_601872">Argo Workflows 3.6: Key New Features in Cloud-native Orchestration</a> </li> <li> <a href="/blog/cloud-elasticity-provided-by-ack-one-registered-clusters-a-new-tool-for-business-expansion_601871">Cloud Elasticity Provided by ACK One Registered Clusters: A New Tool for Business Expansion</a> </li> </ul> </div> </div> </div> </div> <script type="text/javascript" nonce="5KXG9A72UV"> window.localconfigs = { 'aid': 601779 }; </script> <script type="text/javascript" nonce="5KXG9A72UV"> window.configs = { "csrf-param": "yunqi_csrf", "csrf-token": "BAHS4I3YUD", "islogin": false, "registerurl": "https://account.alibabacloud.com/register/register.htm?from_type=yqclub&oauth_callback=https%3A%2F%2Fwww.alibabacloud.com%2Fblog%2F601779%3Fdo%3Dlogin", "loginurl": "https://account.alibabacloud.com/login/login.htm?from_type=yqclub&oauth_callback=https%3A%2F%2Fwww.alibabacloud.com%2Fblog%2F601779%3Fdo%3Dlogin", "isNeedNickname": false, "baseurl": "/blog" }; </script> <script src="//g.alicdn.com/aliyun-international/blog-assert/0.0.23/js/detail.js"></script> <script src="//g.alicdn.com/aliyun-international/blog-assert/0.0.23/js/nav.js"></script> <script type="text/javascript" nonce="5KXG9A72UV"> (function (i, s, o, g, r, a, m) { i['GoogleAnalyticsObject'] = r; i[r] = i[r] || function () { (i[r].q = i[r].q || []).push(arguments) }, i[r].l = 1 * new Date(); a = s.createElement(o), m = s.getElementsByTagName(o)[0]; a.async = 1; a.src = g; m.parentNode.insertBefore(a, m) })(window, document, 'script', 'https://www.google-analytics.com/analytics.js', 'ga'); ga('create', 'UA-86123020-1', 'auto'); ga('send', 'pageview'); </script> </body> </html>