```diff
@@ -726,7 +726,7 @@ spec:
         - name: DEMO_GREETING
           value: Hello from the environment
         image: dockreg-zdf.int.zwackl.de/alpine/latest/amd64:prod
-        imagePullPolicy: Always
+        imagePullPolicy: IfNotPresent
         name: netcat-daemonset
         ports:
         - containerPort: 23456
```
```diff
@@ -858,12 +858,64 @@ web-2 1/1 Running 0 26
 ds-test-c6xx8   1/1   Running   0   18m
 ds-test-w45dv   1/1   Running   5   28h
```
Kubernetes has a `--pod-eviction-timeout`, a grace period (**default: 5 minutes**) before pods on failed nodes are deleted. This timeout is useful to keep pods on nodes that are only rebooted for maintenance. So, first of all, nothing happens to the pods on a failed node until the *pod eviction timeout* is exceeded. Once the *pod eviction period* times out, Kubernetes re-schedules *workloads* (Deployments, StatefulSets) to working nodes. As *DaemonSets* are bound to a specific node, they will not be re-scheduled on other nodes.
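The currently configured value can be inspected on the `kube-controller-manager` (a sketch, assuming a kubeadm-style cluster where the controller manager runs as a static pod labeled `component=kube-controller-manager`; the flag only shows up if it was set explicitly):
```
$ kubectl -n kube-system get pod -l component=kube-controller-manager -o yaml \
    | grep pod-eviction-timeout
    - --pod-eviction-timeout=5m0s
```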
Kubernetes supports [Pod tolerations](https://kubernetes.io/docs/concepts/scheduling-eviction/taint-and-toleration), which by default are configured with a *timeout* of `300s` (5 minutes!). This means that affected Pods will *remain* on a *broken* node for a timespan of 300s before eviction takes place (docs: https://kubernetes.io/docs/concepts/scheduling-eviction/eviction-policy/):
```
$ kubectl -n <namespace> describe pod <pod-name>
[...]
Tolerations:     node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                 node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
[...]
```
To be more reactive, the Pod tolerations can be configured as follows:
```
kind: Deployment # or StatefulSet
apiVersion: apps/v1
metadata:
  [...]
spec:
  [...]
  template:
    [...]
    spec:
      tolerations:
      - key: "node.kubernetes.io/unreachable"
        operator: "Exists"
        effect: "NoExecute"
        tolerationSeconds: 30
      - key: "node.kubernetes.io/not-ready"
        operator: "Exists"
        effect: "NoExecute"
        tolerationSeconds: 30
[...]
```
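After rolling this out, the shortened tolerations should be visible on every newly created Pod (a sketch, using the same placeholders as above):
```
$ kubectl -n <namespace> describe pod <pod-name>
[...]
Tolerations:     node.kubernetes.io/not-ready:NoExecute op=Exists for 30s
                 node.kubernetes.io/unreachable:NoExecute op=Exists for 30s
[...]
```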
## Keep your cluster balanced <a name="user-content-keep-cluster-balanced"></a>
Kubernetes primarily takes care of high availability, not of a well-balanced Pod-to-node distribution ([this](https://itnext.io/keep-you-kubernetes-cluster-balanced-the-secret-to-high-availability-17edf60d9cb7) project could be a solution).
In case of a `Deployment` or `StatefulSet`, a `topologySpreadConstraint` needs to be specified:
```
kind: Deployment # or StatefulSet
apiVersion: apps/v1
metadata:
  [...]
spec:
  [...]
  template:
    [...]
    spec:
      # Prevent scheduling of more than one pod per node
      topologySpreadConstraints:
      - labelSelector:
          matchLabels:
            app: {{ .Chart.Name }}-{{ .Values.stage }}
        maxSkew: 1
        topologyKey: kubernetes.io/hostname
        whenUnsatisfiable: DoNotSchedule
[...]
```
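Whether the Pods actually ended up on distinct nodes can be verified via the `NODE` column of `kubectl get pods -o wide` (hypothetical output; pod names, IPs, and node names will differ):
```
$ kubectl -n <namespace> get pods -o wide
NAME    READY   STATUS    RESTARTS   AGE   IP           NODE
web-0   1/1     Running   0          5m    10.42.1.12   worker-1
web-1   1/1     Running   0          5m    10.42.2.31   worker-2
```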
`DaemonSet` workloads do not support `topologySpreadConstraints` at all (they place one Pod on each matching node anyway).
## Node maintenance <a name="user-content-node-maintenance"></a>
*Mark* a node for maintenance:
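The standard way is `kubectl cordon`, which marks the node as unschedulable while leaving already-running Pods untouched:
```
# cordon = no new pods will be scheduled onto this node
$ kubectl cordon <node-name>
node/<node-name> cordoned
```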