Repairing Failed Pods in Kubernetes

1 hour
  • 4 Learning Objectives

About this Hands-on Lab

As a Kubernetes Administrator, you will come across broken pods. Being able to identify the issue and quickly fix the pods is essential to maintaining uptime for your applications running in Kubernetes. In this hands-on lab, you will be presented with a number of broken pods. You must identify the problem and take the quickest route to resolving it so that your cluster is back up and running.

Learning Objectives

Successfully complete this lab by achieving the following learning objectives:

Identify the broken pods.

Use the following command to see what’s in the cluster:

kubectl get all --all-namespaces
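
On a busy cluster, the full listing can be long. As an optional sketch, the hypothetical helper below (not part of the lab) filters pod listings down to rows that are not healthy. It assumes the default column layout of kubectl get pods --all-namespaces, where STATUS is the fourth column. A pod can also report Running while its containers are not ready, so treat this as a first pass only.

```shell
# Hypothetical helper: reads `kubectl get pods --all-namespaces` output on
# stdin and keeps the header row plus every pod whose STATUS column (field 4
# in the default layout) is neither Running nor Completed.
unhealthy_pods() {
  awk 'NR == 1 || ($4 != "Running" && $4 != "Completed")'
}

# Usage, on a machine with cluster access:
#   kubectl get pods --all-namespaces | unhealthy_pods
```
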
Find out why the pods are broken.

Use the following command to inspect the pod and view the events:

kubectl describe pod <pod_name> -n web
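
The describe output is lengthy, and the cause of a failure usually appears in the Events section at the bottom. As a small sketch, the hypothetical helper below prints only that section. It assumes the section starts with a line beginning with "Events:", which is how kubectl describe pod normally formats it.

```shell
# Hypothetical helper: reads `kubectl describe pod` output on stdin and prints
# everything from the "Events:" line to the end of the output.
pod_events() {
  awk '/^Events:/ { show = 1 } show'
}

# Usage, on a machine with cluster access:
#   kubectl describe pod <pod_name> -n web | pod_events
```
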
Repair the broken pods.
  1. Use the following command to repair the broken pods in the most efficient manner:

    kubectl edit deploy nginx -n web
  2. Where it says image: nginx:191, change it to image: nginx. Save and exit.

  3. Verify the repair is complete:

    kubectl get po -n web
  4. See the new replica set:

    kubectl get rs -n web
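
If you prefer not to open an editor, the same change can be sketched non-interactively with kubectl set image followed by kubectl rollout status. The function name below is hypothetical, and the container name passed to it is an assumption: the lab does not state what the container in the nginx deployment is called, so confirm it first with kubectl get deploy nginx -n web -o jsonpath='{.spec.template.spec.containers[*].name}'.

```shell
# Hypothetical helper: sets a new image on one container of a deployment,
# then waits for the rollout to finish.
#   $1 = namespace, $2 = deployment, $3 = container name, $4 = image
set_deploy_image() {
  kubectl set image "deployment/$2" "$3=$4" -n "$1" &&
    kubectl rollout status "deployment/$2" -n "$1"
}

# Usage, assuming the container is named "nginx" (verify this first):
#   set_deploy_image web nginx nginx nginx
```

This has the same effect as the edit above: the deployment's pod template changes, so a new replica set is created and the old pods are replaced.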
Ensure pod health by accessing the pod directly.
  1. List the pods including the IP addresses:

    kubectl get po -n web -o wide
  2. Start a busybox pod:

    kubectl run busybox --image=busybox --rm -it --restart=Never -- sh
  3. Use the following command to access the pod directly via its container port, replacing POD_IP_ADDRESS with an appropriate pod IP:

    wget -qO- POD_IP_ADDRESS:80
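
When the deployment has several replicas, checking each IP by hand gets repetitive. As a rough sketch, the hypothetical function below can be pasted into the busybox shell to test a list of pod IPs in one pass. It assumes the pods serve HTTP on the given port and uses only wget options that BusyBox provides (-q, -T, -O). The IPs themselves come from the kubectl get po -n web -o wide listing above.

```shell
# Hypothetical helper: requests the root page of each pod IP on the given
# port and reports OK or FAILED per pod.
#   $1 = port, remaining arguments = pod IP addresses
# Returns non-zero if any pod did not answer successfully.
check_pods() {
  port="$1"
  shift
  failed=0
  for ip in "$@"; do
    # -q: quiet, -T 5: give up after 5 seconds, -O /dev/null: discard the body
    if wget -q -T 5 -O /dev/null "http://$ip:$port/"; then
      echo "$ip OK"
    else
      echo "$ip FAILED"
      failed=1
    fi
  done
  return "$failed"
}

# Usage inside the busybox pod, replacing the placeholders with real pod IPs:
#   check_pods 80 POD_IP_ADDRESS_1 POD_IP_ADDRESS_2
```
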

Additional Resources

You have been given access to a three-node cluster. Within that cluster, there are a number of failing pods. You must discover why they aren’t running and repair them as quickly as possible. Once you’ve applied the fix, verify that the pods in the cluster are running and responding to requests. Perform the following tasks to complete this hands-on lab:

  • Identify the broken pods in your cluster.
  • Find out why they are broken.
  • Repair them as quickly as you can.
  • After applying the fix, verify the pods are running.
  • Start a temporary pod that provides the 'wget' utility, such as busybox.
  • Access the pod directly to ensure the health of the pod.

What are Hands-on Labs

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.