/
Flink-jobmanager stuck in CLBO

Flink-jobmanager stuck in CLBO

Sometimes you may see flink-jobmanager crashing and not coming up, which means one of the checkpoint snapshots is corrupted. To fix it you can use one of the following methods:

In Rancher

  1. Scale down flink-jobmanager deployment to 0.

  2. Scale down flink-taskmanager deployment to 0.

  3. Go to storage/configmaps

  4. Delete

    1. gv-flink-cluster-config-map

    2. gv-flink-*-config-map

  5. Scale up flink-taskmanager deployment to 1.

  6. Wait 15-20 sec

  7. Scale up flink-jobmanager deployment to 1.

In terminal

kubectl scale --replicas=0 deployment/flink-jobmanager kubectl scale --replicas=0 deployment/flink-taskmanager kubectl get configmap -n default | grep "gv-flink" kubectl delete configmap gv-flink-cluster-config-map -n default kubectl delete configmap gv-flink-*-config-map -n default <--- insert config map names from previous command kubectl scale --replicas=1 deployment/flink-taskmanager sleep 20 kubectl scale --replicas=1 deployment/flink-jobmanager

Related content

Focus Deployment Steps
Focus Deployment Steps
Read with this
Redeploy Cluster-cattle-agent pod
Redeploy Cluster-cattle-agent pod
More like this
Synergy Server Installation Guide
Synergy Server Installation Guide
Read with this
Rancher-webhook stuck in pending-upgrade
Rancher-webhook stuck in pending-upgrade
More like this
Focus Admin Guide
Focus Admin Guide
Read with this
Configuring Rancher and Fleet agent to run behind a HTTP proxy if cluster was previously registered
Configuring Rancher and Fleet agent to run behind a HTTP proxy if cluster was previously registered
Read with this

Classified as Getvisibility - Partner/Customer Confidential