My previous blog post discussed the VCD Extension for Tanzu Mission Control and covered the end-to-end deployment steps. In this post, I will cover how to troubleshoot a stuck TMC self-managed deployment in VCD.
I was deploying TMC self-managed in a new environment, and during configuration, I made a mistake by passing an incorrect value for the DNS zone, leading to a stuck deployment that did not terminate automatically. I waited for a couple of hours for the task to fail, but the task kept on running, thus preventing me from installing it with the correct configuration.
The deployment was stalled in the Creating phase and did not fail.
On checking the pods in the tmc-local namespace, a lot of them were stuck in either ‘CreateContainerConfigError” or “CrashLoopBackOff” states.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 |
root@jumpbox:~# kubectl get po -n tmc-local | grep CreateContainerConfigError audit-service-consumer-59b6954688-5fhkd 0/1 CreateContainerConfigError 0 6m27s audit-service-consumer-59b6954688-mqkrt 0/1 CreateContainerConfigError 0 6m27s audit-service-server-778d89bf7-lw2ph 0/1 CreateContainerConfigError 0 6m27s audit-service-server-778d89bf7-q5g6z 0/1 CreateContainerConfigError 0 6m27s dataprotection-server-65848fb688-62b46 0/1 CreateContainerConfigError 0 6m24s dataprotection-server-65848fb688-n8cgz 0/1 CreateContainerConfigError 0 6m24s inspection-server-679cccbc57-kfs5z 0/2 CreateContainerConfigError 0 6m23s inspection-server-679cccbc57-pldsb 0/2 CreateContainerConfigError 0 6m23s root@jumpbox:~# kubectl get po -n tmc-local | grep CrashLoopBackOff agent-gateway-server-5c6b5dd5d4-8dvdb 0/1 CrashLoopBackOff 6 (2m18s ago) 8m45s agent-gateway-server-5c6b5dd5d4-qw7bz 0/1 CrashLoopBackOff 6 (2m20s ago) 8m45s api-gateway-server-6c54fd7f86-v4mld 0/1 CrashLoopBackOff 6 (2m19s ago) 8m44s api-gateway-server-6c54fd7f86-xjd55 0/1 CrashLoopBackOff 6 (2m32s ago) 8m44s policy-insights-server-5d5458b76d-t25vx 0/1 CrashLoopBackOff 6 (102s ago) 8m39s policy-insights-server-5d5458b76d-xcgxc 0/1 CrashLoopBackOff 6 (113s ago) 8m39s |
In VCD, when I checked the failed task ‘Execute global ‘post-create’ action,” I found the installer was complaining that the tmc package installation reconciliation failed.… Read More