Reconciliation of some applications fails with the error etcdserver: leader changed.

  • If reconciliation of some VMware Telco Cloud Service Assurance applications (for example, strimzi-kafka and kafka-edge) fails with the error etcdserver: leader changed, recover by deleting the Strimzi cluster operator pod with the following command:

    kubectl get pods -n tps-system | grep -E "strimzi-cluster-operator" | cut -f1 -d' ' | xargs kubectl delete pod -n tps-system

  • If reconciliation of some VMware Telco Cloud Service Assurance applications (for example, airflow and grafana) fails with the error Failed with reason BackoffLimitExceeded, recover by deleting the jobs listed below for the affected service.

    • For the airflow service, delete the following jobs:
      kubectl delete job -n tcsa-system airflow-run-airflow-migrations
      kubectl delete job -n tcsa-system airflow-create-user
    • For the grafana service, delete the following job:
      kubectl delete job -n tcsa-system tcx-grafana-deployer-job
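
The recovery steps above can be collected into a small shell sketch. The function names recover_leader_changed and recover_backoff_limit are hypothetical and not part of the product; the namespaces, the pod name pattern, and the job names are taken from the commands above. The sketch assumes kubectl is already configured for the affected cluster.

```shell
#!/usr/bin/env bash
# Illustrative helpers only; the function names are assumptions, not product tooling.

# etcdserver: leader changed
# Find the Strimzi cluster operator pod(s) in tps-system and delete them.
recover_leader_changed() {
  local pods pod
  pods=$(kubectl get pods -n tps-system | awk '/strimzi-cluster-operator/ {print $1}')
  for pod in $pods; do
    kubectl delete pod -n tps-system "$pod"
  done
}

# Failed with reason BackoffLimitExceeded
# Delete the jobs for the affected service (airflow or grafana).
recover_backoff_limit() {
  case "$1" in
    airflow)
      kubectl delete job -n tcsa-system airflow-run-airflow-migrations
      kubectl delete job -n tcsa-system airflow-create-user
      ;;
    grafana)
      kubectl delete job -n tcsa-system tcx-grafana-deployer-job
      ;;
    *)
      echo "unknown service: $1" >&2
      return 1
      ;;
  esac
}
```

After sourcing the file, running recover_leader_changed handles the first case, and recover_backoff_limit airflow or recover_backoff_limit grafana handles the second. Any other service name is rejected with a non-zero exit status rather than guessed at.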