# Troubleshooting Connectware on Kubernetes

This guide helps you diagnose and resolve common issues with Connectware running on Kubernetes. Follow the sections in order for systematic troubleshooting.

## How to Troubleshoot

When troubleshooting Connectware issues, proceed in the following order:

1. Check pod status to identify obvious failures.
2. Inspect pod events for Kubernetes-level errors.
3. Review logs to identify application-level problems.
4. Collect debug information before making changes.
5. Restart or remove unhealthy pods if appropriate.
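
Assuming `NAMESPACE` holds the namespace of your Connectware installation and `<podname>` is a pod you are investigating, the steps above map to the following commands, each of which is covered in detail in this guide:

{% code lineNumbers="true" %}

```bash
# 1. Check pod status to identify obvious failures
kubectl -n ${NAMESPACE} get pods

# 2. Inspect events of a suspicious pod
kubectl -n ${NAMESPACE} describe pod <podname>

# 3. Review the pod's logs for application-level problems
kubectl -n ${NAMESPACE} logs <podname>

# 4. Collect debug information before making changes
./collect_debug.sh -n ${NAMESPACE}

# 5. Restart an unhealthy pod; its controller recreates it
kubectl -n ${NAMESPACE} delete pod <podname>
```

{% endcode %}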

If you cannot identify or resolve the issue, contact the Cybus support team at **<support@cybus.io>**.

## Prerequisites

Before troubleshooting, ensure that:

* [Helm version 3](https://helm.sh/docs/intro/quickstart/#install-helm) is installed on your system.
* [kubectl](https://kubernetes.io/docs/tasks/tools/#kubectl) is installed on your system.
* You know the name and namespace of your Connectware installation. See [Obtaining the Name, Namespace, and Version of Your Connectware Installation](#obtaining-the-name-namespace-and-version-of-your-connectware-installation).
* You have permissions to view pods, logs, and events.
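
If you do not know the installation name or namespace yet, listing the Helm releases across all namespaces is a quick way to find them. The `grep` filter below assumes your release name contains "connectware":

{% code lineNumbers="true" %}

```bash
# List all Helm releases; the NAME and NAMESPACE columns identify
# your Connectware installation
helm list --all-namespaces

# Narrow the output (assumes the release name contains "connectware")
helm list --all-namespaces | grep -i connectware
```

{% endcode %}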

## Checking Pod Status

Connectware requires all pods to be in `Running` status with all containers ready. Check this by running:

{% code lineNumbers="true" %}

```bash
kubectl get pods
```

{% endcode %}

**Expected output**: All pods show matching values in the **Ready** column, for example `1/1` or `2/2`, and a **Status** of `Running`.

| Name                                     | Ready | Status  | Restarts | Age   |
| ---------------------------------------- | ----- | ------- | -------- | ----- |
| admin-web-app-8649f98fc6-sktb7           | 1/1   | Running | 0        | 3m1s  |
| auth-server-5f46964984-5rwvc             | 1/1   | Running | 0        | 2m39s |
| broker-0                                 | 1/1   | Running | 0        | 2m11s |
| broker-1                                 | 1/1   | Running | 0        | 2m50s |
| connectware-5b948ffdff-tj2x9             | 1/1   | Running | 0        | 2m41s |
| container-manager-5f5678657c-94486       | 1/1   | Running | 0        | 2m46s |
| ingress-controller-85fffdcb4b-m8kpm      | 1/1   | Running | 0        | 2m37s |
| nats-0                                   | 1/1   | Running | 0        | 2m31s |
| nats-1                                   | 1/1   | Running | 0        | 2m30s |
| nats-2                                   | 1/1   | Running | 0        | 2m30s |
| postgresql-0                             | 1/1   | Running | 0        | 2m58s |
| protocol-mapper-69f59f7dd4-6xhkf         | 1/1   | Running | 0        | 2m42s |
| resource-status-tracking-fcd58dc79-cl5nw | 1/1   | Running | 0        | 2m12s |
| resource-status-tracking-fcd58dc79-vlzqs | 1/1   | Running | 0        | 2m22s |
| service-manager-6b5fffd66d-gt584         | 1/1   | Running | 0        | 2m52s |
| system-control-server-bd486f5bd-2mkxz    | 1/1   | Running | 0        | 2m45s |
| welder-robots-0                          | 1/1   | Running | 0        | 2m59s |
| workbench-57d4b59fbb-gqwnb               | 1/1   | Running | 0        | 2m38s |

### Identifying Unhealthy Pods

A pod should be considered unhealthy if it:

* Shows an error state such as `CrashLoopBackOff`, or is stuck in an `Init` state.
* Remains in a transitional state, such as `Pending` or `Terminating`, for an extended time.
* Shows mismatched **Ready** values (e.g., `0/1` instead of `1/1`).
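
To surface such pods quickly, you can filter on the pod phase. Note that successfully completed Job pods (`Succeeded`) also match this filter:

{% code lineNumbers="true" %}

```bash
# List all pods in the namespace that are not in the Running phase
kubectl -n ${NAMESPACE} get pods --field-selector=status.phase!=Running
```

{% endcode %}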

**Example of a pod that is unable to start**

| Name                        | Ready | Status   | Restarts | Age |
| --------------------------- | ----- | -------- | -------- | --- |
| auth-server-b4b69ccfd-fvsmz | 0/1   | Init:0/1 | 0        | 8m  |

### Inspecting Pod Events

To identify the cause of a pod issue:

1. Describe the pod:

{% code lineNumbers="true" %}

```bash
kubectl describe pod <podname>
```

{% endcode %}

2. Review the **Events** section at the bottom of the output.

{% code lineNumbers="true" %}

```txt
Warning  FailedMount  7m4s kubelet Unable to attach or mount volumes: unmounted volumes=[testfail], unattached volumes=[certs testfail kube-api-access-52xmc]: timed out waiting for the condition
```

{% endcode %}

This indicates a cluster-level issue where required volumes are unavailable. Such issues must be resolved at the Kubernetes or infrastructure level and are outside the scope of the Connectware documentation.

If no clear cause is visible, continue with [log inspection](#checking-logs-using-kubetail).

As general guidance:

* Issues immediately after upgrades or configuration changes are often caused by incorrect Helm values.
* Issues appearing later are often related to cluster infrastructure.
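
Recent warning events for the whole namespace can also point you to the failing component without describing each pod individually:

{% code lineNumbers="true" %}

```bash
# Show warning events in the namespace, most recent last
kubectl -n ${NAMESPACE} get events --field-selector type=Warning --sort-by=.lastTimestamp
```

{% endcode %}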

## Checking Logs Using Kubetail

For viewing logs from multiple pods simultaneously, we recommend kubetail, a wrapper around kubectl that aggregates logs from multiple pods. By default, kubetail follows the logs, as `kubectl logs -f` does.

Installation instructions are available at <https://github.com/johanhaleby/kubetail>.

Here are a few examples of how you can use kubetail. For the full list of options, run `kubetail --help`.

### Displaying Logs from All Pods in a Namespace

{% code lineNumbers="true" %}

```bash
kubetail -n ${NAMESPACE}
```

{% endcode %}

### Displaying Logs of Pods That Match a Search Term

{% code lineNumbers="true" %}

```bash
kubetail broker
```

{% endcode %}

### Displaying Logs for Pods That Match a Regular Expression

{% code lineNumbers="true" %}

```bash
kubetail '(service-manager|protocol-mapper)' -e regex
```

{% endcode %}

### Displaying Logs from the Past

You can combine the parameter `-s <timeframe>` with any other command to display logs from the past up to now:

{% code lineNumbers="true" %}

```bash
kubetail broker -s 10m
```

{% endcode %}

### Displaying Logs of a Terminated Container of a Pod

{% code lineNumbers="true" %}

```bash
kubetail broker --previous
```

{% endcode %}

### Displaying Timestamps

If the logs you are viewing are missing timestamps, you can use the parameter `--timestamps` for kubetail to add timestamps to each log line:

{% code lineNumbers="true" %}

```bash
kubetail broker --timestamps
```

{% endcode %}

## Checking Logs Using Kubectl

If you do not want to use kubetail as suggested in the previous section, you can use kubectl to read logs.

Here are a few examples of how you can use it:

### Displaying and Tailing Logs of a Pod

{% code lineNumbers="true" %}

```bash
kubectl logs -f <podname>
```

{% endcode %}

### Displaying and Tailing Logs for All Pods with a Label

{% code lineNumbers="true" %}

```bash
kubectl logs -f -l app=broker
```

{% endcode %}

### Displaying Logs of a Terminated Container of a Pod

{% code lineNumbers="true" %}

```bash
kubectl logs --previous <podname>
```

{% endcode %}

### Displaying Logs from the Past

You can combine the parameter `--since <timeframe>` with any other command to display logs from the past up to now:

{% code lineNumbers="true" %}

```bash
kubectl logs -f --since 10m <podname>
```

{% endcode %}

### Displaying Timestamps

If the logs that you are viewing are missing timestamps, you can use the parameter `--timestamps` for kubectl to add timestamps to each log line:

{% code lineNumbers="true" %}

```bash
kubectl logs -f --timestamps <podname>
```

{% endcode %}

## Removing Unhealthy Pods

When a pod is identified as unhealthy, either through pod status checks or log inspection, first collect the current system state using the debugging script (`collect_debug.sh`) from the Connectware Kubernetes Toolkit. This ensures that diagnostic information is preserved before any changes are made. For more information, see [Collecting Debug Information](#collecting-debug-information).

After collecting debug data, delete the affected pod:

{% code lineNumbers="true" %}

```bash
kubectl delete pod <podname>
```

{% endcode %}

The controller managing the pod will automatically create a new instance. Restarting pods in this way often resolves transient issues and does not delete persisted data.

### Special Considerations for StatefulSet Pods

Pods whose names end with a fixed number, such as `broker-0`, belong to a StatefulSet. Kubernetes handles StatefulSets differently from other workloads. An unhealthy StatefulSet pod is not automatically replaced after configuration changes.

If a StatefulSet pod is unhealthy due to a configuration issue, you must:

1. Correct the configuration.
2. Manually delete the affected pod so it can be recreated with the updated settings.

This behavior is intentional, as StatefulSets often manage persistent or stateful data.

In Connectware, StatefulSets include the broker, nats, postgresql, and any protocol mapper agents that you have defined.
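
To see which StatefulSets exist in your installation, and therefore which pods need this manual deletion step, list them in the namespace:

{% code lineNumbers="true" %}

```bash
# List StatefulSets; their pods carry fixed ordinal suffixes such as -0, -1
kubectl -n ${NAMESPACE} get statefulsets
```

{% endcode %}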

## Collecting Debug Information

The Connectware Kubernetes Toolkit provides a debugging script (`collect_debug.sh`) to gather logs and state information. Run this script to collect diagnostic information about the system status before attempting fixes. If you plan to open a support ticket, the output of this script is required.

### Prerequisites

* The following tools are installed: kubectl, tar, sed, rm, sort, timeout.
* You have access to the target installation via kubectl.

### Downloading the Debugging Script

* You can download the debugging script from <https://download.cybus.io/connectware-k8s-toolkit/latest/collect_debug.sh>.

**Example**

{% code lineNumbers="true" %}

```bash
wget https://download.cybus.io/connectware-k8s-toolkit/latest/collect_debug.sh
chmod u+x ./collect_debug.sh
```

{% endcode %}

### Running the Debugging Script

Use the following parameters to configure the debugging script. For example, you can specify the namespace of your Connectware installation and a custom kubeconfig file if needed.

| Parameter                           | Value                              | Description                                                                 |
| ----------------------------------- | ---------------------------------- | --------------------------------------------------------------------------- |
| `-n`                                | namespace                          | Kubernetes namespace containing the Connectware installation                |
| `-k`                                | path to kubeconfig file            | Kubeconfig file to use instead of the default (`~/.kube/config`)            |
| `-c`                                | kubeconfig context name            | Kubeconfig context to use instead of the currently active context           |
| `--skip-debug-containers`           | none (flag)                        | Prevents the script from running debug containers on the Kubernetes cluster |
| `--debug-containers-timeout`        | seconds                            | Timeout in seconds for debug container operations (default: `120`)          |
| `--nats-servicesCRUD-filter`        | servicesCRUD stream subject filter | NATS consumer filter subject for servicesCRUD (default: `>`)                |
| `--nats-resourceDefinitions-filter` | resourceDefinitions bucket filter  | NATS consumer filter subject for resourceDefinitions (default: `>`)         |
| `--nats-resourceStates-filter`      | resourceStates stream filter       | NATS consumer filter subject for resourceStates (default: `>`)              |

Run the script and specify the namespace. If kubectl is already configured for your target cluster, no other parameters are required:

**Example**

{% code lineNumbers="true" %}

```bash
./collect_debug.sh -n ${NAMESPACE}
```

{% endcode %}

If kubectl is not configured for the target cluster, use the `-k` or `-c` parameters to specify the kubeconfig file or context.
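
For example, to run the script against a cluster that is not your current kubectl default (the kubeconfig path and context name below are placeholders):

{% code lineNumbers="true" %}

```bash
./collect_debug.sh -n ${NAMESPACE} -k /path/to/kubeconfig -c <context-name>
```

{% endcode %}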

#### How the Debugging Script Works

The debugging script collects diagnostic information through read-only operations:

* Queries the Kubernetes API
* Executes commands in Connectware pods
* Runs `connectware-toolkit` containers using `kubectl debug`

{% hint style="warning" %}
The debugging script uses Kubernetes debug containers, which temporarily run containers on your cluster. These containers only perform read-only operations. To prevent debug containers from running, use the `--skip-debug-containers` parameter. Note that this may prevent the collection of crucial diagnostic data.
{% endhint %}

The debugging script continues execution even if individual operations fail, as it is designed to gather as much information as possible from potentially unhealthy systems. Error messages in the script's output are therefore expected and are not by themselves a cause for concern.

#### Debugging Script Output

When the debugging script completes, it creates a compressed archive in the current directory containing the collected information. Provide this archive to Cybus support when reporting issues.
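
If you want to review what is being shared before opening a ticket, you can list the archive contents without extracting them. The file name pattern below is an assumption; use the name printed by the script:

{% code lineNumbers="true" %}

```bash
# List the contents of the generated archive without extracting it
tar -tzf <archive-name>.tar.gz
```

{% endcode %}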

{% hint style="info" %}
Kubernetes only retains logs for currently running containers and their immediate predecessors. If you have logs stored in a central log aggregator or other external system, include relevant logs for the timeframe when the issue occurred.
{% endhint %}

## Troubleshooting Protocol-Mapper Agents

This section covers issues with protocol-mapper agents caused by minor configuration mistakes.

### Agent does not connect when the Connectware broker uses mTLS

**Symptoms**

* Agent log shows:

{% code lineNumbers="true" %}

```txt
VRPC agent connection to broker lost
Reconnecting to mqtts://localhost:8883
```

{% endcode %}

**Likely cause**

* mTLS is not enabled in the agent configuration.

**Resolution**

* Enable mTLS for the agent as described in [Using Mutual Transport Layer Security (mTLS) for agents with the connectware-agent Helm chart](https://docs.cybus.io/documentation/agents/agents-in-kubernetes/configuring-agents-with-the-connectware-agent-helm-chart/using-mutual-transport-layer-security-mtls-for-agents-with-the-connectware-agent-helm-chart).

### TLS connection fails before handshake

**Symptoms**

* Agent log shows:

{% code lineNumbers="true" %}

```txt
Client network socket disconnected before secure TLS connection was established
```

{% endcode %}

**Likely cause**

* The agent is connecting to the wrong MQTTS port on the broker.

**Resolution**

* Verify `mqttPort` and `mqttDataPort` in the `protocolMapperAgents` section of your Helm `values.yaml`.
* If you are not using a custom setup, these values are correct by default and can be removed.
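
To check which values your deployed release is actually using, including any `mqttPort` or `mqttDataPort` overrides, you can query Helm (assuming `INSTALLATION_NAME` holds your release name):

{% code lineNumbers="true" %}

```bash
# Show only the user-supplied values of the release
helm get values -n ${NAMESPACE} ${INSTALLATION_NAME}

# Show all computed values, including chart defaults
helm get values -n ${NAMESPACE} ${INSTALLATION_NAME} --all
```

{% endcode %}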

### Agent with mTLS enabled does not connect to broker

**Symptoms**

* Agent log shows:

{% code lineNumbers="true" %}

```txt
Failed to read certificates during mTLS setup please check the configuration
```

{% endcode %}

**Likely cause**

* Certificates are missing or invalid.

**Resolution**

* Verify certificate generation and configuration as described in [Using Mutual Transport Layer Security (mTLS) for agents with the connectware-agent Helm chart](https://docs.cybus.io/documentation/agents/agents-in-kubernetes/configuring-agents-with-the-connectware-agent-helm-chart/using-mutual-transport-layer-security-mtls-for-agents-with-the-connectware-agent-helm-chart).
* Ensure Kubernetes objects were created from files named `ca-chain.pem`, `tls.crt`, and `tls.key`. Incorrect filenames will cause the agent to fail to locate certificates.

### Agent registration fails due to certificate Common Name mismatch

**Symptoms**

* Allowing an mTLS-enabled agent in the Connectware Client Registry fails with the message `An Error has occurred - Registration failed`.
* auth-server logs show:

{% code lineNumbers="true" %}

```txt
Unable to process request: 'POST /api/client-registry/confirm', because: Certificate Common Name does not match the username. CN: someCN, username: agentName
```

{% endcode %}

**Likely cause**

* The certificate Common Name does not match the agent name.

**Resolution**

* Ensure the certificate Common Name exactly matches the agent name configured in the Helm value `name`.
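
You can print the subject of the agent certificate with openssl to compare its Common Name against the configured agent name:

{% code lineNumbers="true" %}

```bash
# Print the certificate subject; the CN field must exactly match the
# agent name configured in the Helm value "name"
openssl x509 -in tls.crt -noout -subject
```

{% endcode %}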

### Agent registration fails with connection error

**Symptoms**

* Agent log shows:

{% code lineNumbers="true" %}

```txt
Cannot register protocol-mapper agent, because: socket hang up
```

{% endcode %}

**Likely cause**

* The agent certificate was signed by the wrong Certificate Authority.

**Resolution**

* Verify the agent certificate was signed by the [Certificate Authority](https://docs.cybus.io/documentation/security/tls-certificates/ca-certificates) that is used by Connectware.
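
You can verify the signature chain locally with openssl before re-registering the agent:

{% code lineNumbers="true" %}

```bash
# Prints "tls.crt: OK" only if tls.crt was signed by a CA in ca-chain.pem
openssl verify -CAfile ca-chain.pem tls.crt
```

{% endcode %}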

### Agent registration fails with conflict error

**Symptoms**

* Agent log shows:

{% code lineNumbers="true" %}

```txt
Failed to register agent. Response: 409 Conflict. A conflicting registration might be pending, or a user with the same username <agent-name> is already existing (which you must delete first).
```

{% endcode %}

**Likely cause**

* An agent or user with the same name already exists.

**Resolution**

Every agent needs a user whose username matches the value configured in the `name` Helm value for the agent.

1. Ensure that the agent name is unique.
2. If another agent with the same name exists, do the following:
   * Delete the agent.
   * Delete the corresponding user. For more information, see [Deleting Users](https://docs.cybus.io/user-management/users#deleting-users).
3. If you created a user with the agent’s name for another purpose, choose a different name for the agent.

### Agent enters CrashLoopBackOff due to license errors

**Symptoms**

* Agent pod enters `CrashLoopBackOff`.
* Logs show authentication or license errors followed by agent shutdown.

**Example**

{% code lineNumbers="true" %}

```txt
{"level":30,"time":1670940068658,"pid":8,"hostname":"welder-robots-0","service":"protocol-mapper","msg":"Re-starting using cached credentials"}
{"level":50,"time":1670940068759,"pid":8,"hostname":"welder-robots-0","service":"protocol-mapper","msg":"Failed to query license at https://connectware/api/system/info probably due to authentication: 401 Unauthorized"}
{"level":50,"time":1670940068759,"pid":8,"hostname":"welder-robots-0","service":"protocol-mapper","msg":"No valid license file available. Protocol-mapper will stop."}
```

{% endcode %}

**Likely cause**

* Cached agent credentials are no longer valid.

**Resolution**

The agent needs to be re-registered.

1. Delete the agent.
2. Delete the corresponding user. For more information, see [Deleting Users](https://docs.cybus.io/user-management/users#deleting-users).
3. Delete the agent StatefulSet:

{% code lineNumbers="true" %}

```bash
kubectl -n ${NAMESPACE} delete sts <agent-name>
```

{% endcode %}

4. Delete the agent PersistentVolumeClaim:

{% code lineNumbers="true" %}

```bash
kubectl -n ${NAMESPACE} delete pvc protocol-mapper-<agent-name>-0
```

{% endcode %}

5. Apply the configuration changes via the `helm upgrade` command:

{% code lineNumbers="true" %}

```bash
helm upgrade -n ${NAMESPACE} ${INSTALLATION_NAME} <repository>/<chart> -f values.yaml
```

{% endcode %}

For more information, see [Applying Helm Configuration Changes](https://docs.cybus.io/documentation/connectware-helm-chart#applying-helm-configuration-changes).
