Follow these instructions to generate a support bundle with data collected for the last 48 hours of the life of the cluster.
Prerequisites
Before generating a support bundle, verify that you have:
- An AMD64-based Linux or macOS machine with a supported version of the operating system.
- A running Kubernetes cluster.
dkp-diagnosecommand for macOS or Linux for collecting the support bundle.
Download dkp-diagnose
-
To download and extract the
dkp-diagnosebinary for macOS or LinuxFor Linux:
mkdir dkp-diagnose && curl -sL https://downloads.d2iq.com/dkp/dkp-diagnose_v0.4.1_linux_amd64.tar.gz | tar -xz -C ./dkp-diagnose/For macOS:
mkdir dkp-diagnose && curl -sL https://downloads.d2iq.com/dkp/dkp-diagnose_v0.4.1_darwin_amd64.tar.gz | tar -xz -C ./dkp-diagnose/ -
Add the binary to your PATH:
export PATH=./dkp-diagnose/:$PATH -
Verify the binary works:
dkp-diagnose version
Create a diagnostic bundle
dkp-diagnose was developed by D2iQ and builds on the open source troubleshoot.sh project.
dkp-diagnose supports multiple support bundle collectors and
can be configured as a SupportBundle Kubernetes resource in a yaml file.
The following list is the minimum set of resources that is required to debug a cluster, but can be further customized.
The bundle uses the following collectors:
- clusterInfo collects basic information about the cluster
- clusterResources collects a subset of available resources in the cluster
- configMap collects the values of Kubernetes ConfigMaps
- secrets collects the values of Kubernetes ConfigMaps
- execCopyFromHost runs a container on each node on the cluster and copies the created data
- allLogs is capable of collecting logs from all containers on the cluster
Generate a Support Bundle
To generate the support bundle, perform the following steps:
-
Run the
dkp-diagnosecommand by running the default collectors configuration.dkp-diagnoseCollecting support bundle ... support-bundle-2021-08-13T14_44_23.tar.gz -
To view the bundle contents, extract the bundle (replacing
support-bundle-2021-08-13T14_44_23.tar.gzwith the location from the previous step):tar -xzvf support-bundle-2021-08-13T14_44_23.tar.gz -
A new directory named
support-bundle-<date-created>is created. This directory contains the files specified:ls support-bundle-2021-08-13T14_44_23cluster-info cluster-resources configmaps node-diagnostics pod-logs secrets version.yaml
Collect information from a bootstrap cluster
In the case where your bootstrap cluster has not yet pivoted towards your Konvoy cluster, you can collect log information from that bootstrap cluster as well, and there are a preconfigured set of relevant collectors. Specify an additional bootstrap cluster kubeconfig using the --bootstrap-kubeconfig parameter to activate bootstrap cluster diagnostics. You will receive an additional support bundle named bootstrap-support-bundle-<date created>.
Note that the bootstrap cluster diagnostics are independent of the configuration of the “main” or Konvoy cluster diagnostics. We run a static collector set that collects the following bootstrap cluster information:
- ClusterInfo
- ClusterResources
- AllLogs
- ConfigMaps
- Secrets
- Run the
dkp-diagnosecommand with bootstrap bundle configuration.
dkp-diagnose bundle.yaml
Customizations
To print the default collectors configuration, run the following command:
dkp-diagnose default-config > bundle.yaml
Edit the file to make appropriate modifications.
SSH fallback
In some cases the Kubernetes API is not available for the cluster. In those cases you can collect node level information using SSH access to the diagnosed nodes. Be aware that not all clusters have SSH access configured. If they do not then access using SSH fallback is not possible.
To get node level information from your cluster using SSH access, perform the following steps:
- Enter the following command:
dkp-diagnose ssh <path/to/ansible-inventory.yaml>
The ansible-inventory.yaml file specifies the nodes to access for data collection.
Only the following attributes of the ansible-inventory.yaml are supported. All other group definitions are ignored.
-
Support for
allshared variables. -
Support for
hostskey inallgroups. -
Supported behavioral inventory is limited to:
ansible_hostansible_portansible_useransible_ssh_private_key_file
The following is an example
inventory.yamlfile:
all:
vars:
ansible_user: centos
hosts:
host-1:
ansible_host: 192.168.10.1
host-2:
ansible_host: 192.168.10.22
ansible_port: 2222
More information on these Ansible parameters can be found in the Ansible user guide.
Refer to the following example file:
all:
vars:
ansible_user: centos
hosts:
host-1:
ansible_host: 192.168.10.1
host-2:
ansible_host: 192.168.10.22
ansible_port: 2222
The fallback collector runs a bash script over SSH and copies the collected data. The format of the created bundle matches that of dkp-diagnose collector generated bundles.
node-diagnostics/<HOSTNAME_PORT>/data/
- dmesg
- ....
Redactors are supported and are in the same format as the main dkp-diagnose command. Per node collection timeouts are supported using the --timeout parameter.
Konvoy Documentation