Oracle Cloud Infrastructure Documentation

Monitoring Clusters

Having created a cluster, you can monitor the status of the cluster itself, and the nodes and node pools within it.

Using the Console

To monitor a Kubernetes cluster:

  1. In the Console, open the navigation menu. Under Solutions, Platform and Edge, go to Developer Services and click Container Clusters.
  2. Choose a Compartment you have permission to work in.

    The Status column on the Cluster List page shows a summary status for each individual cluster and its master nodes. Clusters can have one of the following statuses:

    Cluster Status | Explanation | Possible Reason
    Creating | Cluster is in the process of being created. | Application is being deployed.
    Active | Cluster is running normally. | Master nodes are running normally.
    Failed | Cluster is not running due to an unrecoverable error. | Possible reasons: a problem setting up load balancers; an error installing cluster add-ons (Tiller, Kubernetes dashboard); conflicts in networking ranges.
    Deleting | Cluster is in the process of being deleted. | Application no longer required, so resources in the process of being released.
    Deleted | Cluster has been deleted. | Application no longer required, so resources have been released.
    Updating | Version of Kubernetes on the master nodes is in the process of being upgraded. | A newly supported version of Kubernetes has become available.

    Note that the cluster's summary status is not necessarily directly related to the status of node pools and nodes within the cluster.

  3. On the Cluster List page, click the name of the cluster for which you want to see detailed status.

    The Cluster Details tab shows the summary status for the cluster and its master nodes.

  4. Use the Node Pools tab to see the status of individual nodes within each node pool.

    Nodes can have one of the following statuses:

    Node Status | Explanation | Possible Reason
    Creating | Node is being created. | Compute instance in the process of being created.
    Active | Node is running normally. | Node is running normally.
    Updating | Node is in the process of being updated. | Container Engine for Kubernetes is performing an operation on the node.
    Deleting | Node is in the process of being deleted. | Application no longer required, so resources in the process of being released.
    Deleted | Node has been deleted. | Application no longer required, so resources have been released.
    Inactive | Node still exists, but is not running. | Compute resource has a status of Stopped, Stopping, or Down For Maintenance.
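
When reading many statuses at once, it can help to group them into coarse categories. The following is a minimal sketch that uses the Console status names from the two tables above. The three categories and the function name classify_status are hypothetical and are not part of Container Engine for Kubernetes.

```python
# Console status names from the cluster and node status tables.
# The grouping below is an illustrative assumption, not an official taxonomy.
IN_PROGRESS = {"Creating", "Updating", "Deleting"}
SETTLED = {"Active", "Deleted"}
NEEDS_ATTENTION = {"Failed", "Inactive"}


def classify_status(status):
    """Return a coarse category for a cluster or node status string."""
    if status in IN_PROGRESS:
        return "in progress"
    if status in SETTLED:
        return "settled"
    if status in NEEDS_ATTENTION:
        return "needs attention"
    # Any status not listed in the tables falls through here.
    return "unknown"
```

Because a cluster's summary status is not necessarily related to the status of its nodes, a helper like this would be applied to the cluster status and to each node status separately.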

Using the API

For information about using the API and signing requests, see REST APIs and Security Credentials. For information about SDKs, see Software Development Kits and Command Line Interface.

Use the GetCluster and GetNodePool operations to monitor the status of Kubernetes clusters.
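
As an illustration, the two operations might be called through the OCI Python SDK roughly as follows. This is a sketch under stated assumptions rather than a definitive implementation: it assumes the SDK's ContainerEngineClient with its get_cluster and get_node_pool methods, a default profile in ~/.oci/config, and that each returned resource exposes a lifecycle_state attribute. The helper name summarize_status and the OCIDs shown are hypothetical placeholders.

```python
from collections import Counter


def summarize_status(client, cluster_id, node_pool_id):
    """Return the cluster state and a per-state node count for one node pool.

    `client` is assumed to behave like oci.container_engine.ContainerEngineClient:
    get_cluster() and get_node_pool() return a response whose `.data`
    attribute holds the resource.
    """
    cluster = client.get_cluster(cluster_id).data
    node_pool = client.get_node_pool(node_pool_id).data
    # Treat a missing node list as empty (for example, while the pool is new).
    nodes = node_pool.nodes or []
    node_states = Counter(node.lifecycle_state for node in nodes)
    return {"cluster": cluster.lifecycle_state, "nodes": dict(node_states)}


def main():
    # Imported here so summarize_status can be used without the SDK installed.
    import oci

    config = oci.config.from_file()  # assumes the default profile in ~/.oci/config
    client = oci.container_engine.ContainerEngineClient(config)
    # Hypothetical placeholder OCIDs; substitute your own.
    result = summarize_status(
        client,
        "ocid1.cluster.oc1..example",
        "ocid1.nodepool.oc1..example",
    )
    print(result)
```

Calling main() prints the cluster state alongside a count of nodes in each state. The two values are reported separately because the cluster's summary status does not necessarily reflect the status of its nodes.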