Oracle Cloud Infrastructure Documentation

Managing Cluster Networks

A cluster network is a pool of high performance computing (HPC) instances that are connected with a high-bandwidth, ultra low-latency network. Each node in the cluster is a bare metal machine located in close physical proximity to the other nodes. A remote direct memory access (RDMA) network between nodes provides latency as low as single-digit microseconds, comparable to on-premises HPC clusters.

Cluster networks are designed for highly demanding parallel computing workloads. For example:

  • Computational fluid dynamics simulations for automotive or aerospace modeling
  • Financial modeling and risk analysis
  • Biomedical simulations
  • Trajectory analysis and design for space exploration
  • Artificial intelligence and big data workloads

Cluster networks are built on top of the instance pools feature. Most operations in the instance pool are managed directly by the cluster network, though you can monitor and add tags to the underlying instance pool.

For more information about how to access and store the data that you want to process in your cluster networks, see FastConnect Overview, Overview of File Storage, Overview of Object Storage, and Overview of Block Volume.

Note

Cluster networks are not available in Government Cloud tenancies.

Warning

Avoid entering confidential information when assigning descriptions, tags, or friendly names to your cloud resources through the Oracle Cloud Infrastructure Console, API, or CLI.

Supported Regions and Availability Domains for Cluster Networks

Cluster networks are supported in the following regions:

  • Germany Central (Frankfurt)
  • Japan East (Tokyo)
  • UK South (London)
  • US East (Ashburn)

The availability domain that you create the cluster network in must have cluster network-capable hardware. Typically, to be able to create the multiple HPC instances that are contained in a cluster network, you must request a service limit increase.

Required IAM Policy

To use Oracle Cloud Infrastructure, you must be given the required type of access in a An IAM document that specifies who has what type of access to your resources. It is used in different ways: to mean an individual statement written in the policy language; to mean a collection of statements in a single, named "policy" document (which has an Oracle Cloud ID (OCID) assigned to it); and to mean the overall body of policies your organization uses to control access to resources. written by an administrator, whether you're using the Console or the REST API with an SDK, CLI, or other tool. If you try to perform an action and get a message that you don’t have permission or are unauthorized, confirm with your administrator the type of access you've been granted and which A collection of related resources that can be accessed only by certain groups that have been given permission by an administrator in your organization. you should work in.

For administrators: For a typical policy that gives access to cluster networks, see Let users manage Compute instance configurations, instance pools, and cluster networks.

Important

See this known issue for information about the policy statements that are required if the instance configuration or load balancer associated with the cluster network includes defined tags.

Tagging Resources

You can apply tags to your resources to help you organize them according to your business needs. You can apply tags at the time you create a resource, or you can update the resource later with the desired tags. For general information about applying tags, see Resource Tags.

Prerequisites

Create an instance configuration for the instance pool that is managed by the cluster network. To do this:

  1. Create an instance with the following settings:

    • Choose an operating system or image source: Click Change Image Source, and then click Oracle Images. Select the Oracle HPC cluster networking image.
    • Instance type: Select Bare Metal Machine.
    • Instance Shape: Select the BM.HPC2.36 shape. For more information about this shape, see Compute Shapes.
  2. Create an instance configuration using the instance that you created in the previous step as a template.

    Optionally, you can delete the instance after you create the instance configuration.

Using the Console

To create a cluster network
To edit the name of a cluster network
To manage tags for a cluster network
To delete a cluster network

Using the API

For information about using the API and signing requests, see REST APIs and Security Credentials. For information about SDKs, see Software Development Kits and Command Line Interface.

Use these API operations to work with cluster networks: