Skip to content

DAOS Version 2.2 Support

Community Support and Commercial Support

Community support for DAOS is available through the DAOS mailing list and the DAOS Slack channel. The DAOS community JIRA tickets can be searched for known issues and possible solutions. Community support is provided on a best effort basis without any guaranteed SLAs.

Starting with DAOS Version 2.0, the Intel DAOS engineering team can also be contracted to provide Commercial Level-3 Support for DAOS. Under such a support agreement, Intel partners that offer DAOS Commercial Support to their end customers will provide the DAOS Level-1 and Level-2 support. They can then escalate Level-2 support tickets to the Intel Level-3 support team through a dedicated JIRA path with well-defined SLAs. Please refer to the intel.com landing page for DAOS for information on the DAOS partner ecosystem.

This document describes the supported environments for Intel Level-3 support at the DAOS 2.2 release level. Information for future releases is indicative only and may change. Partner support offerings may impose further constraints, for example if they include DAOS support as part of a more general cluster support offering with its own release cycle.

Some members of the DAOS community have reported successful compilation and basic testing of DAOS in other environments (for example on ARM64 platforms, or on other Linux distributions). Those activities are highly appreciated community contributions. However such environments are not currently supported in a production environment.

Hardware platforms supported for DAOS Servers

DAOS Version 2.2 supports the x86_64 architecture.

DAOS servers require Storage Class Memory (SCM) that is supported by PMDK. (SCM can be emulated by DRAM for development and testing purposes, but this is not supported in a production environment.) DAOS Version 2.2 has been validated with Intel Optane Persistent Memory 100 Series on 2nd gen Intel Xeon Scalable processors, and with Intel Optane Persistent Memory 200 Series on 3rd gen Intel Xeon Scalable processors. For maximum performance, it is strongly recommended that all memory channels of a DAOS server are populated with one DRAM module and one Optane PMem module. All Optane PMem modules in a DAOS server must have the same capacity.

While not strictly required, DAOS servers typically include NVMe disks for bulk storage, which must be supported by SPDK. (NVMe storage can be emulated by files on non-NVMe storage for development and testing purposes, but this is not supported in a production environment.) All NVMe disks managed by a single DAOS engine must have identical capacity, and it is strongly recommended to use identical drive models. It is also strongly recommended that all DAOS engines in a DAOS system have identical NVMe storage configurations. The number of targets per DAOS engine must be identical for all DAOS engines.

DAOS Version 2.2 supports Intel Volume Management Devices (VMD) to manage the NVMe disks on the DAOS servers. Enabling VMD is platform-dependent; details are provided in the Administration Guide.

Each DAOS engine needs one high-speed network port for communication in the DAOS data plane. DAOS Version 2.2 does not support more than one high-speed network port per DAOS engine. (It is possible that two DAOS engines on a 2-socket server share a single high-speed network port for development and testing purposes, but this is not supported in a production environment.) It is strongly recommended that all DAOS engines in a DAOS system use the same model of high-speed fabric adapter. Heterogeneous adapter population across DAOS engines has not been tested, and running with such configurations may cause unexpected behavior. Please refer to "Fabric Support" below for more details.

Hardware platforms supported for DAOS Clients

DAOS Version 2.2 supports the x86_64 architecture.

DAOS clients have no specific hardware dependencies.

Each DAOS client needs a network port on the same high-speed interconnect that the DAOS servers are connected to. Multiple high-speed network ports per DAOS client are supported. Note that a single task on a DAOS client will always use a single network port, but when multiple tasks per client node are used then the DAOS agent will distribute the load by allocating different network ports to different tasks.

Operating Systems supported for DAOS Servers

The DAOS software stack is built and supported on Linux for the x86_64 architecture.

DAOS Version 2.2 is primarily validated on Rocky Linux 8.6 and openSUSE Leap 15.3. The following subsections provide details on the Linux distributions which DAOS Version 2.2 supports on DAOS servers.

Note that all DAOS servers in a DAOS server cluster (also called DAOS system) must run the same Linux distribution. DAOS clients that access a DAOS server cluster can run the same or different Linux distributions.

CentOS 7 and Red Hat Enterprise Linux 7

With DAOS Version 2.2, CentOS 7.9 and RHEL 7.9 are supported on DAOS servers with 2nd gen Intel Xeon Scalable processors (Cascade Lake).

CentOS 7.9 or RHEL 7.9 are not supported on DAOS servers with 3rd gen Intel Xeon Scalable processors (Ice Lake) or newer Intel Xeon processor generations.

Links to CentOS Linux 7 and RHEL 7 Release Notes:

CentOS Linux 7 will reach End Of Life (EOL) on June 30st, 2024. Refer to the RHEL Life Cycle description on the Red Hat support website for information on RHEL support phases.

CentOS Linux 8

CentOS Linux 8 has reached End Of Life (EOL) on December 31st, 2021. Consequently, DAOS Version 2.2 does not support CentOS Linux 8. DAOS clusters that have been running CentOS Linux 8 must be be migrated to a supported EL8 operating system in order to deploy DAOS Version 2.2.

Red Hat Enterprise Linux 8

DAOS Version 2.2 is supported on RHEL 8.4 EUS and RHEL 8.6. RHEL 8.5 is no longer supported by RedHat.

Note

Most validation of DAOS 2.2 has been done on the Rocky Linux 8.6 release. DAOS 2.2 should work on RHEL 8.4 EUS and RHEL 8.5, but fixing DAOS issues that are discovered on RHEL 8.5 may require an OS upgrade to a supported release.

Links to RHEL 8 Release Notes:

Refer to the RHEL Life Cycle description on the Red Hat support website for information on RHEL support phases.

Rocky Linux 8

DAOS Version 2.2 is supported on Rocky Linux 8.6.

Note

Most validation of DAOS 2.2 has been done on the Rocky Linux 8.6 release. DAOS 2.2 should work on Rocky Linux 8.5, but fixing DAOS issues that are discovered on Rocky Linux 8.5 may require an OS upgrade to a supported release.

Links to Rocky Linux Release Notes:

openSUSE Leap 15

DAOS Version 2.2 is supported on openSUSE Leap 15.3.

Links to openSUSE Leap 15 Release Notes:

SUSE Linux Enterprise Server 15

DAOS Version 2.2 is supported on SLES 15 SP3.

Links to SLES 15 Release Notes:

Refer to the SLES Life Cycle description on the SUSE support website for information on SLES support phases.

Unsupported Linux Distributions

DAOS does not support openSUSE Tumbleweed, Fedora, CentOS Stream, CentOS Linux, Alma Linux, Ubuntu, or Oracle Linux.

Operating Systems supported for DAOS Clients

The DAOS software stack is built and supported on Linux for the x86_64 architecture.

In DAOS Version 2.2, the supported Linux distributions and versions for DAOS clients are identical to those for DAOS servers. Please refer to the previous section for details.

In future DAOS releases, DAOS client support may be added for additional Linux distributions and/or versions.

Fabric Support

DAOS relies on libfabric for communication in the DAOS data plane.

DAOS Version 2.2 requires at least libfabric Version 1.15-1. The RPM distribution of DAOS includes libfabric RPM packages with the correct version. It is strongly recommended to use exactly the provided libfabric version on all DAOS servers and all DAOS clients.

Not all libfabric providers are supported. DAOS Version 2.2 has been validated mainly with the verbs provider for InfiniBand fabrics, and the tcp provider for other fabrics. Future DAOS releases may add support for additional libfabric providers.

Note

The psm2 provider for Omni-Path fabrics has known issues when used in a DAOS context, and is not supported for production environments. Please refer to the Cornelis Networks presentation at DUG21 for an outlook on future Omni-Path Express support for DAOS. In the meantime, please use the tcp provider on Omni-Path fabrics.

On InfiniBand fabrics with the verbs provider, DAOS requires that the Mellanox OFED (MLNX_OFED) software stack is installed on the DAOS servers and DAOS clients. DAOS Version 2.2 has been validated with MLNX_OFED Version 5.5-1 and Version 5.6-1. Versions older than 5.5-1 are not supported by DAOS 2.2.

Links to MLNX_OFED Release Notes:

It is strongly recommended that all DAOS servers and all DAOS clients run the same version of MLNX_OFED, and that the InfiniBand adapters are updated to the firmware levels that are included in that MLNX_OFED distribution. It is also strongly recommended that the same model of InfiniBand fabric adapter is used in all DAOS servers. DAOS Version 2.2 has not been tested with heterogeneous InfiniBand adapter configurations. The only exception to this recommendation is the mix of single-port and dual-port adapters of the same generation, where only one of the ports of the dual-port adapter(s) is used by DAOS.

For InfiniBand fabrics, DAOS 2.2 also includes a Technology Preview of UCX fabric support, that can be used as an alternative to libfabric. Note that UCX is not yet fully validated on DAOS 2.2, and should not be used in production environments.

Note

The MLNX_OFED 5.6-1 release is not yet supported in conjunction with UCX, due to a known issue with the UCX 1.13 version in MLNX_OFED 5.6-1.

DAOS Scaling

DAOS is a scale-out storage solution that is designed for extreme scale. This section summarizes the DAOS scaling targets, some DAOS architectural limits, and the current testing limits of DAOS Version 2.2.

Note: Scaling characteristics depend on the properties of the high-performance interconnect, and the libfaric provider that is used. The DAOS scaling targets below assume a non-blocking, RDMA-capable fabric. Most scaling tests so far have been performed on InfiniBand fabrics with the libfabric verbsprovider.

DAOS scaling targets (these are order of magnitude figures that indicate what the DAOS architecture should support - see below for the scales at which DAOS 2.2 has been validated):

  • DAOS client nodes in a DAOS system: 105 (hundreds of thousands)
  • DAOS servers in a DAOS system: 103 (thousands)
  • DAOS engines per DAOS server: 100 (less than ten)
  • DAOS targets per DAOS engine: 101 (tens)
  • SCM storage devices per DAOS engine: 101 (tens)
  • NVMe storage devices per DAOS engine: 101 (tens)
  • DAOS pools in a DAOS system: 102 (hundreds)
  • DAOS containers in a DAOS pool: 102 (hundreds)
  • DAOS objects in a DAOS container: 1010 (tens of billions)
  • Application tasks accessing a DAOS container: 106 (millions)

Note that DAOS has an architectural limit of 216=65536 storage targets in a DAOS system, because the number of storage targets is encoded in 16 of the 32 "DAOS internal bits" within the 128-bit DAOS Object ID.

DAOS Version 2.2 has been validated at the following scales:

  • DAOS client nodes in a DAOS system: 256
  • DAOS servers in a DAOS system: 128
  • DAOS engines per DAOS server: 1, 2 and 4
  • DAOS targets per DAOS engine: 4-16
  • SCM storage devices per DAOS engine: 6 (Optane PMem 100), 8 (Optane PMem 200)
  • NVMe storage devices per DAOS engine: 0 (PMem-only pools), 4-12
  • DAOS pools in a DAOS system: 100
  • DAOS containers in a DAOS pool: 100
  • DAOS objects in a DAOS container: 6 billion (in mdtest benchmarks)
  • Application tasks accessing a DAOS container: 3072 (using verbs)

This test coverage will be expanded in subsequent DAOS releases.

Back to top