AI Infrastructure Fleet Management Made Easy with COSMOS 

AI infrastructure demands large server deployments in hyperscale data centers to meet requirements for compute power, efficiency, and total cost of ownership (TCO). However, managing such a large fleet of systems presents complex challenges of observability, data collection, and fault isolation.

COnnectivity System Management and Optimization Software – or COSMOS – is a key component of Astera Labs’ Intelligent Connectivity Platform. It enables management and optimization of resources for large fleets at cloud scale through link, fleet, and RAS management capabilities.

COSMOS Monitors and Manages a System or Fleet

COSMOS Highlights

  • Runs on on-chip microcontrollers and on baseboard/system management controllers
  • Provides link management, fleet management and reliability/availability/serviceability (RAS) features across the entire product portfolio
  • Delivers enhanced diagnostics and telemetry features with in-band and out-of-band management
  • Monitors the state and health of a system’s critical data links through metrics such as link rate, link stability, error rate, receiver margins, and more
  • Offers enhanced security features for secure device updates and configuration
  • Available as a C SDK, a Python SDK, and Reference Applications (or Platform Utilities)
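
To make the link-health bullet above concrete, here is a minimal Python sketch of how a fleet tool might triage the metrics it names (link rate, link stability, error rate, receiver margin). It does not use the COSMOS SDK: the LinkSample and Thresholds types, the field names, and the threshold values are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LinkSample:
    """One telemetry snapshot for a single link (hypothetical schema)."""
    link_id: str
    rate_gtps: float            # negotiated link rate, GT/s
    expected_rate_gtps: float   # rate the link is designed to run at, GT/s
    retrains_last_hour: int     # stability proxy: link retrain count
    bit_error_rate: float       # observed bit error rate
    rx_margin_pct: float        # receiver margin as a percentage


@dataclass(frozen=True)
class Thresholds:
    """Illustrative limits; real values depend on the platform."""
    max_retrains_per_hour: int = 2
    max_bit_error_rate: float = 1e-12
    min_rx_margin_pct: float = 10.0


def find_issues(sample, limits=Thresholds()):
    """Return the list of health issues detected for one link."""
    issues = []
    if sample.rate_gtps < sample.expected_rate_gtps:
        issues.append("downtrained")
    if sample.retrains_last_hour > limits.max_retrains_per_hour:
        issues.append("unstable")
    if sample.bit_error_rate > limits.max_bit_error_rate:
        issues.append("high_error_rate")
    if sample.rx_margin_pct < limits.min_rx_margin_pct:
        issues.append("low_rx_margin")
    return issues


def triage_fleet(samples, limits=Thresholds()):
    """Map link_id -> issues, listing only links that need attention."""
    report = {}
    for sample in samples:
        issues = find_issues(sample, limits)
        if issues:
            report[sample.link_id] = issues
    return report


if __name__ == "__main__":
    fleet = [
        LinkSample("gpu0-sw0", 64.0, 64.0, 0, 1e-15, 25.0),
        LinkSample("gpu1-sw0", 32.0, 64.0, 5, 1e-9, 4.0),
    ]
    for link_id, issues in triage_fleet(fleet).items():
        print(link_id, ", ".join(issues))
```

In this sketch the healthy link is omitted from the report and the degraded link is flagged on all four metrics, which is the kind of fault isolation a fleet operator would want before dispatching a repair.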

Request the White Paper

Cloud Infrastructure Fleet Management Made Easy with COSMOS

Establishing an optimized scale-up ecosystem with UALink: A fireside chat with Astera Labs

As AI models continue to grow, data centers are requiring increasing amounts of compute and memory to efficiently execute training and inferencing. The UALink Consortium represents a dynamic group of industry leaders united in our goal to foster innovation and establish an open, interoperable standard for high-performance computing connections in scale-up AI environments. The UALink…

Combating Noisy Neighbors with Scorpio P-Series Fabric Switches

AI server designs are being impacted by an issue that worsens as GPUs scale to meet the demands of AI workloads. The issue: noisy neighbors! Scorpio P-Series Fabric Switches – the industry’s first PCIe 6 fabric switch – are architected for mixed-traffic AI head-node connectivity (GPU-to-CPU/NIC/SSD). Let’s take a closer look at the problem of noisy neighbors…

Advancing CXL with Interoperable Solutions

In AI and cloud infrastructure, seamless connectivity and scalable memory expansion solutions are key to unlocking performance at scale. As AI workloads grow more demanding, the need for robust, standards-based and interoperable solutions is greater than ever. The CXL Consortium Compliance Program was established to validate end-products against the CXL Specification, ensuring they meet the…

Astera Labs Announces Conference Call to Review First Quarter 2025 Financial Results

SANTA CLARA, CA, U.S. – April 10, 2025 – Astera Labs, Inc. (Nasdaq: ALAB), a global leader in semiconductor-based connectivity solutions for AI and cloud infrastructure, today announced that it will release its financial results for the first quarter 2025 after the close of market on Tuesday, May 6, 2025. Astera Labs will host a corresponding conference call at 1:30 p.m. Pacific…

Astera Labs Optimizes Connectivity for NVIDIA Blackwell-Based MGX Platforms at Scale

Seamless Scorpio Smart Fabric Switch integration with NVIDIA MGX™ platform delivers PCIe® 6-ready modular designs for rapid deployment across a range of AI servers

SANTA CLARA, CA, U.S. – March 18, 2025 – Astera Labs, Inc. (Nasdaq: ALAB), a global leader in semiconductor-based connectivity solutions for AI and cloud infrastructure, today announced its Scorpio Smart Fabric Switches…

Astera Labs Expands Cloud-Scale Interop Lab Leadership to Propel Next-Gen PCIe 6.x Ecosystem

Comprehensive Cloud-Scale Interop Lab testing of Scorpio Smart Fabric Switches advances PCIe 6.x ecosystem enablement, fast-tracking customer platform designs, development, and time-to-market

SANTA CLARA, CA, U.S. – March 13, 2025 – Astera Labs, Inc. (Nasdaq: ALAB), a global leader in semiconductor-based connectivity solutions for AI and cloud infrastructure, today announced the expansion…

Astera Labs Appoints Dr. Craig Barratt to Board of Directors

Dr. Barratt brings leadership experience in scaling high-growth public and private technology companies

SANTA CLARA, CA, U.S. – March 3, 2025 – Astera Labs, Inc. (Nasdaq: ALAB), a global leader in semiconductor-based connectivity solutions for AI and cloud infrastructure, today announced the appointment of Dr. Craig Barratt to its Board of Directors. Dr. Barratt is a seasoned technology…

NVIDIA GTC 2025: End-to-End PCIe 6 Interop

At NVIDIA GTC 2025, Astera Labs showcased PCIe 6 end-to-end connectivity with our Scorpio P-Series Fabric Switch interconnected to Micron’s PCIe 6 SSD, an NVIDIA Blackwell GPU through our Aries 6 Smart Retimer, and an NVIDIA CX7 NIC, with an Intel CPU connected on the upstream side. The demonstration shows how our Scorpio P-Series Fabric Switch can support mixed-mode traffic, while…

NVIDIA GTC 2025: Hyperscaler-Inspired Scale-Out Rack Architecture

At NVIDIA GTC 2025, Astera Labs showcased how hyperscalers can gain smart connectivity and optionality for NVIDIA rack-scale Blackwell deployments. We showcased a modular architecture with a Scorpio IO Switch Board with two NICs connected via the Aries Retimer Mezzanine Board to the GB200. In this setup, we show how the third-party NICs can provide maximum bandwidth to saturate the Blackwell…

NVIDIA GTC 2025: Scorpio Switch Board for the MGX Platform

See our latest collaboration with NVIDIA and Wistron: a Scorpio Switch Board that integrates PCIe 6-ready Scorpio P-Series Fabric Switches with the NVIDIA MGX Production-Grade Reference Design. The benefits are clear:

  • Scalable design for IO expansion
  • Predictable performance with no noisy neighbors
  • Maximum performance with PCIe 6 support
  • Easy setup and diagnostics with COSMOS
  • Multi-generational…
