Last month, Nvidia gave us a look at Rubin-based servers, and the 6 kinds of chips inside it:
It’s a server rack modular design, with Nvida-made boards containing the GPU and CPUs, plus networking boards are part of a Compute Tray (looks like a 1U but might be a 2U height tray) where the boards are in modules that snap into place. This was done to elminate cabling, hoses, and fans.
In between those compute trays, there are trays for the NVLink (Scale-Up) connectivity, and while separate, I believe those are required and can’t be replaced by third party options.
Finally, there is the external connectivity (Scale-Out), which in Nvidia’s reference design, uses their Spectrum-X photonix ethernet switch. This could be replaced in shipping units with something from Arista or Broadcom.
CNBC has an Nvidia advertisement disguised as a news video, but it’s worth watching just for more details as well as more looks at the components:
It lists the 80 third-parties that make components in Nvidia’s server reference design. Astera, for instance, is not listed.
The latest Rubins are liquid cooled, with components from Delta Electronics and Vertiv.
There’s also a peek near the end of the video at the upcoming Kyber racks, which reduce cabling even more.
Now, while Nvidia doesn’t sell complete racks, they do sell the boards and snap-in modules, and probably the trays. Third party companies like Dell, SMCI, HPE, etc. are the companies you deal with in terms of laying out racks and equipping them to your specifications, which probably most notably includes networking out.
I’m wondering how much of Astera’s offerings are in Rubin-based servers being delivered. Astera is a member of the NVLink Fusion ecosystem, and so could be making components for the internal GPU to GPU networking, but it seems to me that Nvidia is optimizing their boards to reduce retimer use.
And so, for Astera anyway, I think more and more the future for them is with non-Nvidia installations, such as Anthropic using Amazon’s Trainium chips, and, of course, AMD installations using UALink.
I don’t really have any conclusions to share, but this highly technical space is interesting and it’s getting harder to predict the winners and losers.
EDIT: Bad timing on my part: Semi-Analysis just came out with this detailed article with even more details: