CEO Jensen Hiang made a string of bulletins throughout his Computex keynote, together with particulars in regards to the firm’s subsequent DGX supercomputer. Given the place the trade is clearly heading, it shouldn’t come as a shock that the DGX GH200 is basically about serving to corporations develop fashions.
The supercomputer makes use of a brand new NVLink Change System to allow 256 GH200 Grace Hopper superchips to behave as a single GPU (every of the chips has an Arm-based Grace CPU and an H100 Tensor Core GPU). This, in keeping with NVIDIA, permits the DGX GH200 to ship 1 exaflop of efficiency and to have 144 terabytes of shared reminiscence. The corporate says that is practically 500 occasions as a lot reminiscence as you’d discover in a single DGX A100 system.
For comparability, the of the Top500 supercomputers lists as the one recognized exascale system, having reached a efficiency of practically 1.2 exaflops on the Linmark benchmark. That is over twice the height efficiency of the second-placed system, Japan’s .
In impact, NVIDIA claims to have developed a supercomputer that may stand alongside essentially the most highly effective recognized system on the planet (Meta is constructing one that it claims would be the quickest AI supercomputer on this planet as soon as it’s absolutely constructed out). NVIDIA says the structure of the DGX GH200 affords 10 occasions extra bandwidth than the earlier technology, “delivering the ability of an enormous AI supercomputer with the simplicity of programming a single GPU.”
Some massive names have an interest within the DGX GH200. Google Cloud, Meta and Microsoft ought to be among the many first corporations to achieve entry to the supercomputer to check the way it can deal with generative AI workloads. NVIDIA says DGX GH200 supercomputers ought to be out there by the tip of 2023.
The corporate can be constructing its personal supercomputer, Helios, that mixes 4 DGX GH200 methods. NVIDIA expects Helios to be on-line by the tip of the yr.
Huang mentioned different generative AI developments throughout his keynote, together with one on the gaming entrance. NVIDIA Avatar Cloud Engine (ACE) for Video games is a service builders will be capable of faucet into with a view to create customized AI fashions for speech, dialog and animation. NVIDIA says ACE for Video games can “give non-playable characters conversational expertise to allow them to reply to questions with lifelike personalities that evolve.”