Nvidia

Nvidia Reveals Ada Lovelace GPU Secrets and techniques: Excessive Transistor Counts at Excessive Clocks

Posted on


When Nvidia launched its Ada Lovelace household of graphics processing models earlier this week, it primarily centered on its top-of-the-range AD102 GPU and its flagship GeForce RTX 4090 graphics card. It did not launch too many particulars about its AD103 and AD104 graphics chips. Fortuitously, Nvidia uploaded its Ada Lovelace whitepaper as we speak that comprises a great deal of information in regards to the new GPUs and fills in lots of gaps. We have up to date the RTX 40-series GPUs the whole lot we all know hub with the brand new particulars, however here is the overview of the brand new and attention-grabbing info.

Large GPUs for Large Gaming 

We already know that Nvidia’s range-topping AD102 is a 608-mm^2 GPU containing 76.3 billion transistors, 18,432 CUDA cores, and 96MB of L2 cache. We now additionally know that AD103 is a 378.6 mm^2 graphics processor that includes 45.9 billion transistors, 10,240 CUDA cores, and 64MB L2 cache. As for the AD104, it has a die dimension of 294.5 mm^2, 35.8 billion transistors, 7680 CUDA cores, and 48MB of L2.

Nvidia Ada Specs vs. Ampere
GPU/Graphics Card Full AD102 RTX 4090 RTX 4080 16GB RTX 4080 12GB RTX 3090 Ti
Structure AD102 AD102 AD103 AD104 GA102
Course of Expertise TSMC 4N TSMC 4N TSMC 4N TSMC 4N Samsung 8LPP
Transistors (Billion) 76.3 76.3 45.9 35.8 28.3
Die dimension (mm^2) 608 608 378.6 294.5 628.4
Streaming Multiprocessors 144 128 76 60 84
GPU Cores (Shaders) 18432 16384 9728 7680 10752
Tensor Cores 576 512 320 240 336
Ray Tracing Cores 144 144 80 60 84
TMUs 512 512 304? 240 336
ROPs 192 192 112 80 112
L2 Cache (MB) 96 96 64 48 6
Increase Clock (MHz) ? 2520 2505 2600 1860
TFLOPS FP32 (Increase) ? 82.6 48.7 40.1 40.0
TFLOPS FP16 (FP8) ? 661 (1321) 390 (780) 319 (639) 320 (N/A)
TFLOPS Ray Tracing ? 191 113 82 78.1
Reminiscence Interface (bit) 384 384 256 192 384
Reminiscence Velocity (GT/s) ? 21 22.4 21 21
Bandwidth (GBps) ? 1008 736 504 1008
TDP (watts) ? 450 320 285 450
Launch Date ? Oct 12, 2022 Nov 2022? Nov 2022? Mar 2022
Launch Worth ? $1,599 $1,199 $899 $1,999

One of many attention-grabbing issues that Nvidia tells in its whitepaper is that Ada Lovelace GPUs use high-speed transistors in essential paths to spice up most clock speeds. In consequence, its fully-enabled AD102 GPU with 18,432 CUDA cores is ”able to working at clocks over 2.5 GHz, whereas sustaining the identical 450W TGP.” Preserving this in thoughts, we’re not shocked that the corporate is speaking about 3.0 GHz clocks for the GeForce RTX 4090 (with 16,384 CUDA cores) reached in its labs. At 3.0 GHz, the GeForce RTX 4090 will completely headline our checklist of the finest graphics playing cards round. 

(Picture credit score: Nvidia)

Along with excessive clocks, Nvidia’s Ada Lovelace GPU additionally boast huge L2 caches that enhance efficiency in compute intensive workloads (e.g., ray tracing, path tracing, simulations, and many others.) and reduces reminiscence bandwidth necessities. Basically, Nvidia’s Ada GPUs take a web page from RDNA 2 Infinity Cache’s ebook right here, though we imagine that basic targets for the brand new structure had been set nicely earlier than AMD’s Radeon RX 6000-series merchandise debuted in 2020. 



Supply hyperlink

Leave a Reply

Your email address will not be published.