

These stream multiprocessors are the main engine of Nvidia’s GPUs, responsible for some serious number crunching, and they claim the Ampere SM has double the capacity of the Turing SM. Nvidia’s engineers concentrated much of their effort into improving the throughput of the SM units for Ampere. The Ampere Streaming Multiprocessor Ampere Stream Multiprocessor All RTX 30XX GPUs feature a PCI-Express 4.0 x16 host interface, but only the RTX 3090 has the updated NVLink multi-GPU interface. The processing power is spread across seven GPCs (Graphics Processing Clusters) with a shared L2 cache and a Giga Thread Engine to coordinate the work between GPCs while utilizing the full 384-bit wide memory interface connecting a massive 24 GB of GDDR6X to the processing cores. This value is more than double the 4,352 found in the RTX 2080 Ti. The RTX 3090 has a total of 82 SMs enabled with each SM sporting 128 CUDA cores for a total of 10,490.

Taking a closer look at the Nvidia RTX 3090, we find it uses the full Ampere GA102 die pictured above minus two SM (stream multiprocessors), which helps to increase yields. It does appear as if Nvidia was going for muscle and not power savings with the RTX 3090, judging by the TDP, which is up to 350 W compared to 280 W for the RTX 2080 Ti. This node shrink allows for a much higher transistor density with Ampere sporting nearly twice as many per square millimeter for a whopping 28 billion in the full GA102 die which measures 628 mm². According to Nvidia, this leverages 1.9 times the performance per watt compared to the previous generation Turing architecture. With Ampere, we see a new silicon fabrication process moving away from TSMC’s 12 nm to Samsung’s 8N (eight nanometer) node.

The table below is based on the Nvidia Founders Edition GPUs. Let’s dive in and see for ourselves what Nvidia’s new flagship GPU is capable of: Specifications and Features This should have been done with an RTX 3080 review, but due to some unfortunate events (by a card partner), we missed out on a launch day sample, so we’re including that information here. Per usual, we take that value with a grain of salt because, well, marketing.īefore we get to the performance metrics, we’ll go over some key features of the Ampere architecture and how it differs from the last generation. Nvidia claims the RTX 3090 is 1.5X faster than the Titan RTX Turing-based GPU, and if true, this is a substantial generational leap in performance. It’s been two years since Nvidia released a new architecture (not that they needed to), so we’re excited to see what Ampere brings to the table. Today we get our first taste of Nvidia’s flagship RTX 3090 GPU, and thanks to MSI, we’ll be testing their RTX 3090 Gaming X Trio.
