NVIDIA GeForce Ampere Architecture, Board Design, Gaming Tech & Software

on Sep 4th, 2020,

Manufacturer: NVIDIA

The new Ampere RT Core and Tensor Core

With Ampere, NVIDIA introduces its 2nd generation RT core that aims to improve raytracing acceleration, as well as new effects, such as raytraced motion blur. An RT core is a fixed-function hardware component that handles two of the most challenging tasks for SIMD programmable shaders, bounding volume hierarchy (BVH) traversal and intersection; i.e., calculating the exact point where a ray collides with a surface, so its next course can be charted. Typical raytracing workloads in a raster+raytracing hybrid rendering path involve calculating steps of traversal and intersection across the BVH and bounding-box/triangle intersections, which is a very unsuitable workload for typical GPUs because of the nature of memory accesses involved. This kind of pointer chasing doesn't scale well with SIMD architectures (read: programmable shaders) and is better suited to special fixed-function hardware, like the MIMD RT cores.

Without taking names, NVIDIA pointed out that a minimalist approach toward raytracing (possibly what AMD is up to with RDNA2) has a performance impact due to overreliance on SIMD stream processors. NVIDIA's RT cores offer a completely hardware-based BVH traversal stack, a purpose-built MIMD execution unit, and inherently lower latency from the hardware stack. The 2nd generation RT core being introduced with Ampere adds one more hardware component.

Ampere introduces a new logic block that interpolates triangle positions along a time scale, in coordination with the triangle intersection unit. NVIDIA tells us that this is useful in generating motion blur effects in real-time raytracing. Our take on this is that NVIDIA is, rather, implementing this as performance optimization for raytracing. As very little will likely change in two frames, there is no need to recalculate all the results for the following frame after all the ray intersections for the current frame have been calculated—the player moved or changed the camera, and objects in the world are positioned only ever so slightly differently. We suspect NVIDIA paired a motion-estimation algorithm with RTX that remembers the last intersections as "good candidates" and checks them early on in the whole process, which can lead to a valid result early in the test and means many entries in the BVH don't have to be processed at all.

3rd Generation Tensor Cores

The new 3rd generation tensor core is largely carried over from the A100 Tensor Core processor NVIDIA introduced this spring, which is purpose-built for AI deep-learning work. To improve performance, Ampere tensor cores are designed to leverage sparsity in deep learning neural nets. Sparsity is a phenomenon where a dense matrix can be trimmed without affecting its accuracy—kind of like how the goal in Jenga is to keep the column intact despite pulling out pieces from the middle. Sparse matrices increase AI inference performance by an order of magnitude.

May 1st, 2024 16:03 EDT change timezone

Latest GPU Drivers

New Forum Posts

16:03 by maxfly
Arctic MX-6 shelf life is just a couple months? (58)
16:02 by Toothless
7900 XTX Seriously lacking (95)
16:00 by purecain
Need HELP, pic attached new build Gigabyte x670 Aorus Extreme new motherboard w new 7950x3d- power up STUCK on CPU LED RED,POST LEDS BLANK, no CODE (12)
15:55 by purecain
Looking for recommendations to upgrade the GPU (36)
15:50 by TechKilledMe
Old high quality PSU, or semi-old mid-quality PSU? (29)
15:48 by tabascosauz
Alphacool CORE 1 CPU block - bulging with danger of splitting? (61)
14:45 by mrcardio33x
PYPrime 2.x free Memory benchmark. let's see those daily Memory OC performances. (41)
14:26 by neatfeatguy
Brother bought a house, found some old PC hardware.. (21)
14:25 by five
problem with my 7900xtx (24)
14:18 by mayhemmodz
CYBERPUNK 2077 O.F. (6)

Popular Reviews

Apr 26th, 2024 Ugreen NASync DXP4800 Plus Review
Apr 29th, 2024 Team Group T-Force Vulcan ECO DDR5-6000 32 GB CL38 Review
Apr 25th, 2024 HYTE THICC Q60 240 mm AIO Review
Feb 12th, 2024 Upcoming Hardware Launches 2023 (Updated Feb 2024)
Apr 30th, 2024 Montech Sky Two GX Review
Apr 22nd, 2024 MOONDROP x Crinacle DUSK In-Ear Monitors Review - The Last 5%
Apr 17th, 2024 Thermalright Phantom Spirit 120 EVO Review
Apr 5th, 2023 AMD Ryzen 7 7800X3D Review - The Best Gaming CPU
Apr 12th, 2024 ASUS Radeon RX 7900 GRE TUF OC Review
Apr 18th, 2024 FiiO K19 Desktop DAC/Headphone Amplifier Review

The new Ampere RT Core and Tensor Core

3rd Generation Tensor Cores

Latest GPU Drivers

New Forum Posts

Popular Reviews

Controversial News Posts