
Chat with NVIDIA RTX Tech Demo Review


Introduction


NVIDIA today released the first public demo of Chat with RTX. No, you can't talk to your graphics card and ask how it's doing; for that you'll need TechPowerUp GPU-Z. Chat with RTX is something else: imagine a fully local AI chat that runs entirely on your PC, accelerated by the highly capable cores in your GeForce RTX graphics card, and that sends none of your queries to a cloud-based chat server. That's Chat with RTX. NVIDIA is developing it as a ChatGPT alternative that stores all of its knowledge locally on your PC and uses a GeForce RTX GPU for a brain.

While 2024 promises to be the "year of the AI PC," as industry leaders Microsoft and Intel would have you believe, NVIDIA has had an incredible six-year head-start with AI acceleration. The company introduced on-device AI acceleration alongside its RTX real-time ray tracing technology: the GeForce RTX 20-series GPUs it launched in 2018 came equipped with Tensor cores, components that dramatically speed up deep neural network (DNN) inference and training compared to using CUDA cores alone. Besides the ray tracing denoiser, NVIDIA leverages this AI acceleration to drive its DLSS performance enhancement feature. Can't max out a game? Simply enable DLSS and pick one of its presets until the game is playable at the settings you choose.



In our recent interactions with NVIDIA, the company made it clear that it isn't too impressed with the newest processors from Intel and AMD, which introduce NPUs (neural processing units) with performance figures around the 10-16 TOPS mark for the NPU itself, and no more than 40 TOPS for the whole chip (NPU + CPU + iGPU). NVIDIA GeForce RTX GPUs, with their Tensor cores, in contrast tend to offer anywhere between 20x and 100x (!) this performance, thanks to the sheer scale at which NVIDIA has deployed AI acceleration on its GPU silicon.
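To put that claim in perspective, here is the implied range as simple back-of-the-envelope arithmetic, using only the figures quoted in this article (these are NVIDIA's characterizations, not official spec-sheet numbers):

```python
# Back-of-the-envelope math on the TOPS figures quoted above.
# All inputs come from the article's own numbers, not spec sheets.
AI_PC_TOTAL_TOPS = 40                      # NPU + CPU + iGPU, whole chip
RTX_MULT_LOW, RTX_MULT_HIGH = 20, 100      # NVIDIA's claimed multiplier range

rtx_low = AI_PC_TOTAL_TOPS * RTX_MULT_LOW    # lower bound of implied range
rtx_high = AI_PC_TOTAL_TOPS * RTX_MULT_HIGH  # upper bound of implied range
print(f"Implied GeForce RTX range: {rtx_low}-{rtx_high} TOPS")  # 800-4000 TOPS
```

In other words, even the low end of NVIDIA's claimed multiplier puts a GeForce RTX GPU an order of magnitude ahead of a whole first-generation "AI PC" chip.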

While CPU-based NPUs are intended to drive simple text-based and light image-based generative AI tasks, NVIDIA is incorporating AI at a different level even today: think of generating every alternate frame in DLSS 3 Frame Generation, or denoising a 4K in-game scene at 60+ FPS. Put simply, GeForce RTX GPUs have enormous AI acceleration resources that sit dormant when you're not gaming, and NVIDIA has taken it upon itself to show gamers they can run fully local generative AI tools on this hardware. The company is just getting started, and one of its first projects is Chat with RTX, a preview build of which we're reviewing today. NVIDIA has a vast install base of millions of gamers with GeForce RTX GPUs, so in the near future we expect the company to take a more active role in the AI PC ecosystem by providing additional AI-driven experiences and productivity tools for PCs with a GeForce RTX GPU.

Chat with RTX, as we said, is a text-based generative AI platform (a ChatGPT or Copilot of sorts), but one that doesn't send a single bit of your data to a cloud server or rely on web-based datasets. The dataset is whatever you provide, and you even have the flexibility to choose an AI model: Llama 2 or Mistral. For this tech demo, NVIDIA provides both models along with their native training datasets, which are current through mid-2022.
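NVIDIA hasn't published the internals of this preview build, but the pattern it embodies is easy to sketch: read the user's own files, retrieve the passages relevant to a question, and hand that context to a locally hosted model. The following is a minimal conceptual sketch of that pattern, with a toy keyword retriever and a stubbed model callable standing in for Llama 2 or Mistral; none of these function names come from NVIDIA's software.

```python
from pathlib import Path

def load_corpus(folder: str) -> dict[str, str]:
    """Read the user's own text files; nothing leaves the machine."""
    return {p.name: p.read_text(encoding="utf-8")
            for p in Path(folder).glob("*.txt")}

def retrieve(corpus: dict[str, str], query: str, top_k: int = 2) -> list[str]:
    """Toy retrieval: rank documents by word overlap with the query.
    (Real pipelines use vector embeddings; this only shows the shape.)"""
    words = set(query.lower().split())
    scored = sorted(corpus.items(),
                    key=lambda kv: -len(words & set(kv[1].lower().split())))
    return [text for _, text in scored[:top_k]]

def answer(corpus: dict[str, str], query: str, model) -> str:
    """Assemble context locally and pass it to a local model callable.
    In Chat with RTX that role is played by Llama 2 or Mistral on the GPU."""
    context = "\n---\n".join(retrieve(corpus, query))
    return model(f"Context:\n{context}\n\nQuestion: {query}")
```

The key property, and the whole point of Chat with RTX, is that every step above runs on your own machine: the corpus, the retrieval, and the model call all stay local.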

In this article, we take Chat with RTX for a spin to show you its potential to bring powerful, completely-offline AI chat to gamers.
