• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

NVIDIA Unveils Next Generation CUDA GPU Architecture – Codenamed ''Fermi''

FordGT90Concept

"I go fast!1!11!1!"
Joined
Oct 13, 2008
Messages
26,259 (4.65/day)
Location
IA, USA
System Name BY-2021
Processor AMD Ryzen 7 5800X (65w eco profile)
Motherboard MSI B550 Gaming Plus
Cooling Scythe Mugen (rev 5)
Memory 2 x Kingston HyperX DDR4-3200 32 GiB
Video Card(s) AMD Radeon RX 7900 XT
Storage Samsung 980 Pro, Seagate Exos X20 TB 7200 RPM
Display(s) Nixeus NX-EDG274K (3840x2160@144 DP) + Samsung SyncMaster 906BW (1440x900@60 HDMI-DVI)
Case Coolermaster HAF 932 w/ USB 3.0 5.25" bay + USB 3.2 (A+C) 3.5" bay
Audio Device(s) Realtek ALC1150, Micca OriGen+
Power Supply Enermax Platimax 850w
Mouse Nixeus REVEL-X
Keyboard Tesoro Excalibur
Software Windows 10 Home 64-bit
Benchmark Scores Faster than the tortoise; slower than the hare.
Since when can DX11 do the things CUDA can. Besides Physics, what does DX11 offer that CUDA does. They are two different technologies. Last I checked, DX11 has nothing to do with parallel computing.
Since it included DirectCompute (which is an API inside of the DirectX 11 package).

DX11 offers unified sound support, unified input support, unified networking support, and more that CUDA does not. But that's not what you were asking. CUDA and DirectCompute are virtually the same with one caveat: Microsoft will flex their industry muscles to get developers to use it and developers will want to use it because the same code will work on NVIDIA, AMD, and Intel GPUs.

DirectCompute has everything to do with parallel computing. That is the reason why it was authored.
 
Joined
Aug 13, 2008
Messages
146 (0.03/day)
Location
Dresden, Germany, Europe, Earth
System Name HTPC
Processor Intel Core i5
Motherboard Asrock ITX
Cooling Water
Memory 8GB
Video Card(s) Radeon R9 290
Storage Samsung 850 EVO 1TB
Display(s) Sharp Aquos 70" TV
Case Coolermaster ITX
Audio Device(s) Radeon Onboard HDMI
Software Windows 10 64bit
Benchmark Scores -242 whatsoever
now its official, this is the 5870 killer, damn i had hoped that ATi would get some more marketshares ... but with an opponend like this, oh boy

now we'd need some benchies ... i believe Wizz does know more about the new card then we
 
Joined
Apr 29, 2008
Messages
742 (0.13/day)
Location
Auckland
System Name PBD
Processor Core i5 760 @ 4.0GHz
Motherboard Asus Maximus III Gene
Cooling Corsair H-50-1
Memory 4 x 4096mb G.Skill Ripjaws 1600 Cas7
Video Card(s) ASUS GTX 670 DirectCU TOP
Storage Crucial 256GB SSD (system) + 2x Samsung F3 1TB (storage) + 2x 2TB Raid-1 NAS (backup)
Display(s) Dell SP2309w 23" 2048x1152
Case Antec Max Fusion Remote
Power Supply Corsair AX750W
Software Win 7 Pro x64
Soldier,

Market share does not [directly] come from having the performance crown. It comes from having the best price-performance ratio. In the GT200 vs HD4000 era Nvidia had the fastest GPU but ATI gained market-share because of it's better value.

Of course the halo-effect can have a small influence, but it is generally not great.
 
Joined
Nov 1, 2008
Messages
468 (0.08/day)
System Name It does stuff
Processor Ryzen 3600
Motherboard B550 Gaming X V2
Cooling Stock
Memory 16GB DDR4 3600
Video Card(s) RX 6700XT
Storage Too much
Display(s) 27" & 21.5"
Case Antec 300
Audio Device(s) ASUS Xonar DGX / Sony MDR-XB500s
Power Supply Corsair 750W
Software Win10 64
Joined
Oct 1, 2006
Messages
4,883 (0.76/day)
Location
Hong Kong
Processor Core i7-12700k
Motherboard Z690 Aero G D4
Cooling Custom loop water, 3x 420 Rad
Video Card(s) RX 7900 XTX Phantom Gaming
Storage Plextor M10P 2TB
Display(s) InnoCN 27M2V
Case Thermaltake Level 20 XT
Audio Device(s) Soundblaster AE-5 Plus
Power Supply FSP Aurum PT 1200W
Software Windows 11 Pro 64-bit
Since it included DirectCompute (which is an API inside of the DirectX 11 package).

DX11 offers unified sound support, unified input support, unified networking support, and more that CUDA does not. But that's not what you were asking. CUDA and DirectCompute are virtually the same with one caveat: Microsoft will flex their industry muscles to get developers to use it and developers will want to use it because the same code will work on NVIDIA, AMD, and Intel GPUs.

DirectCompute has everything to do with parallel computing. That is the reason why it was authored.
DX 11 includes Direct Compute 5.0.
DC4.0 and 4.1 are avalible to DX10 and 10.1 hardware respectively.
 

newtekie1

Semi-Retired Folder
Joined
Nov 22, 2005
Messages
28,472 (4.25/day)
Location
Indiana, USA
Processor Intel Core i7 10850K@5.2GHz
Motherboard AsRock Z470 Taichi
Cooling Corsair H115i Pro w/ Noctua NF-A14 Fans
Memory 32GB DDR4-3600
Video Card(s) RTX 2070 Super
Storage 500GB SX8200 Pro + 8TB with 1TB SSD Cache
Display(s) Acer Nitro VG280K 4K 28"
Case Fractal Design Define S
Audio Device(s) Onboard is good enough for me
Power Supply eVGA SuperNOVA 1000w G3
Software Windows 10 Pro x64
Since it included DirectCompute (which is an API inside of the DirectX 11 package).

DX11 offers unified sound support, unified input support, unified networking support, and more that CUDA does not. But that's not what you were asking. CUDA and DirectCompute are virtually the same with one caveat: Microsoft will flex their industry muscles to get developers to use it and developers will want to use it because the same code will work on NVIDIA, AMD, and Intel GPUs.

DirectCompute has everything to do with parallel computing. That is the reason why it was authored.

DX 11 includes Direct Compute 5.0.
DC4.0 and 4.1 are avalible to DX10 and 10.1 hardware respectively.

Yes, and it has gone largely unuses, and will continue to go unused due to its inflexibility compared to CUDA(and Streams).

DX11(and DX10) were not focussed on parallel computing, and don't compete with CUDA/Streams/OpenCL. You can't use DX11/DirectCompute to do the things that are possible with CUDA/Streams/OpenCL.
 
Joined
Oct 2, 2004
Messages
13,791 (1.94/day)
I just want damn OpenCL to become widely used standard so we move forward. This whole thing with CUDA is not getting us anywhere.
 
Joined
Jun 10, 2005
Messages
1,775 (0.26/day)
Location
Singapore
System Name Half-fucked overclockedd
Processor Intel Core i7 2600k 3.40Ghz @ 4.20Ghz
Motherboard Gigabyte P67 UD7 B3
Cooling Antec Kuhler H2O 920
Memory G.Skill RipjawsX DDR3 8GB X2 1866Mhz (Model F3-2133C9D-16GXH)
Video Card(s) Gigabyte AORUS 1080Ti Extreme Edition
Storage Samsung 840 Pro 256GB / Western Digital Black Cavier 2TB X2
Display(s) Dell U2715H 2560X1440
Case NZXT Phantom
Audio Device(s) Creative Sound Blaster Recon3D Fatal1ty Professional
Power Supply Cooler Master Silent Pro Gold 1000W
Mouse Logitech G510
Keyboard Tesoro Excalibur Spectrum
Software Microsoft Windows 10 Professional
Yes, and it has gone largely unuses, and will continue to go unused due to its inflexibility compared to CUDA(and Streams).

DX11(and DX10) were not focussed on parallel computing, and don't compete with CUDA/Streams/OpenCL. You can't use DX11/DirectCompute to do the things that are possible with CUDA/Streams/OpenCL.

Wait i don't really get you. In your previous post, you said DX11 had nothing to do with parallel. And from your current post, you're saying DX11 has it but just not focused? I'm confuse.
 
Joined
Jul 19, 2006
Messages
43,585 (6.74/day)
Processor AMD Ryzen 7 7800X3D
Motherboard ASUS TUF x670e
Cooling EK AIO 360. Phantek T30 fans.
Memory 32GB G.Skill 6000Mhz
Video Card(s) Asus RTX 4090
Storage WD m.2
Display(s) LG C2 Evo OLED 42"
Case Lian Li PC 011 Dynamic Evo
Audio Device(s) Topping E70 DAC, SMSL SP200 Headphone Amp.
Power Supply FSP Hydro Ti PRO 1000W
Mouse Razer Basilisk V3 Pro
Keyboard Tester84
Software Windows 11
Yes, and it has gone largely unuses, and will continue to go unused due to its inflexibility compared to CUDA(and Streams).

DX11(and DX10) were not focussed on parallel computing, and don't compete with CUDA/Streams/OpenCL. You can't use DX11/DirectCompute to do the things that are possible with CUDA/Streams/OpenCL.

Don't worry, Nvidia will still call it CUDA. ;) Like it matters...
 

FordGT90Concept

"I go fast!1!11!1!"
Joined
Oct 13, 2008
Messages
26,259 (4.65/day)
Location
IA, USA
System Name BY-2021
Processor AMD Ryzen 7 5800X (65w eco profile)
Motherboard MSI B550 Gaming Plus
Cooling Scythe Mugen (rev 5)
Memory 2 x Kingston HyperX DDR4-3200 32 GiB
Video Card(s) AMD Radeon RX 7900 XT
Storage Samsung 980 Pro, Seagate Exos X20 TB 7200 RPM
Display(s) Nixeus NX-EDG274K (3840x2160@144 DP) + Samsung SyncMaster 906BW (1440x900@60 HDMI-DVI)
Case Coolermaster HAF 932 w/ USB 3.0 5.25" bay + USB 3.2 (A+C) 3.5" bay
Audio Device(s) Realtek ALC1150, Micca OriGen+
Power Supply Enermax Platimax 850w
Mouse Nixeus REVEL-X
Keyboard Tesoro Excalibur
Software Windows 10 Home 64-bit
Benchmark Scores Faster than the tortoise; slower than the hare.
Yes, and it has gone largely unuses, and will continue to go unused due to its inflexibility compared to CUDA(and Streams).
It's not officially out until Windows 7 (DX11) launches. DirectCompute is backwards compatible with Windows Vista (DX10/10.1). The only real information we have about is what is presented in this slide show (including its "focus on parallel computing").


And exactly, erocker. Microsoft gets to do the evil laughing and thumb twiddling instead of NVIDIA. :roll:
 

Benetanegia

New Member
Joined
Sep 11, 2009
Messages
2,680 (0.50/day)
Location
Reaching your left retina.
Don't worry, Nvidia will still call it CUDA. ;) Like it matters...

They call it CUDA, because CUDA is the general computing architecture part behind their chip design. CUDA has never been the programing language, C for CUDA was. It's as if Intel/AMD said C for x86. x86 is the architecture, c is the programing language.

So Nvidia will run DX11 compute through CUDA, OpenCL through CUDA, etc.

CUDA= Compute Unified Device Architecture <- It says it all.

The problem is that the media has mixed things badly, giving the name CUDA to the software, when it's not.
 

FordGT90Concept

"I go fast!1!11!1!"
Joined
Oct 13, 2008
Messages
26,259 (4.65/day)
Location
IA, USA
System Name BY-2021
Processor AMD Ryzen 7 5800X (65w eco profile)
Motherboard MSI B550 Gaming Plus
Cooling Scythe Mugen (rev 5)
Memory 2 x Kingston HyperX DDR4-3200 32 GiB
Video Card(s) AMD Radeon RX 7900 XT
Storage Samsung 980 Pro, Seagate Exos X20 TB 7200 RPM
Display(s) Nixeus NX-EDG274K (3840x2160@144 DP) + Samsung SyncMaster 906BW (1440x900@60 HDMI-DVI)
Case Coolermaster HAF 932 w/ USB 3.0 5.25" bay + USB 3.2 (A+C) 3.5" bay
Audio Device(s) Realtek ALC1150, Micca OriGen+
Power Supply Enermax Platimax 850w
Mouse Nixeus REVEL-X
Keyboard Tesoro Excalibur
Software Windows 10 Home 64-bit
Benchmark Scores Faster than the tortoise; slower than the hare.
Stream nor CUDA gets much general consumer use--Microsoft is in a position to change that. This is getting off topic. *shame on me*
 

Benetanegia

New Member
Joined
Sep 11, 2009
Messages
2,680 (0.50/day)
Location
Reaching your left retina.
All discussion is pointless. Nvidia will have DX11, OpenGL and c for CUDA, the three and one will not interfere with the other. CUDA is going to be used in industrial and scientific areas, because it really is much much better for that kind of things, mainly because it's a high level language programing, while the other two are low/medium. Also in those places the ability to run on every GPU is not an issue as they want the best optimization posible for the computer they have just built. They already make different optimizations depending if they use Opterons or Xeons.

CUDA is going to be used in a supercomputer so that alone means a lot of cards sold. I don't know the number, but it could mean 20.000 Tesla cards. At $4000 each, do the calculations.

In the meantime the consumer market will not be affected at all. G80 and GT200 were already focused on general computing and they did well in gaming. People have nothing to worry about.
 
Last edited:

FordGT90Concept

"I go fast!1!11!1!"
Joined
Oct 13, 2008
Messages
26,259 (4.65/day)
Location
IA, USA
System Name BY-2021
Processor AMD Ryzen 7 5800X (65w eco profile)
Motherboard MSI B550 Gaming Plus
Cooling Scythe Mugen (rev 5)
Memory 2 x Kingston HyperX DDR4-3200 32 GiB
Video Card(s) AMD Radeon RX 7900 XT
Storage Samsung 980 Pro, Seagate Exos X20 TB 7200 RPM
Display(s) Nixeus NX-EDG274K (3840x2160@144 DP) + Samsung SyncMaster 906BW (1440x900@60 HDMI-DVI)
Case Coolermaster HAF 932 w/ USB 3.0 5.25" bay + USB 3.2 (A+C) 3.5" bay
Audio Device(s) Realtek ALC1150, Micca OriGen+
Power Supply Enermax Platimax 850w
Mouse Nixeus REVEL-X
Keyboard Tesoro Excalibur
Software Windows 10 Home 64-bit
Benchmark Scores Faster than the tortoise; slower than the hare.
The DirectCompute, OpenCL, and c extension for CUDA are all APIs. The playing field is level.


All super computers still run racks of CPUs (almost 300,000 of them in one :eek:) and very, very few GPUs (no more than display for management). That could change with Larrabee but I doubt it will change before then. Why? CUDA nor Stream are fully programmable: Larrabee is. Make a protein folding driver and you got a protein folding Larrabee card. Make a DX11 driver and you got a gaming Larrabee card. Every card, so long as it has an appropriate driver, is made to order for the task at hand. Throw that on a Nehalem-based Itanium mainframe and you got a giant bucket of kickass computing. :rockout:
 

Benetanegia

New Member
Joined
Sep 11, 2009
Messages
2,680 (0.50/day)
Location
Reaching your left retina.
The DirectCompute, OpenCL, and c extension for CUDA are all APIs. The playing field is level.


All super computers still run racks of CPUs (almost 300,000 of them in one :eek:) and very, very few GPUs (no more than display for management). That could change with Larrabee but I doubt it will change before then. Why? CUDA nor Stream are fully programmable: Larrabee is. Make a protein folding driver and you got a protein folding Larrabee card. Make a DX11 driver and you got a gaming Larrabee card. Every card, so long as it has an appropriate driver, is made to order for the task at hand. Throw that on a Nehalem-based Itanium mainframe and you got a giant bucket of kickass computing. :rockout:

You are heavily outdated man. Tesla cards are going to be used by ORNL to make the fastest supercomputer. 10x faster than RoadRunner.

http://www.dvhardware.net/article38174.html

Or here's another example, instead of creating a 8000 CPU (1000 8p servers) supersomputer they used only 48 servers with Tesla.

http://wallstreetandtech.com/it-inf...ticleID=220200055&cid=nl_wallstreettech_daily

Or this: http://www.embedded-computing.com/news/Technology+Partnerships/14323 Cray supercomputer on a desk using Tesla.

You are outdated when it comes to CUDA programability too: http://www.techpowerup.com/105013/N...st_IDE_for_Developers_Working_with_MS_VS.html

Fermi can run C/C++ and Fortran natively, Larrabee lost every bit of advantage it had there and it's nowhere to be found and it will not be until late 2010 or even 2011.
 
Last edited:
Joined
Apr 26, 2009
Messages
513 (0.09/day)
Location
You are here.
System Name Prometheus
Processor AMD Ryzen 9 5950x
Motherboard ASUS ROG Strix B550-I Gaming
Cooling EKWB EK-240 AIO D-RGB
Memory G.Skill Trident Z Neo 32GB
Video Card(s) MSI RTX 4070Ti Ventus 3X OC 12GB
Storage WD Black SN850 1TB + 1 x Samsung 970 Evo Plus 2TB
Display(s) DELL U4320Q 4K + Wacom Cintiq Pro 16 4K
Case Jonsbo A4 ver1.1 SFF
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Corsair SF750 Platinum SFX
Mouse Logitech Pro Wireless
Keyboard Vortex Race 3 75% MX Brown
Software Windows 11 Pro x64
^ True that.
 

FordGT90Concept

"I go fast!1!11!1!"
Joined
Oct 13, 2008
Messages
26,259 (4.65/day)
Location
IA, USA
System Name BY-2021
Processor AMD Ryzen 7 5800X (65w eco profile)
Motherboard MSI B550 Gaming Plus
Cooling Scythe Mugen (rev 5)
Memory 2 x Kingston HyperX DDR4-3200 32 GiB
Video Card(s) AMD Radeon RX 7900 XT
Storage Samsung 980 Pro, Seagate Exos X20 TB 7200 RPM
Display(s) Nixeus NX-EDG274K (3840x2160@144 DP) + Samsung SyncMaster 906BW (1440x900@60 HDMI-DVI)
Case Coolermaster HAF 932 w/ USB 3.0 5.25" bay + USB 3.2 (A+C) 3.5" bay
Audio Device(s) Realtek ALC1150, Micca OriGen+
Power Supply Enermax Platimax 850w
Mouse Nixeus REVEL-X
Keyboard Tesoro Excalibur
Software Windows 10 Home 64-bit
Benchmark Scores Faster than the tortoise; slower than the hare.
You are heavily outdated man. Tesla cards are going to be used by ORNL to make the fastest supercomputer. 10x faster than RoadRunner.

...
It ain't official until it runs Linpack.

Larrabee is slated for Q2 2010.
 

leonard_222003

New Member
Joined
Jan 29, 2006
Messages
241 (0.04/day)
System Name Home
Processor Q6600 @ 3300
Motherboard Gigabyte p31 ds3l
Cooling TRUE Intel Edition
Memory 4 gb x 800 mhz
Video Card(s) Asus GTX 560
Storage WD 1x250 gb Seagate 2x 1tb
Display(s) samsung T220
Case no name
Audio Device(s) onboard
Power Supply chieftec 550w
Software Windows 7 64
I'm getting sick about all the names and possible things they can do but they really can't.
Bottom line , what they can do new now after some time of screaming CUDA and STREAMS :
1.they can encode in video , for professionals it's not an option , not to many options on the encoder and if you start to give it some deinterlacing stuff/filters sutff to do and some huge bitrate the video card will be very limited in what it can do to help , i tried it and it's crap , it's good for youtube and PS3/xbox360 but not freaks who want the best quality
2. folding home is not my thing
3. it can speed up some applications like adobe and sony vegas , again very very limited , cpu is still the best upgrade if you do this kind of things
4. some new decoders are helped by video cards but for people who have at least a dual core it's like "so now it runs partially on my video card? wow , i would've never known"
5. physics in games , it's probably the only thing we actually see it as a real progress from all the bullshit they serve us
6. something i forgot :)
Bottom line is they talked about what the video card can do and how great things will be but time passed and nothing , the encoding which was supposed to be top notch and fast on a gpu and look , a year has passed and it's basic and limited in what you can do , it's so basic that most of us who want better quality never has an option using badaboom/cyberlink espresso , they are crapppp.
I will not stand for more bullshit about "supercomputing" on the GPU , SHOW ME WHAT YOU CAN DO OTHER THAN GRAPHICS THAT COULD CHANGE MY PRIORITIES IN BUYING HARDWARE!!!
 
Joined
Apr 26, 2009
Messages
513 (0.09/day)
Location
You are here.
System Name Prometheus
Processor AMD Ryzen 9 5950x
Motherboard ASUS ROG Strix B550-I Gaming
Cooling EKWB EK-240 AIO D-RGB
Memory G.Skill Trident Z Neo 32GB
Video Card(s) MSI RTX 4070Ti Ventus 3X OC 12GB
Storage WD Black SN850 1TB + 1 x Samsung 970 Evo Plus 2TB
Display(s) DELL U4320Q 4K + Wacom Cintiq Pro 16 4K
Case Jonsbo A4 ver1.1 SFF
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Corsair SF750 Platinum SFX
Mouse Logitech Pro Wireless
Keyboard Vortex Race 3 75% MX Brown
Software Windows 11 Pro x64

FordGT90Concept

"I go fast!1!11!1!"
Joined
Oct 13, 2008
Messages
26,259 (4.65/day)
Location
IA, USA
System Name BY-2021
Processor AMD Ryzen 7 5800X (65w eco profile)
Motherboard MSI B550 Gaming Plus
Cooling Scythe Mugen (rev 5)
Memory 2 x Kingston HyperX DDR4-3200 32 GiB
Video Card(s) AMD Radeon RX 7900 XT
Storage Samsung 980 Pro, Seagate Exos X20 TB 7200 RPM
Display(s) Nixeus NX-EDG274K (3840x2160@144 DP) + Samsung SyncMaster 906BW (1440x900@60 HDMI-DVI)
Case Coolermaster HAF 932 w/ USB 3.0 5.25" bay + USB 3.2 (A+C) 3.5" bay
Audio Device(s) Realtek ALC1150, Micca OriGen+
Power Supply Enermax Platimax 850w
Mouse Nixeus REVEL-X
Keyboard Tesoro Excalibur
Software Windows 10 Home 64-bit
Benchmark Scores Faster than the tortoise; slower than the hare.
Designed for CUDA means it doesn't benefit from Stream too. The reason why it really hasn't caught on except in a few niches is because support is still spotty at best. Unificiation is required and, at least on Windows, DirectCompute is in a position to do that. The gaming industry wouldn't be where it is today without DirectPlay, Direct3D, and DirectSound from the mid 1990's. History repeats.
 

Benetanegia

New Member
Joined
Sep 11, 2009
Messages
2,680 (0.50/day)
Location
Reaching your left retina.
It ain't official until it runs Linpack.

Larrabee is slated for Q2 2010.

If it is 10x faster for what it has been designed, but it can't run Linpack, it still is 10x faster.

@leonard_222003

The GPU architecture has changed, and you are talking about the past implementations. And it's been 2 years since it started. How much do you think it took the x86 CPUs to gain traction and substitute other implementations? More than that. New things get time.
 

leonard_222003

New Member
Joined
Jan 29, 2006
Messages
241 (0.04/day)
System Name Home
Processor Q6600 @ 3300
Motherboard Gigabyte p31 ds3l
Cooling TRUE Intel Edition
Memory 4 gb x 800 mhz
Video Card(s) Asus GTX 560
Storage WD 1x250 gb Seagate 2x 1tb
Display(s) samsung T220
Case no name
Audio Device(s) onboard
Power Supply chieftec 550w
Software Windows 7 64
If it is 10x faster for what it has been designed, but it can't run Linpack, it still is 10x faster.

@leonard_222003

The GPU architecture has changed, and you are talking about the past implementations. And it's been 2 years since it started. How much do you think it took the x86 CPUs to gain traction and substitute other implementations? More than that. New things get time.

I hope it does man because the hype sorunding this is greater and greater but we see nothing really spectacular.
Also what the man with vreveal shows me , is that all the GPU can do ? that program is so uselles you can't even imagine it before you use it , just try it and then talk.
 
Joined
Apr 26, 2009
Messages
513 (0.09/day)
Location
You are here.
System Name Prometheus
Processor AMD Ryzen 9 5950x
Motherboard ASUS ROG Strix B550-I Gaming
Cooling EKWB EK-240 AIO D-RGB
Memory G.Skill Trident Z Neo 32GB
Video Card(s) MSI RTX 4070Ti Ventus 3X OC 12GB
Storage WD Black SN850 1TB + 1 x Samsung 970 Evo Plus 2TB
Display(s) DELL U4320Q 4K + Wacom Cintiq Pro 16 4K
Case Jonsbo A4 ver1.1 SFF
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Corsair SF750 Platinum SFX
Mouse Logitech Pro Wireless
Keyboard Vortex Race 3 75% MX Brown
Software Windows 11 Pro x64
Designed for CUDA means it doesn't benefit from Stream too. The reason why it really hasn't caught on except in a few niches is because support is still spotty at best. Unificiation is required and, at least on Windows, DirectCompute is in a position to do that. The gaming industry wouldn't be where it is today without DirectPlay, Direct3D, and DirectSound from the mid 1990's. History repeats.

It's interesting that you say that since these days I see nVidia doing a lot of work in this direction while ATI stands by waiting to see what happens. Intel with it's new "is it a plane? is it a bird? is it a GPU?" Larabee project that we know very little about is closer to that while not having an actual product. OpenCL was proposed by Apple (of all the things...) to Intel, ATI and nVidia. It's nothing more then a programming language. Same can be said about the DirectCompute included in DX10/10.1/11...

"Unification"? On Windows? Think bigger, better, more... OpenX. You are now imposing limitation to the idea, not just the actual product. If Windows is what you think about, then your idea is to "help" ATI, not the developers and the users. There is life outside Windows you know. DirectCompute is not the answer... it belongs to Microsoft.

I don't think people realize that CUDA and Stream will be here for ever. Why is that? Because all these so called "open" or "unified" standards run on CUDA/Stream. There is the GPU, then there is CUDA/Stream, then there's everything else. This things are not really APIs, they are just wrappers to the CUDA/Stream APIs.

This is why coding something for CUDA/Stream is more efficient then using OpenCL, DirectCompute or whatever.

So I must point out the obvious, because people also think that it's nVidia's fault that CUDA is used and not OpenCL or whatever. The coders choose to use CUDA. nVidia supports just as well OpenCL or whatever.

Another obvious point, would be that there are far more things using CUDA then there are using Stream. This is not because nVidia is the big bad wolf, it is because it has a pro-active mentality the opposite of ATI's wait and see approach.

ATI is pushing games. DX11. That's it. nVidia is struggling to change the market mentality by pushing GPGPU. And they a paying for it. Everyone has something to say about them, then points out that the competition (namely ATI) is doing things much better... the truth is they are not doing it.

For example PhysX/Havok. ATI doesn't have Havok anymore, they "employed" a shady 3rd rate company to "create" an "open" physics standard. I don't think they intend to complete the project, they just need something "in the works" to compete with PhysX. So it seems that PhysX does matter.
 
Joined
Apr 26, 2009
Messages
513 (0.09/day)
Location
You are here.
System Name Prometheus
Processor AMD Ryzen 9 5950x
Motherboard ASUS ROG Strix B550-I Gaming
Cooling EKWB EK-240 AIO D-RGB
Memory G.Skill Trident Z Neo 32GB
Video Card(s) MSI RTX 4070Ti Ventus 3X OC 12GB
Storage WD Black SN850 1TB + 1 x Samsung 970 Evo Plus 2TB
Display(s) DELL U4320Q 4K + Wacom Cintiq Pro 16 4K
Case Jonsbo A4 ver1.1 SFF
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Corsair SF750 Platinum SFX
Mouse Logitech Pro Wireless
Keyboard Vortex Race 3 75% MX Brown
Software Windows 11 Pro x64
I hope it does man because the hype sorunding this is greater and greater but we see nothing really spectacular.
Also what the man with vreveal shows me , is that all the GPU can do ? that program is so uselles you can't even imagine it before you use it , just try it and then talk.

Ofcourse it's useless... you have an ATI card... Joke aside, no that is not "all" the GPU can do. It's what the developers of the application intended to do... take your poor quality, shaky family movies, or old dusty poor res movies and try and make them better. Why is that useless?

Only 5x faster then you'd normally do using an expensive high end CPU. Performance increase is always useless...

It's the intended purpose of the application, it's not a GPU functions' showcase.

Why do people post before thinking?
 

Benetanegia

New Member
Joined
Sep 11, 2009
Messages
2,680 (0.50/day)
Location
Reaching your left retina.
I hope it does man because the hype sorunding this is greater and greater but we see nothing really spectacular.
Also what the man with vreveal shows me , is that all the GPU can do ? that program is so uselles you can't even imagine it before you use it , just try it and then talk.

I have tried it with movies made with my cellphone and it did wonders. It's not something that I would pay for because I don't usually record with my phone, but it's useful, undoubtely. Some people do use their phones to record videos, so it can be very useful for them.

Why does everybody only care about what is good for them? All this GPU computing is free* right now, you just have to choose the correct program. So if it's free, where's the problem? Nvidia right now sell their cards competitively according to graphics performance, but they offer more. That extra things are not for everybody? And what? Let those people have what they DO want, what they do find useful. TBH, it's as if it bothered you that other people had something that you do not want anyway.

*The programs are not free, but they don't cost more because of that feature, it's a free added feature.

@Sihastru

Wel said.

Regarding CUDA/Stream I think that the best way of explaining it is that DX11 and OpenCL are more like BASIC (programming language) and CUDA/Stream are more like ASSEMBLY language, in the sense that's what the GPUs run natively. But c for CUDA (programming API) is a high level language with direct relation to the low level CUDA (architecture and ISA). CUDA is so good because it's a high level and low level language at the same time, and that is very useful in the HPC market.
 
Last edited:
Top