• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

GPU video encode Experience

jamsbong

New Member
Joined
Mar 17, 2010
Messages
83 (0.02/day)
System Name 2500Kjamsbong
Processor Core i5 2500K @ 4.6Ghz
Motherboard Asrock Extreme 4 Z68
Cooling Zalman Reserator (CPU and GPU)
Memory DDR3 8GB
Video Card(s) EVGA Nvidia 560Ti 1GB
Storage 60GB Kingston SSD
Display(s) 24" Dell IPS
Case CoolerMaster 690 Advanced II
Audio Device(s) on-board
Power Supply Zalman ZM-600HP modular 600watt
Software Windows 7
Hi Everyone,

I thought I'd share this interesting findings about GPU accelerated video encoding. It may not be interesting especially if you already done some tweaking in the past.

I've used Arcsoft's mediaconverter7 trial version for the tests. To enable GPU encoding, you have to enable hardware encoding in the ATI driver menu. I used ATI tray tool to observe the GPU usage and task manager for the CPU usage.

I found that most of the time, the CPU is doing all the hard work. My Quadcore is flexing its muscle even when GPU encoding is enabled. The most the GPU needed was about 20% (at 240Mhz underclocked speed).

The software utilises GPU with .mp4 file extension conversion only. I did not manage to find other formats that powers up the GPU usage. Another evidence is the quality, you can tell if it is the GPU that converts the file as the quality can be quite bad.

I don't have a Nvidia card yet so I can't really tell if CUDA can do more GPU enable file format.

As for the CPU and GPU utilisation, it seems that the higher resolution conversion will maximises the hardware workload and to me that is more efficient. Kinda like playing game at high resolution to make the GPU work hard. at Low DVD resolution, the CPU was about 50% and GPU... nil. at HD resolution, 90% and up to 20% GPU.

As for quality. It seems that using CPU standalone conversion, the quality at low bitrate is SO much better than GPU and it is what one expects. The right way to say it will be an efficient compromise in quality. In the case of GPU+CPU, once you reduce the bitrate, the picture suddenly becomes pixelated and very ugly. At the recommended high bitrate, both GPU and CPU standalone had similar quality. This means that GPU encoding has a huge drop-off in quality when there is restriction in the bitrate.

In regards to acceleration. It seems there is only a modest amount of speed improvement. I was testing a 47second video. I will try to test a 1hr video to see if there is a significant improvement with speed using GPU. at the moment, the speed is almost identical.

I hope this has been insightful.

Cheers

James
 
Joined
Sep 9, 2009
Messages
1,168 (0.20/day)
Location
Austria, Europe
System Name Bang4Buck
Processor AMD Ryzen 9 5900x
Motherboard MSI PRESTIGE x570 CREATION
Cooling Fractal Design Celsius S36
Memory 32Gb 4400Mhz Patriot Viper Steel(Samsung B-die) @ 3800Mhz 16-16-16-32-48-1T @ 1.38v
Video Card(s) MSI RTX 3080 Ti Suprim X
Storage Adata SX8200Pro 512Gb/2x Crucial P1 1Tb/Samsung 840 EVO/6Tb Raid -HGST Enterprise/2x IronWolf 8Tb/
Display(s) Samsung UE49KS8002 4K HDR TV (US - 9 series)
Case Fractal Design Define R6 Black Usb-C
Audio Device(s) HDMI out to Denon X4400H reciever, 2x Dali Zensor 7, Dali Zensor Vokal, 2x Dali Zensor 1, Yamaha Sub
Power Supply Seasonic Prime Ultra 750W
Mouse Logitech G305 Lightspeed
Keyboard Logitech K520
Software Windows 11 Pro x64
Benchmark Scores https://www.3dmark.com/spy/26216445
I would say that Arcsoft`s mediaconverter7 is not that good at using gpu encoding. I am using badaboom 2.0 on a nvidia gtx460, and i must say the conversion speed is superb, and so is the quality. Although, badaboom only works on nvidia, so i can not speak for Ati(Amd) cards.
 

Mussels

Freshwater Moderator
Joined
Oct 6, 2004
Messages
58,412 (7.78/day)
Location
Oystralia
System Name Rainbow Sparkles (Power efficient, <350W gaming load)
Processor Ryzen R7 5800x3D (Undervolted, 4.45GHz all core)
Motherboard Asus x570-F (BIOS Modded)
Cooling Alphacool Apex UV - Alphacool Eisblock XPX Aurora + EK Quantum ARGB 3090 w/ active backplate
Memory 2x32GB DDR4 3600 Corsair Vengeance RGB @3866 C18-22-22-22-42 TRFC704 (1.4V Hynix MJR - SoC 1.15V)
Video Card(s) Galax RTX 3090 SG 24GB: Underclocked to 1700Mhz 0.750v (375W down to 250W))
Storage 2TB WD SN850 NVME + 1TB Sasmsung 970 Pro NVME + 1TB Intel 6000P NVME USB 3.2
Display(s) Phillips 32 32M1N5800A (4k144), LG 32" (4K60) | Gigabyte G32QC (2k165) | Phillips 328m6fjrmb (2K144)
Case Fractal Design R6
Audio Device(s) Logitech G560 | Corsair Void pro RGB |Blue Yeti mic
Power Supply Fractal Ion+ 2 860W (Platinum) (This thing is God-tier. Silent and TINY)
Mouse Logitech G Pro wireless + Steelseries Prisma XL
Keyboard Razer Huntsman TE ( Sexy white keycaps)
VR HMD Oculus Rift S + Quest 2
Software Windows 11 pro x64 (Yes, it's genuinely a good OS) OpenRGB - ditch the branded bloatware!
Benchmark Scores Nyooom.
the problem with the hardware video encoders, is that they only work on media types that the hardware supports.


so while that means that the H264 codec may be supported, it may only be at certain resolutions, frame rates, and encoding settigns (reference frames, B frames, etc etc)


Even in the best circumstances, encoding will be a mix between hardware and software - unless its VERY specifically configured (For example, the clearly defined DVD and blu ray standards, for set top players)



the best methods that work atm dont directly use the GPU to encode it, but rather use GPGPU methods like CUDA and STREAM. Not too many of those exist, but they tend to work better (and have similar limitations to what i said above, limited output support)
 
Joined
Jan 14, 2009
Messages
2,644 (0.44/day)
Location
...
System Name MRCOMP!
Processor 5800X3D
Motherboard MSI Gaming Plus
Cooling Corsair 280 AIO
Memory 64GB 3600mhz
Video Card(s) GTX3060
Storage 1TB SSD
Display(s) Samsung Neo
Case No Case... just sitting on cardboard :D
Power Supply Antec 650w
badaboom works very well. only supports nvidia tho, as has already been said.
 

jamsbong

New Member
Joined
Mar 17, 2010
Messages
83 (0.02/day)
System Name 2500Kjamsbong
Processor Core i5 2500K @ 4.6Ghz
Motherboard Asrock Extreme 4 Z68
Cooling Zalman Reserator (CPU and GPU)
Memory DDR3 8GB
Video Card(s) EVGA Nvidia 560Ti 1GB
Storage 60GB Kingston SSD
Display(s) 24" Dell IPS
Case CoolerMaster 690 Advanced II
Audio Device(s) on-board
Power Supply Zalman ZM-600HP modular 600watt
Software Windows 7
Thanks for the feedback.

I'm really glad that Nvidia has made the initiative of GPGPU. I think to some extend has triggered the openCL and directcompute initiate. Both API are the way to go for software dev as there is no restriction in the hardware.

I hope to see C++ coded programs become OpenCL soon. I'm told that the process is quite straight forward. It won't be long now as there are more hardware support (APU) and proper API availability.

Then I hope, we can encode instantly and won't have to worry about how long it takes.
 
Joined
Nov 13, 2004
Messages
455 (0.06/day)
Location
Canada/quebec/Montreal
System Name Custom DIY
Processor Intel i7 2600K @ 4.8 Turbo 1.4v
Motherboard Asus P8Z68-V Pro 8801
Cooling XSPC RS240 + 120mm Rad/fan
Memory Corsair 1866 Vangence 9-10-9-27-2T
Video Card(s) 2X EVGA GTX570 SLI
Storage OCZ Revodrive 110GB + 2x1TB seagate
Display(s) ASUS MT276HE
Case CoolerMaster Sniper
Audio Device(s) X-Fi Titanium Fatal1ty Professional Series
Power Supply ANTEC TP-750 Blue 750Watts
Software win7 64
Benchmark Scores http://3dmark.com/3dm11/1347866
Thanks for the feedback.

I'm really glad that Nvidia has made the initiative of GPGPU. I think to some extend has triggered the openCL and directcompute initiate. Both API are the way to go for software dev as there is no restriction in the hardware.

I hope to see C++ coded programs become OpenCL soon. I'm told that the process is quite straight forward. It won't be long now as there are more hardware support (APU) and proper API availability.

Then I hope, we can encode instantly and won't have to worry about how long it takes.

openCL and directcompute and the like are in their infancy. now that both side support those we should see more and more software using it..
 

jamsbong

New Member
Joined
Mar 17, 2010
Messages
83 (0.02/day)
System Name 2500Kjamsbong
Processor Core i5 2500K @ 4.6Ghz
Motherboard Asrock Extreme 4 Z68
Cooling Zalman Reserator (CPU and GPU)
Memory DDR3 8GB
Video Card(s) EVGA Nvidia 560Ti 1GB
Storage 60GB Kingston SSD
Display(s) 24" Dell IPS
Case CoolerMaster 690 Advanced II
Audio Device(s) on-board
Power Supply Zalman ZM-600HP modular 600watt
Software Windows 7
Yes, OpenCL is new. I mean it is only version 1.1 now. however, OpenCL has a direct C++ wrapper. I've seen the research community transporting codes into GPU using OpenCL and have demonstrated that even a GF310 is able to perform better than a core-i7 CPU by up to 30x.

The double precision bit is slower but still much faster than CPU. The reality is that OpenCL is supported by both CPU and GPU regardless of vendors and it even works on mobile devices like smartphones and tablets. On the other hand, CUDA only works with Nvidia GPU, not even CPU.

There is a whole lot of opportunities for software developers to start selling software with "instant completion time" processes. CUDA may be mature but OpenCL will easily catch up from learning off CUDA. Sorry Nvidia fans... thats reality. I personally praise Nvidia for making all these efforts.
 
Joined
Aug 29, 2005
Messages
7,493 (1.04/day)
Location
Stuck somewhere in the 80's Jpop era....
System Name Lynni PS \ Lenowo TwinkPad L14 G2
Processor AMD Ryzen 7 7700 Raphael (Waiting on 9800X3D) \ i5-1135G7 Tiger Lake-U
Motherboard ASRock B650M PG Riptide Bios v. 3.10 AMD AGESA 1.2.0.2a \ Lenowo BDPLANAR Bios 1.68
Cooling Noctua NH-D15 Chromax.Black (Only middle fan) \ Lenowo C-267C-2
Memory G.Skill Flare X5 2x16GB DDR5 6000MHZ CL36-36-36-96 AMD EXPO \ Willk Elektronik 2x16GB 2666MHZ CL17
Video Card(s) Sapphire PURE AMD Radeon™ RX 9070 Gaming OC 16GB | Intel® Iris® Xe Graphics
Storage Gigabyte M30 1TB|Sabrent Rocket 2TB| HDD: 10TB|1TB \ WD RED SN700 1TB
Display(s) KTC M27T20S 1440p@165Hz | LG 48CX OLED 4K HDR | Innolux 14" 1080p
Case Asus Prime AP201 White Mesh | Lenowo L14 G2 chassis
Audio Device(s) Steelseries Arctis Pro Wireless
Power Supply Be Quiet! Pure Power 12 M 750W Goldie | 65W
Mouse Logitech G305 Lightspeedy Wireless | Lenowo TouchPad & Logitech G305
Keyboard Ducky One 3 Daybreak Fullsize | L14 G2 UK Lumi
Software Win11 IoT Enterprise 24H2 UK | Win11 IoT Enterprise LTSC 24H2 UK / Arch (Fan)
Benchmark Scores 3DMARK: https://www.3dmark.com/3dm/89434432? GPU-Z: https://www.techpowerup.com/gpuz/details/v3zbr
Top