• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Post your Tiny Memory Benchmark results

acraft

New Member
Joined
Oct 22, 2020
Messages
5 (0.00/day)
Hi,

I recently bought AMD Ryzen 9 3950x - 32 GB system and I found that some of my applications are running 2x slower on this machine comparing with Intel i7-8750H 6 core - 32 GB system. 3950x's performance results are consistent what I see on the website. For example, AIDA64 cache and memory results are consistent. I tried to figure out why 3950x performing 2x slower and I found that some memory operations are much slower than intel.

I used tiny memory benchmark to figure out what is going on. i7-8750H 6-core CPU with 32 GB ram results are much higher than 3950x. I sometime think why AMD 3950 much cheaper than Intel and it looks they cut the power of some of the instructions to reduce the cost.

Here how you can compile and run the tiny memory benchmark under linux. (Or with WSL2 under windows also works) Instructions:

$ sudo apt update
$ sudo apt install clang make git
$ mkdir tmb
$ cd tmb
$ git clone https://github.com/ssvb/tinymembench .
$ CC=clang CFLAGS="-no-integrated-as" make
$ ./tinymembench


AMD Ryzen 9 3950x - 32 GB Ram

AMD 3950x Results.png


Here is the intel i7-8750H - 32 GB Ram Results:

Intel i7-8750H Results.png


Can you please run and post your results here.

AMD Ryzen 9 3950X System Spec:


AMD-ROG-1.png

AMD-ROG-2.png

AMD-ROG-3.png

AMD-ROG-4.png

AMD-ROG-5.png



INTEL i7-8750H System Spec:


Intel-ROG-1.jpg

Intel-ROG-2.jpg

Intel-ROG-3.jpg

Intel-ROG-4.jpg

Intel-ROG-5.jpg


Thanks!
 
Last edited:
Joined
Nov 7, 2009
Messages
4,473 (0.85/day)
Location
Denmark
System Name The work PC /2700x/5950x
Processor 3900X stock/ 2700x stock/ 5950x 4200 MHz fixed @ 1,056-1,08V
Motherboard Gigabyte AORUS Master X570/2xMSI X470 M7 AC
Cooling Custom WC XSPC RX480, Laing DDC, XSPC Laing DDC Top V3 and EK Velocity/NH15/NH-U12S SE
Memory 32 GB Viper 3600/14 /16 GB Trident Z F4-4000C18D-16GTZSW 3600 /32 GB G Skill Flare CL14 3400
Video Card(s) 2070 Super X MSI/GTX 970 MSI/ GTX 970 MSI
Storage 1 TB SSD+500 GB NVMe / 500 GB SSD/ 500 GB SSD
Display(s) Dell UltraSharp U2518D/2408WFP
Case Corsair 800D / Lian test bench/NZXT 500
Power Supply AX 850 Titanium/AX 860i/AX 760
Software Dual boot/Win 7 & 10 / Linux / Win 10
If you provide your settings for your two sets (speed and timings) I will consider. In general no post considering performance will be answered unless you fill in your system specification.
 

phill

Moderator
Staff member
Joined
Jun 8, 2011
Messages
15,973 (3.40/day)
Location
Somerset, UK
System Name Not so complete or overkill - There are others!! Just no room to put! :D
Processor Ryzen Threadripper 3970X
Motherboard Asus Zenith 2 Extreme Alpha
Cooling Lots!! Dual GTX 560 rads with D5 pumps for each rad. One rad for each component
Memory Viper Steel 4 x 16GB DDR4 3600MHz not sure on the timings... Probably still at 2667!! :(
Video Card(s) Asus Strix 3090 with front and rear active full cover water blocks
Storage I'm bound to forget something here - 250GB OS, 2 x 1TB NVME, 2 x 1TB SSD, 4TB SSD, 2 x 8TB HD etc...
Display(s) 3 x Dell 27" S2721DGFA @ 7680 x 1440P @ 144Hz or 165Hz - working on it!!
Case The big Thermaltake that looks like a Case Mods
Audio Device(s) Onboard
Power Supply EVGA 1600W T2
Mouse Corsair thingy
Keyboard Razer something or other....
VR HMD No headset yet
Software Windows 11 OS... Not a fan!!
Benchmark Scores I've actually never benched it!! Too busy with WCG and FAH and not gaming! :( :( Not OC'd it!! :(
IT does seem to be that the 3 series of AMD wasn't the best for handling memory timings etc so the results aren't much of a shock. Intel's CPU clock speed pushes them in first place for that...

You'll need to put a little more information down for what cooling, RAM, motherboards you use for the fact that there's so many variences that the test would be pointless unless we know what your timings etc were. Can you not put a CPU-Z CPU, memory and motherboard tab with the results for better clarity?
 
Joined
Mar 26, 2012
Messages
221 (0.05/day)
System Name Mixed Bag of OC
Processor AMD Ryzen 5800X3D
Motherboard Maxsun MS-iCraft B550M WIFI
Cooling CPU+GPU on Water with 3 X 420 Rad´s
Memory 32GB Patriot Viper RGB @ 3800 Mhz CL14
Video Card(s) XFX Merc 310 RX 7900 XTX
Storage 2TB Kingston Fury + 2TB Samsung PCIe 4 NVME
Display(s) Philips 48OLED806
Case Selfmade Huuuuuge *Case* :)
Audio Device(s) ifi Zen DAC + Monoprice M1060C & Burmester Replica AMP + Selfmade Huuuuuge Speakers :)
Power Supply Seasonic PRIME TX-750
Mouse Kensington Slimblade (main device) + Razer Basilisk V3 (for FPS)
Keyboard Sharkoon PureWriter RGB, Kailh Blue switches
VR HMD None
Software Windows 11
Benchmark Scores do not matter, my PC is fast :)
3600XT + 32GB @ 3753Mhz CL16 ... Read will be full speed but all Write Operations should be half of a 3900x because of cut off CCX
tinymem.JPG
 
Joined
Jan 8, 2017
Messages
8,931 (3.35/day)
System Name Good enough
Processor AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard ASRock B650 Pro RS
Cooling 2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory 32GB - FURY Beast RGB 5600 Mhz
Video Card(s) Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage 1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) LG UltraGear 32GN650-B + 4K Samsung TV
Case Phanteks NV7
Power Supply GPS-750C
This is to be expected, each CCX can read 32 bytes/clock or write 16 bytes/clock. In total the two CCXs can achieve the same write performance like any other Intel CPU just not using a single thread because then the instructions are issued from just one CCX and therefore limited to 16 bytes/clock. I suspect AIDA64 is multi-threaded and this benchmark isn't, you need multiple threads (that are scheduled on different CCXs) to get the full throughout. It wasn't really a cost saving measure, it's just how the I/O was configured.
 
Last edited:
Joined
Jul 11, 2015
Messages
627 (0.20/day)
System Name Harm's Rig's
Processor 5950X /2700x / AMD 8370e 4500
Motherboard ASUS DARK HERO / ASRock B550 Phantom Gaming 4
Cooling Enermax LIQMAX III ARGB 360 AIO/ Zalman cooler fan 110mm
Memory Patriot Viper Steel DDR4 16GB (4x 8GB) 4000M TRIDENT Z F-43600V15D-16GTZ /G.SKILL DDR4
Video Card(s) ZOTAC AMP EXTREME AIRO 4090 / 1080 Ti /290X CFX
Storage SAMSUNG 980 PRO SSD 1TB/ WD DARK 770 2TB , Sabrent NVMe 512GB / 1 SSD 250GB / 1 HHD 3 TB
Display(s) Thermal Grizzly WireView / TCL 646 55 TV / 50 Xfinity Hisense A6 XUMO TV
Case TT 37 VIEW 200MM'S/ NZXT Tempest custom
Audio Device(s) Sharp Aquos
Power Supply FSP Hydro PTM PRO 1200W ATX 3.0 PCI-E GEN-5 80 Plus Platinum - EVGA 1300G2/Corsair w750
Mouse G502
Keyboard G413
Try this !
Capturetrythid.PNG
Capture5900x.PNG
 

Attachments

  • CaptureGOTYALL.PNG
    CaptureGOTYALL.PNG
    915.3 KB · Views: 74
Last edited:
Joined
Jul 14, 2006
Messages
2,416 (0.37/day)
Location
People's Republic of America
System Name It's just a computer
Processor i9-14900K Direct Die
Motherboard MSI Z790 ACE MAX
Cooling Dual D5T Vario, XSPC BayRes, Nemesis GTR560, NF-A14-iPPC3000PWM, NF-A14-iPPC2000, HK IV Pro Nickel
Memory G.SKILL F5-7200J3646F24GX2-TZ5RK
Video Card(s) eVGA RTX2080 FTW3 Ultra
Storage Samsung 990 PRO 1TB M.2
Display(s) LG 32GK650F
Case Thermaltake Xaser VI
Audio Device(s) Auzentech X-Meridian 7.1 2G/Z-5500
Power Supply Seasonic Prime PX-1300
Mouse Logitech
Keyboard Logitech
Software Win11PRO

acraft

New Member
Joined
Oct 22, 2020
Messages
5 (0.00/day)
This is to be expected, each CCX can read 32 bytes/clock or write 16 bytes/clock. In total the two CCXs can achieve the same write performance like any other Intel CPU just not using a single thread because then the instructions are issued from just one CCX and therefore limited to 16 bytes/clock. I suspect AIDA64 is multi-threaded and this benchmark isn't, you need multiple threads (that are scheduled on different CCXs) to get the full throughout. It wasn't really a cost saving measure, it's just how the I/O was configured.

The application I used (not tiny memory benchmark) is multi-threaded. I tried on Intel 6-core system and Intel 32 core system. It's very scalable on Intel based systems. The application is 100% scalable, no disk operations, no wait on synchronizations etc.. i7-8750H runs 2x faster than AMD 3950x. It scales as expected on intel core system without any issues.

Here perf stat result which could be helpful:
Screen Shot 2020-10-22 at 4.23.20 PM.png


Just look at how bad the IPC is on AMD.
 
Joined
Jul 11, 2015
Messages
627 (0.20/day)
System Name Harm's Rig's
Processor 5950X /2700x / AMD 8370e 4500
Motherboard ASUS DARK HERO / ASRock B550 Phantom Gaming 4
Cooling Enermax LIQMAX III ARGB 360 AIO/ Zalman cooler fan 110mm
Memory Patriot Viper Steel DDR4 16GB (4x 8GB) 4000M TRIDENT Z F-43600V15D-16GTZ /G.SKILL DDR4
Video Card(s) ZOTAC AMP EXTREME AIRO 4090 / 1080 Ti /290X CFX
Storage SAMSUNG 980 PRO SSD 1TB/ WD DARK 770 2TB , Sabrent NVMe 512GB / 1 SSD 250GB / 1 HHD 3 TB
Display(s) Thermal Grizzly WireView / TCL 646 55 TV / 50 Xfinity Hisense A6 XUMO TV
Case TT 37 VIEW 200MM'S/ NZXT Tempest custom
Audio Device(s) Sharp Aquos
Power Supply FSP Hydro PTM PRO 1200W ATX 3.0 PCI-E GEN-5 80 Plus Platinum - EVGA 1300G2/Corsair w750
Mouse G502
Keyboard G413
Joined
Nov 22, 2014
Messages
91 (0.03/day)
System Name I could say remaining parts or something like that...
Processor i5 2500k @ 4,8 ghz/Xeon x5650
Motherboard Asus Z68 gene-z/G1 Assassin
Cooling 240mm radiator/212+
Memory 8gb ddr3-1333/16gb ddr3-1866
Video Card(s) Nitro RX460 4gb/2x RX570 Red Devil
Storage Many
Display(s) Samsung TV/Topsync 2560*1440
Audio Device(s) onboard
Power Supply Seasonic 620w/Corsair 860i
Software 10 64 bit ultimate
You should try to compare your memory using the same timings (ideally the same sticks). The Intel system is running tightier timings, this could explain at least some part of the difference...
 
Joined
Sep 10, 2016
Messages
809 (0.29/day)
Location
Riverwood, Skyrim
System Name Storm Wrought | Blackwood (HTPC)
Processor AMD Ryzen 9 5900x @stock | i7 2600k
Motherboard Gigabyte X570 Aorus Pro WIFI m-ITX | Some POS gigabyte board
Cooling Deepcool AK620, BQ shadow wings 3 High Spd, stock 180mm |BQ Shadow rock LP + 4x120mm Noctua redux
Memory G.Skill Ripjaws V 2x32GB 4000MHz | 2x4GB 2000MHz @1866
Video Card(s) Powercolor RX 6800XT Red Dragon | PNY a2000 6GB
Storage SX8200 Pro 1TB, 1TB KC3000, 850EVO 500GB, 2+8TB Seagate, LG Blu-ray | 120GB Sandisk SSD, 4TB WD red
Display(s) Samsung UJ590UDE 32" UHD monitor | LG CS 55" OLED
Case Silverstone TJ08B-E | Custom built wooden case (Aus native timbers)
Audio Device(s) Onboard, Sennheiser HD 599 cans / Logitech z163's | Edifier S2000 MKIII via toslink
Power Supply Corsair HX 750 | Corsair SF 450
Mouse Microsoft Pro Intellimouse| Some logitech one
Keyboard GMMK w/ Zelio V2 62g (78g for spacebar) tactile switches & Glorious black keycaps| Some logitech one
VR HMD HTC Vive
Software Win 10 Edu | Ubuntu 22.04
Benchmark Scores Look in the various benchmark threads
The application I used (not tiny memory benchmark) is multi-threaded. I tried on Intel 6-core system and Intel 32 core system. It's very scalable on Intel based systems. The application is 100% scalable, no disk operations, no wait on synchronizations etc.. i7-8750H runs 2x faster than AMD 3950x. It scales as expected on intel core system without any issues.

Here perf stat result which could be helpful:
View attachment 172991

Just look at how bad the IPC is on AMD.
The IPC isn't bad on 3000 series Ryzen at all, it seems more likely that this particular benchmark uses an instruction that Ryzen doesn't have and that's what makes all the difference here
 
Top