techPowerUp! Forums

Go Back   techPowerUp! Forums > Hardware > Overclocking & Cooling

Reply
 
Thread Tools
Old Feb 9, 2013, 05:27 PM   #1
t.phase
25 Posts
 
t.phase's Avatar
 
Join Date: Feb 2008
Posts: 41 (0.02/day)
Thanks: 11
Thanked 12 Times in 10 Posts
Send a message via Yahoo to t.phase

Multi Core PI @ LINPACK

I developed a multithreaded CPU benchmark that calculates PI decimals using Bailey–Borwein–Plouffe formula. The benchmark is using a multithreaded algorithm written in C++ and provide excellent parallelism. Multi Core PI is written in Visual C++ using MFC and Win32API.

How it works

A slider will help you set the decimals of PI, from 10.000 to 100.000. Default is 80.000. Just hit Run benchmark button to start benching your CPU.

Submit to HWBOT

First, press Take Screenshot button. A screenshot and a XML datafile will be created. Attention! CPUZ must be running!
Second, follow the link provided on the dialog and submit your datafile to HWBOT.

Supported operating systems

Microsoft Windows XP / Server 2003
Microsoft Windows Vista / 7
Microsoft Windows 8 / Server 2012

Download link

http://www.pcgamingxtreme.ro/

Last edited by t.phase; Feb 9, 2013 at 05:50 PM.
t.phase is offline  
Reply With Quote
Old Feb 10, 2013, 07:49 AM   #2
HammerON
The Watchful Moderator
 
HammerON's Avatar
 
Join Date: Mar 2009
Location: The Cold North
Posts: 4,472 (2.85/day)
Thanks: 4,065
Thanked 2,999 Times in 1,578 Posts

System Specs

My results:


100% 12 thread utilization

Last edited by HammerON; Feb 10, 2013 at 07:54 AM.
HammerON is offline  
Crunching for Team TPU
Reply With Quote
Old Feb 10, 2013, 07:53 AM   #3
Aquinus
3500 Posts
 
Aquinus's Avatar
 
Join Date: Jan 2012
Location: Dover, New Hampshire, USA
Posts: 4,512 (8.86/day)
Thanks: 1,441
Thanked 1,425 Times in 1,065 Posts

System Specs

Quote:
Originally Posted by HammerON View Post
100% 12 thread utilization
I've explained this already multiple times and people seem too ignorant to listen and you're the last person I should need to explain this to.

Disable hyper-threading and run it again, please.
__________________
MyHeat
Aquinus is offline  
Crunching for Team TPU
Reply With Quote
Old Feb 10, 2013, 07:59 AM   #4
HammerON
The Watchful Moderator
 
HammerON's Avatar
 
Join Date: Mar 2009
Location: The Cold North
Posts: 4,472 (2.85/day)
Thanks: 4,065
Thanked 2,999 Times in 1,578 Posts

System Specs



Wow - that was amazing
My time was increased by almost 100%.... What else would I expect when disabling HT???
HammerON is offline  
Crunching for Team TPU
Reply With Quote
Old Feb 10, 2013, 08:03 AM   #5
Aquinus
3500 Posts
 
Aquinus's Avatar
 
Join Date: Jan 2012
Location: Dover, New Hampshire, USA
Posts: 4,512 (8.86/day)
Thanks: 1,441
Thanked 1,425 Times in 1,065 Posts

System Specs

Quote:
Originally Posted by HammerON View Post
http://img.techpowerup.org/130210/Capture113.jpg

Wow - that was amazing
My time was increased by almost 100%.... What else would I expect when disabling HT???
Well that confuses me even more. I disable HT on mine and my score goes from 18.5 to 19. :|

HT should never result in 100% improvement. There aren't the resources available to let it scale like that. That should be more like a 15-30% drop in performance on average.

Edit: I lied that was Multi Core PRIME not MC PI, they look exactly the same sans the formula so I didn't notice it off the bat. My skepticism from PRIME worked its way over here. Either way I disabled HT and now it runs slower by about 60%. That's a bit more normal. I'm less skeptical about this benchmark and more about the prime one (unless your storing the output in a float or a double and not a fixed point number, in that case the computer is chugging for nothing). Since floating point numbers are not exact and as you go more decimals in, the precision of further decimals decreases.

4c w/ HT:


4c w/o HT:


Once again is the output being verified? Can you do multiple runs per benchmark to make sure that every runs results are consistent and once again, I would like output so I can verify the benchmarks results so I can put my skepticism at ease. As it stands, something is happening on my rig and I don't know what it is or if it is right.
Attached Thumbnails
Click image for larger version

Name:	p8t.jpg
Views:	705
Size:	145.6 KB
ID:	50010   Click image for larger version

Name:	p4t.jpg
Views:	692
Size:	146.6 KB
ID:	50011  
__________________
MyHeat

Last edited by Aquinus; Feb 10, 2013 at 08:28 AM.
Aquinus is offline  
Crunching for Team TPU
Reply With Quote
Old Feb 10, 2013, 09:05 AM   #6
uuuaaaaaa
75 Posts
 
Join Date: Sep 2011
Posts: 136 (0.21/day)
Thanks: 3
Thanked 14 Times in 13 Posts

System Specs

My Phenom II x6 is slow
Attached Thumbnails
Click image for larger version

Name:	pHiI.png
Views:	76
Size:	150.4 KB
ID:	50012  

Last edited by uuuaaaaaa; Feb 10, 2013 at 09:07 AM. Reason: Forgot the screenshot
uuuaaaaaa is offline  
Reply With Quote
Old Feb 10, 2013, 09:16 AM   #7
AphexDreamer
Eligible for custom title
 
AphexDreamer's Avatar
 
Join Date: Jun 2007
Location: C:\Program Files (x86)\Aphexdreamer\
Posts: 5,617 (2.56/day)
Thanks: 812
Thanked 888 Times in 674 Posts

System Specs

Quote:
Originally Posted by uuuaaaaaa View Post
My Phenom II x6 is slow
Faster than my FX6100 apparently...
__________________
Sent from my PC using chrome.
AphexDreamer is offline  
Reply With Quote
Old Feb 10, 2013, 09:37 AM   #8
Aquinus
3500 Posts
 
Aquinus's Avatar
 
Join Date: Jan 2012
Location: Dover, New Hampshire, USA
Posts: 4,512 (8.86/day)
Thanks: 1,441
Thanked 1,425 Times in 1,065 Posts

System Specs

Quote:
Originally Posted by AphexDreamer View Post
Faster than my FX6100 apparently...
http://img.techpowerup.org/130209/Mu...creenShot.jpeg
It's because the FPU is getting used for this benchmark. Keep in mind that each module only has one FPU so without FMA3 optimizations you're only going to see 3-cores worth of performance out of it. However if this used fixed point instead of floating point, this could use the integer cores which are faster in general and performances significantly better on AMD's newer processors. Fixed point also offers a higher level of precision, floating point is inaccurate because of how it converts decimals to and from base 2 integers.
__________________
MyHeat
Aquinus is offline  
Crunching for Team TPU
Reply With Quote
Old Feb 10, 2013, 10:21 AM   #9
AphexDreamer
Eligible for custom title
 
AphexDreamer's Avatar
 
Join Date: Jun 2007
Location: C:\Program Files (x86)\Aphexdreamer\
Posts: 5,617 (2.56/day)
Thanks: 812
Thanked 888 Times in 674 Posts

System Specs

Quote:
Originally Posted by Aquinus View Post
It's because the FPU is getting used for this benchmark. Keep in mind that each module only has one FPU so without FMA3 optimizations you're only going to see 3-cores worth of performance out of it. However if this used fixed point instead of floating point, this could use the integer cores which are faster in general and performances significantly better on AMD's newer processors. Fixed point also offers a higher level of precision, floating point is inaccurate because of how it converts decimals to and from base 2 integers.
Which is why I had asked him if he would/could make a more FX optimized benchmark but he said it is FX optimized as it was coded with an FX processor. http://www.techpowerup.com/forums/sh...5&postcount=68
__________________
Sent from my PC using chrome.
AphexDreamer is offline  
Reply With Quote
Old Feb 10, 2013, 11:30 AM   #10
uuuaaaaaa
75 Posts
 
Join Date: Sep 2011
Posts: 136 (0.21/day)
Thanks: 3
Thanked 14 Times in 13 Posts

System Specs

Quote:
Originally Posted by AphexDreamer View Post
Faster than my FX6100 apparently...
http://img.techpowerup.org/130209/Mu...creenShot.jpeg
I wasn't expecting this. You also have a much higher clock.
uuuaaaaaa is offline  
Reply With Quote
Old Feb 10, 2013, 12:08 PM   #11
Bo$$
3500 Posts
 
Bo$$'s Avatar
 
Join Date: May 2009
Location: London, UK
Posts: 3,694 (2.45/day)
Thanks: 444
Thanked 782 Times in 688 Posts

System Specs



Maybe looks a little low here
__________________

Live in the UK? Join the UK TPU Group!
Caution! This is Sparta!
Im Looking For...
Im Selling
Bo$$ is online now  
Reply With Quote
Old Feb 10, 2013, 12:16 PM   #12
Melvis
2000 Posts
 
Melvis's Avatar
 
Join Date: Mar 2008
Location: Australia
Posts: 2,987 (1.56/day)
Thanks: 619
Thanked 471 Times in 392 Posts

System Specs

Attached Thumbnails
Click image for larger version

Name:	Multi core Pi.jpg
Views:	61
Size:	171.9 KB
ID:	50015  
Melvis is offline  
Reply With Quote
Old Feb 10, 2013, 07:21 PM   #13
CrackerJack
2000 Posts
 
CrackerJack's Avatar
 
Join Date: Dec 2007
Location: East TN
Posts: 2,392 (1.19/day)
Thanks: 408
Thanked 439 Times in 342 Posts

System Specs

CrackerJack is offline  
Reply With Quote
Old Feb 10, 2013, 08:12 PM   #14
Arctucas
1000 Posts
 
Arctucas's Avatar
 
Join Date: Jul 2006
Location: Cowtown, USSofA
Posts: 1,634 (0.65/day)
Thanks: 60
Thanked 278 Times in 219 Posts

System Specs

__________________
Whiskey-Tango-Foxtrot. over.
Arctucas is offline  
Reply With Quote
Old Feb 10, 2013, 08:49 PM   #15
LAN_deRf_HA
3500 Posts
 
LAN_deRf_HA's Avatar
 
Join Date: Apr 2008
Posts: 4,079 (2.14/day)
Thanks: 295
Thanked 853 Times in 595 Posts

System Specs

LAN_deRf_HA is offline  
Reply With Quote
Old Feb 10, 2013, 09:22 PM   #16
lemonadesoda
Eligible for custom title
 
lemonadesoda's Avatar
 
Join Date: Aug 2006
Posts: 5,347 (2.15/day)
Thanks: 750
Thanked 960 Times in 710 Posts

System Specs

Great x86 kernel 5.x compatible!

I think a REALLY USEFUL statistic would be the time / cores / GHz so that we can see the "efficiency" of the FP core!

lemonadesoda is offline  
Reply With Quote
Old Feb 11, 2013, 09:33 AM   #17
t.phase
25 Posts
 
t.phase's Avatar
 
Join Date: Feb 2008
Posts: 41 (0.02/day)
Thanks: 11
Thanked 12 Times in 10 Posts
Send a message via Yahoo to t.phase

t.phase is offline  
Reply With Quote
Old Feb 14, 2013, 09:24 AM   #18
t.phase
25 Posts
 
t.phase's Avatar
 
Join Date: Feb 2008
Posts: 41 (0.02/day)
Thanks: 11
Thanked 12 Times in 10 Posts
Send a message via Yahoo to t.phase

I have implemented encryption for the XML datafile! Now cheaters can't cheat anymore.

Current version is 2.101

Download link:

http://www.pcgamingxtreme.ro/



__________________
Mobo: ASRock Fatal1ty Z77 Performance | CPU: Intel Core i5-3570K | RAM: 8GB Kingston HyperX 1866Mhz | GPU: Sapphire Radeon HD 7970 | Intel SSD 330 Series 120GB | HDD: Seagate Barracuda 2TB | PSU: Cooler Masters GX 750W | Case: Zalman GS1000 SE | Periferice: Microsoft Sidewinder x4 | Razer Imperator | Razer Destructor | Display: Panasonic 50VT20 3D
t.phase is offline  
Reply With Quote
Old Feb 14, 2013, 11:29 AM   #19
rickss69
2000 Posts
 
rickss69's Avatar
 
Join Date: Aug 2009
Location: Rockvale TN (Not Australia)
Posts: 2,335 (1.67/day)
Thanks: 545
Thanked 613 Times in 480 Posts

System Specs

__________________
rickss69 is offline  
Reply With Quote
Old Feb 14, 2013, 07:23 PM   #20
rickss69
2000 Posts
 
rickss69's Avatar
 
Join Date: Aug 2009
Location: Rockvale TN (Not Australia)
Posts: 2,335 (1.67/day)
Thanks: 545
Thanked 613 Times in 480 Posts

System Specs

__________________
rickss69 is offline  
Reply With Quote
Old Feb 14, 2013, 07:54 PM   #21
cadaveca
My name is Dave
 
cadaveca's Avatar
 
Join Date: Apr 2006
Location: The Great White North
Posts: 10,932 (4.16/day)
Thanks: 4,718
Thanked 5,464 Times in 3,308 Posts

System Specs

Quote:
Originally Posted by Aquinus View Post
It's because the FPU is getting used for this benchmark. Keep in mind that each module only has one FPU so without FMA3 optimizations you're only going to see 3-cores worth of performance out of it. However if this used fixed point instead of floating point, this could use the integer cores which are faster in general and performances significantly better on AMD's newer processors. Fixed point also offers a higher level of precision, floating point is inaccurate because of how it converts decimals to and from base 2 integers.
PD emulates x87 entirely, hence the slowdown, IMHO. FPU doesn't matter when you aren't capable of running the instruction in the first place.
__________________
Gadgets, Phones, Tablets, Cameras, TVs, HiFi...NextPowerUp


-Only real men play games THIS way.
cadaveca is offline  
Reply With Quote
Old Feb 15, 2013, 07:33 AM   #22
t.phase
25 Posts
 
t.phase's Avatar
 
Join Date: Feb 2008
Posts: 41 (0.02/day)
Thanks: 11
Thanked 12 Times in 10 Posts
Send a message via Yahoo to t.phase

I removed the slider.

Default setting for benchmark is 80.000 decimals. The target is to submit to HWBOT and we have to make sure that all users are benching at the same settings [80k decimals]

Download Link:

www.pcgamingxtreme.ro

__________________
Mobo: ASRock Fatal1ty Z77 Performance | CPU: Intel Core i5-3570K | RAM: 8GB Kingston HyperX 1866Mhz | GPU: Sapphire Radeon HD 7970 | Intel SSD 330 Series 120GB | HDD: Seagate Barracuda 2TB | PSU: Cooler Masters GX 750W | Case: Zalman GS1000 SE | Periferice: Microsoft Sidewinder x4 | Razer Imperator | Razer Destructor | Display: Panasonic 50VT20 3D

Last edited by t.phase; Feb 15, 2013 at 12:28 PM.
t.phase is offline  
Reply With Quote
Old Feb 15, 2013, 07:49 AM   #23
Aquinus
3500 Posts
 
Aquinus's Avatar
 
Join Date: Jan 2012
Location: Dover, New Hampshire, USA
Posts: 4,512 (8.86/day)
Thanks: 1,441
Thanked 1,425 Times in 1,065 Posts

System Specs

Quote:
Originally Posted by cadaveca View Post
PD emulates x87 entirely, hence the slowdown, IMHO.
Pardon me, I know what x87 is but I don't know what you mean when you say "PD", could you clarify?
Quote:
Originally Posted by cadaveca View Post
FPU doesn't matter when you aren't capable of running the instruction in the first place.
I agree but do we know that the benchmark isn't executing x87 instructions in the first place?

Also floating point emulation is worse than just using floating point numbers to begin with. You really need the exact value if you want your result of pi to be at all accurate. As that decimal place goes further out you're going to start losing precision.
__________________
MyHeat
Aquinus is offline  
Crunching for Team TPU
Reply With Quote
Old Feb 17, 2013, 08:28 AM   #24
t.phase
25 Posts
 
t.phase's Avatar
 
Join Date: Feb 2008
Posts: 41 (0.02/day)
Thanks: 11
Thanked 12 Times in 10 Posts
Send a message via Yahoo to t.phase

Quote:
Originally Posted by Aquinus View Post
Pardon me, I know what x87 is but I don't know what you mean when you say "PD", could you clarify?

I agree but do we know that the benchmark isn't executing x87 instructions in the first place?

Also floating point emulation is worse than just using floating point numbers to begin with. You really need the exact value if you want your result of pi to be at all accurate. As that decimal place goes further out you're going to start losing precision.
The application is compiled using Streaming SIMD Extensions 2 (/arch:SSE2) setting in order to replace FPU instructions with SSE code.
__________________
Mobo: ASRock Fatal1ty Z77 Performance | CPU: Intel Core i5-3570K | RAM: 8GB Kingston HyperX 1866Mhz | GPU: Sapphire Radeon HD 7970 | Intel SSD 330 Series 120GB | HDD: Seagate Barracuda 2TB | PSU: Cooler Masters GX 750W | Case: Zalman GS1000 SE | Periferice: Microsoft Sidewinder x4 | Razer Imperator | Razer Destructor | Display: Panasonic 50VT20 3D
t.phase is offline  
Reply With Quote
Old Feb 17, 2013, 08:34 AM   #25
Aquinus
3500 Posts
 
Aquinus's Avatar
 
Join Date: Jan 2012
Location: Dover, New Hampshire, USA
Posts: 4,512 (8.86/day)
Thanks: 1,441
Thanked 1,425 Times in 1,065 Posts

System Specs

Quote:
Originally Posted by t.phase View Post
The application is compiled using Streaming SIMD Extensions 2 (/arch:SSE2) setting in order to replace FPU instructions with SSE code.
SSE still utilizes the FPU, but that answers part of my question. I'm still curious what Cadaveca meant by "PD" though.
__________________
MyHeat
Aquinus is offline  
Crunching for Team TPU
Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Multi-core usage Jonesi13 General Software 4 Mar 31, 2013 01:17 PM
Multi-Core Prime t.phase Overclocking & Cooling 71 Feb 10, 2013 07:45 AM
Multi core registering as single core fixed D007 General Hardware 9 Sep 2, 2009 05:38 AM
multi core cpu registering as single core. D007 General Hardware 2 Aug 26, 2009 03:35 PM
Linpack only loads one core hat General Software 14 Jul 17, 2009 05:48 AM


All times are GMT. The time now is 12:24 PM.


Powered by vBulletin® Version 3.8.6
Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.
no new posts