1. Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Multi Core PI @ LINPACK

Discussion in 'Overclocking & Cooling' started by ovidiutabla, Feb 9, 2013.

  1. Arctucas

    Arctucas

    Joined:
    Jul 14, 2006
    Messages:
    1,772 (0.59/day)
    Thanks Received:
    290
    I got 28 seconds, 30 ms for 80K on the previous version.

    What would you estimate my time should be for 360K on the new version?

    I let it run for approximately 10 minutes with no result.


    EDIT:

    I feel rather sheepish, I should let it run a few more seconds rather than being impatient:

    [​IMG]
     
    Last edited: May 2, 2013
  2. newtekie1

    newtekie1 Semi-Retired Folder

    Joined:
    Nov 22, 2005
    Messages:
    20,029 (6.15/day)
    Thanks Received:
    6,094
    Just by my quick math based on the times I'm getting as I increase, you're looking at over an hour to complete 360,000 decimal places.
     
    ovidiutabla says thanks.
    Crunching for Team TPU 50 Million points folded for TPU
  3. Mydog

    Mydog

    Joined:
    Mar 8, 2010
    Messages:
    473 (0.28/day)
    Thanks Received:
    566
    [​IMG]

    Had to try this one :)

    ok for a 24/7 summer OC :toast:
     

    Attached Files:

  4. ovidiutabla

    ovidiutabla

    Joined:
    Feb 21, 2008
    Messages:
    40 (0.02/day)
    Thanks Received:
    12
    Something like that. Just leave the benchmark running...
     
    Last edited: May 3, 2013
  5. Arctucas

    Arctucas

    Joined:
    Jul 14, 2006
    Messages:
    1,772 (0.59/day)
    Thanks Received:
    290
    So, is something wrong with result above?
     
  6. Mydog

    Mydog

    Joined:
    Mar 8, 2010
    Messages:
    473 (0.28/day)
    Thanks Received:
    566
    Tested 360.000 decimals with HT

    [​IMG]
     

    Attached Files:

    • mcpi.jpg
      mcpi.jpg
      File size:
      203.4 KB
      Views:
      401
  7. newtekie1

    newtekie1 Semi-Retired Folder

    Joined:
    Nov 22, 2005
    Messages:
    20,029 (6.15/day)
    Thanks Received:
    6,094
    Apparently not because it to my x6 about 20 minutes to finish.

    I guess it doesn't scale exactly exponentially like I thought.
     
    Crunching for Team TPU 50 Million points folded for TPU
  8. unclewebb

    unclewebb RealTemp Author

    Joined:
    Jun 1, 2008
    Messages:
    949 (0.41/day)
    Thanks Received:
    422
    Thanks for the multi-threaded benchmark. :toast:

    [​IMG]
     
  9. Aquinus

    Aquinus Resident Wat-man

    Joined:
    Jan 28, 2012
    Messages:
    6,460 (6.47/day)
    Thanks Received:
    2,188
    Location:
    Concord, NH
    I did a couple tests with my 3820 and threw the results into an OpenOffice spreadsheet to make some graphs out of it. Enjoy if anyone cares. :)

    It almost looks to me as if it completes in O(n log n) time as far as how many decimals per second get calculated on average for any given decimal length but the increasing number of elements is creating a linear increase in times, so it almost feels like something O(n + n log n) or O((n + n) log n) time if I were to take a guess. I'm not really up for getting more data and doing the math to confirm my hunch. That's also for just my 3820 with 4c/8t, I'm sure it scales differently on different hardware.
    [​IMG]
    [​IMG]
     

    Attached Files:

    Last edited: May 4, 2013
  10. ovidiutabla

    ovidiutabla

    Joined:
    Feb 21, 2008
    Messages:
    40 (0.02/day)
    Thanks Received:
    12
  11. Aquinus

    Aquinus Resident Wat-man

    Joined:
    Jan 28, 2012
    Messages:
    6,460 (6.47/day)
    Thanks Received:
    2,188
    Location:
    Concord, NH
    I feel that I should also note that crunching will get my CPU up to 72-74*C but even for 360 decimals my CPU barely broke 62*C fully loaded with this. Just an observation because crunching for the same amount of time makes that much more heat despite both applications loading the CPU to 100%.
     
  12. newtekie1

    newtekie1 Semi-Retired Folder

    Joined:
    Nov 22, 2005
    Messages:
    20,029 (6.15/day)
    Thanks Received:
    6,094
    Crunching likely uses more areas of the CPU, different instruction sets, better use of the cache, etc. because crunching is designed to be as efficient as possible. While this benchmark seem to be purposely inefficient to make the calculation take a lot longer than it should in order to get results that are more suited to a benchmark(several seconds instead of several ms).

    Also, for the LOLs:

    [​IMG]
     

    Attached Files:

    Last edited: May 4, 2013
    ovidiutabla says thanks.
    Crunching for Team TPU 50 Million points folded for TPU
  13. ovidiutabla

    ovidiutabla

    Joined:
    Feb 21, 2008
    Messages:
    40 (0.02/day)
    Thanks Received:
    12
    UI Update [logo with alpha channel]

    [​IMG]

    [​IMG]
     
  14. ovidiutabla

    ovidiutabla

    Joined:
    Feb 21, 2008
    Messages:
    40 (0.02/day)
    Thanks Received:
    12
    The benchmark is using a very complex formula to calculate decimals of PI.

    Bailey–Borwein–Plouffe formula

    The formula is:

    [​IMG]

    The algorithm is very complex, is slow, but i chose it because it's best suited for parallelization.

    The whole ideea was to develop a perfect multithreaded benchmark that can make use of all the cores available, not to implement the fastest algorithm to calculate PI.

    In 2013, meet Multi Core PI sire. Perfect parralelism for any number of decimals.

    I did NOT artificially make the task really slow, in fact, I didn't made anything that slows down the algorithm.

    Sure, the Multi Core PI algorithm was not optimized for speed but provide perfect parallelism and that was the whole ideea:

    [​IMG]
     
    Last edited: May 10, 2013
    newtekie1 says thanks.
  15. newtekie1

    newtekie1 Semi-Retired Folder

    Joined:
    Nov 22, 2005
    Messages:
    20,029 (6.15/day)
    Thanks Received:
    6,094
    Thanks for the explanation.

    I wasn't knocking you, you achieved exactly what you set out to do and it makes a great benchmark.
     
    Crunching for Team TPU 50 Million points folded for TPU
  16. ovidiutabla

    ovidiutabla

    Joined:
    Feb 21, 2008
    Messages:
    40 (0.02/day)
    Thanks Received:
    12
    Multi Core LINPACK Ultimate

    Meet Multi Core LINPACK Ultimate!

    A multithreaded CPU benchmark that performs numerical linear algebra. It makes use of the BLAS (Basic Linear Algebra Subprograms) libraries for performing basic vector and matrix operations.

    The benchmark is written in C# / WPF [The User Interface], C++ [The Core Algorithm] and provide excellent parallelism.

    [​IMG]

    How it works

    Default setting for benchmark is a Matrix size of 4000. Just hit <Run benchmark> button to start benching your CPU.

    Submit to HWBOT

    First, press <Submit to HWBOT> button. A screenshot of the entire screen and a crypted XML datafile will be created. Attention! CPUZ must be running!
    Second, follow the link provided on the dialog and submit your datafile to HWBOT.

    HWBOT

    http://hwbot.org/benchmark/multi_core_linpack_ultimate/

    Supported operating systems

    Microsoft Windows XP / Server 2003
    Microsoft Windows Vista / 7
    Microsoft Windows 8 / Server 2012

    Website

    http://www.pcgamingxtreme.ro/multi-core-linpack-ultimate/

    Download Link

    http://www.pcgamingxtreme.ro/forum/download/file.php?id=690
     
  17. ovidiutabla

    ovidiutabla

    Joined:
    Feb 21, 2008
    Messages:
    40 (0.02/day)
    Thanks Received:
    12
  18. Arctucas

    Arctucas

    Joined:
    Jul 14, 2006
    Messages:
    1,772 (0.59/day)
    Thanks Received:
    290
    [​IMG]
     
  19. Feänor

    Feänor New Member

    Joined:
    Oct 4, 2005
    Messages:
    512 (0.15/day)
    Thanks Received:
    238
    Poor little g540...
     

    Attached Files:

  20. TRWOV

    TRWOV

    Joined:
    Aug 11, 2011
    Messages:
    3,557 (3.04/day)
    Thanks Received:
    2,126
    Location:
    Mexico
    [​IMG]
     
    Crunching for Team TPU
  21. cheesy999

    Joined:
    Jul 2, 2010
    Messages:
    3,919 (2.49/day)
    Thanks Received:
    581
    Location:
    UK
    I suspect this benchmark might like Intel processors a little bit more than AMD. Unless I'm reading it wrong.

    [​IMG]
     
  22. agent00skid

    Joined:
    May 6, 2012
    Messages:
    569 (0.63/day)
    Thanks Received:
    387
    Seems to run fine on my AMD processor.

    [​IMG]
     
    Crunching for Team TPU
  23. TRWOV

    TRWOV

    Joined:
    Aug 11, 2011
    Messages:
    3,557 (3.04/day)
    Thanks Received:
    2,126
    Location:
    Mexico
    It does give unconsistent results, I give you that. agent00skid's A6-3500 APU gets better times than your unlocked X4 and it's a triple-core. I thought it might be related to instruction sets but the Phenom II and Llano support the same instructions.

    Maybe memory bandwidth plays a role too?


    edit: Maybe your X4 is throttling? Watch the CPU-Z readout while the benchmark is running.

    BTW OP, can we have a logo? Seeing the dull standard EXE icon on the desktop isn't cool.
     
    Last edited: May 19, 2013
    ovidiutabla says thanks.
    Crunching for Team TPU
  24. agent00skid

    Joined:
    May 6, 2012
    Messages:
    569 (0.63/day)
    Thanks Received:
    387
    My N830 at 1,5 Ghz in my laptop took twice as long. So on my end, it's seems to scale appropriately.
     
    Crunching for Team TPU
  25. cheesy999

    Joined:
    Jul 2, 2010
    Messages:
    3,919 (2.49/day)
    Thanks Received:
    581
    Location:
    UK
    I'm on single channel, we should explore this.
     

Currently Active Users Viewing This Thread: 1 (0 members and 1 guest)

Share This Page