• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

NVIDIA Launches the Tesla K40 GPU Accelerator

Joined
Dec 6, 2011
Messages
4,785 (2.14/day)
Likes
1,187
Location
Still on the East Side
#1
NVIDIA today unveiled the NVIDIA Tesla K40 GPU accelerator, the world's highest performance accelerator ever built, delivering extreme performance to a widening range of scientific, engineering, high performance computing (HPC) and enterprise applications.

Providing double the memory and up to 40 percent higher performance than its predecessor, the Tesla K20X GPU accelerator, and 10 times higher performance than today's fastest CPU, the Tesla K40 GPU is the world's first and highest-performance accelerator optimized for big data analytics and large-scale scientific workloads.





Featuring intelligent NVIDIA GPU Boost technology, which converts power headroom into a user-controlled performance boost, the Tesla K40 GPU accelerator enables users to unlock the untapped performance of a broad range of applications.

"GPU accelerators have gone mainstream in the HPC and supercomputing industries, enabling engineers and researchers to consistently drive innovation and scientific discovery," said Sumit Gupta, general manager of Tesla Accelerated Computing products at NVIDIA. "With the breakthrough performance and higher memory capacity of the Tesla K40 GPU, enterprise customers can quickly crunch through massive volumes of data generated by their big data analytics applications."

Ultimate Performance for Science, Big Data
Based on the NVIDIA Kepler compute architecture -- the highest performance, most efficient architecture ever built -- the Tesla K40 GPU accelerator surpasses all other accelerators on two common measures of computational performance: 4.29 teraflops single-precision and 1.43 teraflops double-precision peak floating point performance.

Key features of the Tesla K40 GPU accelerator include:
  • 12 GB of ultra-fast GDDR5 memory allows users to process 2X larger datasets, enabling them to rapidly analyze massive volumes of data.
  • 2,880 CUDA parallel processing cores deliver application acceleration by up to 10X compared to using a CPU alone.
  • Dynamic Parallelism enables GPU threads to dynamically spawn new threads, enabling users to quickly and easily crunch through adaptive and dynamic data structures.
  • PCIe Gen-3 interconnect support accelerates data movement by 2X compared to PCIe Gen-2 technology.

In a related announcement, the Texas Advanced Computing Center (TACC) at The University of Texas at Austin -- one of the leading advanced computing centers in the United States -- plans to deploy "Maverick," a new interactive, remote visualization and data analysis system powered by NVIDIA Tesla K40 GPU accelerators. Maverick is expected to be fully operational in January 2014.

"The Tesla K40 GPU accelerators will help researchers crunch through massive volumes of big data and gain new insights through large-scale, sophisticated visualizations," said Kelly Gaither, director of Visualization at TACC. "With NVIDIA GPUs, Maverick will provide researchers powerful interactive capabilities to advance their most complex scientific challenges."

he Tesla K40 GPU accelerates the broadest range of scientific, engineering, commercial and enterprise HPC and data center applications. Today, more than 240 software applications take advantage of GPU acceleration. The complete catalog of GPU-accelerated applications is available as a free download.

More information about the Tesla K40 GPU accelerator is available at NVIDIA booth 613 at SC13, Nov. 18-21, and on the NVIDIA high performance computing website. To learn more about CUDA or download the latest version, visit the CUDA website.

Users can also try the Tesla K40 GPU accelerator for free on remotely hosted clusters. Visit the GPU Test Drive website for more information.

Availability
Shipping today, the NVIDIA Tesla K40 GPU accelerator is available now and in the coming months from a variety of server manufacturers, including Appro, ASUS, Bull, Cray, Dell, Eurotech, HP, IBM, Inspur, SGI, Sugon, Supermicro and Tyan, as well as from NVIDIA reseller partners.
 
Joined
Jul 23, 2011
Messages
1,556 (0.66/day)
Likes
1,459
Location
Kaunas, Lithuania
System Name Eternal WIP ][ HP ProLiant server blade
Processor AMD FX-8320 @ 4.4/4.1 GHz ][ 2x Intel Xeon L5520 @ 2.27GHz
Motherboard Asus Crosshair V Formula-Z ][ -
Cooling Corsair H55 ][ Stock
Memory 2x4GB Kingston HyperX Genesis @ 1866MHz ][ 4x4 GiB Hynix @ 1333 MHz; ECC on
Video Card(s) 2x MSI GeForce GTX 770 DirectCU II OC ][ None
Storage 1TB WD Black + 1.5TB WD Green + 750GB Samsung + 64GB OCZ Agility 4 ][ 3x 1 TB Hitachi
Display(s) Asus VG278H + Asus VH226H ][ None
Case Antec Nine Hundred ][ it's a server blade, lol
Audio Device(s) Supreme FX II (Integrated); Using optical S/PDIF output ][ none
Power Supply Corsair AX1200i ][ [stock]
Software Gentoo Linux ][ Linux Mint 16
Benchmark Scores 65970 BogoMIPS. Ha. Ha. Ha. >=|
#2
clocks pls :laugh:
 
Joined
Nov 10, 2006
Messages
4,528 (1.11/day)
Likes
6,077
Location
Washington, US
System Name Lappy
Processor i7 6700k
Motherboard Sager NP9870
Cooling A lot smaller than I'd like
Memory Samsung 4x8GB DDR4-2133 SO-DIMM
Video Card(s) GTX 980 (MXM)
Storage 2xSamsung 950 Pro 256GB | 2xHGST 1TB 7.2K
Display(s) 17.3" IPS 1080p G-SYNC
Audio Device(s) Sound Blaster X-FI MB 5, Foster 2.1 channel integrated
Power Supply 330w Brick
Mouse Razer Deathadder
Keyboard 3-zone RGB integrated
Software Windows 10 Pro
Benchmark Scores Not slow
#3
Joined
Jul 23, 2011
Messages
1,556 (0.66/day)
Likes
1,459
Location
Kaunas, Lithuania
System Name Eternal WIP ][ HP ProLiant server blade
Processor AMD FX-8320 @ 4.4/4.1 GHz ][ 2x Intel Xeon L5520 @ 2.27GHz
Motherboard Asus Crosshair V Formula-Z ][ -
Cooling Corsair H55 ][ Stock
Memory 2x4GB Kingston HyperX Genesis @ 1866MHz ][ 4x4 GiB Hynix @ 1333 MHz; ECC on
Video Card(s) 2x MSI GeForce GTX 770 DirectCU II OC ][ None
Storage 1TB WD Black + 1.5TB WD Green + 750GB Samsung + 64GB OCZ Agility 4 ][ 3x 1 TB Hitachi
Display(s) Asus VG278H + Asus VH226H ][ None
Case Antec Nine Hundred ][ it's a server blade, lol
Audio Device(s) Supreme FX II (Integrated); Using optical S/PDIF output ][ none
Power Supply Corsair AX1200i ][ [stock]
Software Gentoo Linux ][ Linux Mint 16
Benchmark Scores 65970 BogoMIPS. Ha. Ha. Ha. >=|
#4
So in other words, the K40 is to the K20X as what the GTX 780ti is to the GTX TITAN, hardware-wise?
WOW. Not impressed much :(
 
Last edited:
Joined
Dec 16, 2010
Messages
1,484 (0.57/day)
Likes
544
System Name My Surround PC
Processor Intel Core i7 4770K @ 4.2 GHz (1.15 V)
Motherboard ASRock Z87 Extreme6
Cooling Swiftech MCP35X / XSPC Rasa CPU / Swiftech MCW82 / Koolance HX-1320 w/ 8 Scythe Fans
Memory 16GB (2 x 8 GB) Mushkin Blackline DDR3-2400 CL11-13-13-31
Video Card(s) MSI Nvidia GeForce GTX 980 Ti Armor 2X
Storage Samsung SSD 850 Pro 256GB, 2 x 4TB HGST NAS HDD in RAID 1
Display(s) 3 x Acer K272HUL 27" in Surround 7860x1440
Case NZXT Source 530
Audio Device(s) Integrated ALC1150 + Logitech Z-5500 5.1
Power Supply Seasonic X-1250 1.25kW
Mouse Gigabyte Aivia Krypton
Keyboard Logitech G15
Software Windows 8.1 Pro x64
#5
Well, there is one thing interesting about this. This card is the first time I've seen 4Gbit GDDR5 chips used. Get ready for another doubling of graphics memory next video card generation.
 
Joined
Nov 10, 2006
Messages
4,528 (1.11/day)
Likes
6,077
Location
Washington, US
System Name Lappy
Processor i7 6700k
Motherboard Sager NP9870
Cooling A lot smaller than I'd like
Memory Samsung 4x8GB DDR4-2133 SO-DIMM
Video Card(s) GTX 980 (MXM)
Storage 2xSamsung 950 Pro 256GB | 2xHGST 1TB 7.2K
Display(s) 17.3" IPS 1080p G-SYNC
Audio Device(s) Sound Blaster X-FI MB 5, Foster 2.1 channel integrated
Power Supply 330w Brick
Mouse Razer Deathadder
Keyboard 3-zone RGB integrated
Software Windows 10 Pro
Benchmark Scores Not slow
#6
So in other words, the K40 is to the K20X as what the GTX 780ti is to the GTX TITAN, hardware-wise?
WOW. Not impressed much :(
Exactly.
 
Joined
Sep 7, 2011
Messages
2,785 (1.20/day)
Likes
1,672
Location
New Zealand
System Name MoneySink
Processor 2600K @ 4.8
Motherboard P8Z77-V
Cooling AC NexXxos XT45 360, RayStorm, D5T+XSPC tank, Tygon R-3603, Bitspower
Memory 16GB Crucial Ballistix DDR3-1600C8
Video Card(s) GTX 780 SLI (EVGA SC ACX + Giga GHz Ed.)
Storage Kingston HyperX SSD (128) OS, WD RE4 (1TB), RE2 (1TB), Cav. Black (2 x 500GB), Red (4TB)
Display(s) Achieva Shimian QH270-IPSMS (2560x1440) S-IPS
Case NZXT Switch 810
Audio Device(s) onboard Realtek yawn edition
Power Supply Seasonic X-1050
Software Win8.1 Pro
Benchmark Scores 3.5 litres of Pale Ale in 18 minutes.
#7
So in other words, the K40 is to the K20X as what the GTX 780ti is to the GTX TITAN, hardware-wise?
Not really.
The GTX 780 Ti has half the onboard memory of the GTX Titan
The Tesla K40 has twice as much onboard memory of the K20X
WOW. Not impressed much :(
Based on your erroneous assumption and the fact that you aren't the target demographic for the board I am not in the least surprised.
There is a "contact us" tab at the Eurotech site should you wish to give these supercomputer manufacturers the benefit of your experience.
 
Joined
Jul 23, 2011
Messages
1,556 (0.66/day)
Likes
1,459
Location
Kaunas, Lithuania
System Name Eternal WIP ][ HP ProLiant server blade
Processor AMD FX-8320 @ 4.4/4.1 GHz ][ 2x Intel Xeon L5520 @ 2.27GHz
Motherboard Asus Crosshair V Formula-Z ][ -
Cooling Corsair H55 ][ Stock
Memory 2x4GB Kingston HyperX Genesis @ 1866MHz ][ 4x4 GiB Hynix @ 1333 MHz; ECC on
Video Card(s) 2x MSI GeForce GTX 770 DirectCU II OC ][ None
Storage 1TB WD Black + 1.5TB WD Green + 750GB Samsung + 64GB OCZ Agility 4 ][ 3x 1 TB Hitachi
Display(s) Asus VG278H + Asus VH226H ][ None
Case Antec Nine Hundred ][ it's a server blade, lol
Audio Device(s) Supreme FX II (Integrated); Using optical S/PDIF output ][ none
Power Supply Corsair AX1200i ][ [stock]
Software Gentoo Linux ][ Linux Mint 16
Benchmark Scores 65970 BogoMIPS. Ha. Ha. Ha. >=|
#8
Not really.
The GTX 780 Ti has half the onboard memory of the GTX Titan
The Tesla K40 has twice as much onboard memory of the K20X
I know this. I simply decided to handwave this fact out in my comparison. As I was pretty much only taking the GPU itself for that comparison.

Based on your erroneous assumption and the fact that you aren't the target demographic for the board I am not in the least surprised.
Well, the way they presented it, it felt as it was supposed to be something heaps faster than the previous top Tesla product. They said it as if it was so fast, it would be more than enough to revolutionize computing. But meh, it's just marginally better [not counting the 2x memory amount]. Shame on them for getting me too excited for a moment there.
 
Joined
Dec 16, 2010
Messages
1,484 (0.57/day)
Likes
544
System Name My Surround PC
Processor Intel Core i7 4770K @ 4.2 GHz (1.15 V)
Motherboard ASRock Z87 Extreme6
Cooling Swiftech MCP35X / XSPC Rasa CPU / Swiftech MCW82 / Koolance HX-1320 w/ 8 Scythe Fans
Memory 16GB (2 x 8 GB) Mushkin Blackline DDR3-2400 CL11-13-13-31
Video Card(s) MSI Nvidia GeForce GTX 980 Ti Armor 2X
Storage Samsung SSD 850 Pro 256GB, 2 x 4TB HGST NAS HDD in RAID 1
Display(s) 3 x Acer K272HUL 27" in Surround 7860x1440
Case NZXT Source 530
Audio Device(s) Integrated ALC1150 + Logitech Z-5500 5.1
Power Supply Seasonic X-1250 1.25kW
Mouse Gigabyte Aivia Krypton
Keyboard Logitech G15
Software Windows 8.1 Pro x64
#9
Well, the way they presented it, it felt as it was supposed to be something heaps faster than the previous top Tesla product. They said it as if it was so fast, it would be more than enough to revolutionize computing. But meh, it's just marginally better [not counting the 2x memory amount]. Shame on them for getting me too excited for a moment there.
Have you not read press releases before? Even the most minor of improvements is "revolutionary" if it's in a press release. :)
 
Joined
Nov 8, 2005
Messages
44 (0.01/day)
Likes
29
Processor Haswell i7 4770
Motherboard Asus Z87-PRO
Memory 32GB DDR3-2133 10-10-10-30
Video Card(s) 2x Radeon R9 390X
Storage Samsung SSD M840 Pro 256GB, 4x320GB mechanical RAID 5
#10
My primary research focus includes a ton of computational electrodynamics work (mainly FDTD in OpenCL) and while the raw crunching power of the GPU is only a moderate upgrade the 12GB of RAM almost justifies the price tag. It's VERY easy to fill up 6GB of RAM, so for sims requiring more RAM you need to get creative with memory management to avoid the large overhead from excess PCIe transfers. 12GB is also pretty easy to fill up, but it gives quite a bit of extra headroom to more creatively manage host-device transfers. Storing the relevant information of a sim across both GPU and system memory can cripple performance by an order of magnitude or more :twitch:

I'd kill for even a mid-range gaming GPU with 16GB+ of RAM...
 
Joined
May 8, 2009
Messages
76 (0.02/day)
Likes
3
#11
K40

This is a nice chunk more performance for those special users and to keep the same TDP is impressive. GPGPU is becoming a much more mainstream tech and Nvidia is at the forefront.