Table 4. 6-Pin PCI Express Power Connector Pinout ................................................. 12
Table 5. 8-Pin PCI Express Power Connector Pinout ................................................. 12
Table 6. Auxiliary Power Connectors ................................................................... 13
Table 7. Power Requirements ............................................................................ 14
Table 8. Languages Supported ........................................................................... 16
Tesla K40 GPU Accelerator BD-06902-001_v05 | iv
OVERVIEW
The NVIDIA® Tesla® K40 graphics processing unit (GPU) is a PCI Express, dual-slot
computing module in the Tesla (267 mm length) form factor, built around a single
GK110B GPU. The Tesla K40 is designed for servers, offers a total of 12 GB of
GDDR5 on-board memory, and supports PCI Express Gen3. It uses a passive
heat sink for cooling.
Tesla K40 boards ship with ECC enabled by default, protecting the register files, cache,
and DRAM. With ECC enabled, some of the memory is used for the ECC bits, so the
user-available memory is reduced by ~6.25%. On the Tesla K40, the total available
memory with ECC turned on is ~11.25 GB.
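The ECC figure above follows from simple arithmetic, sketched here in Python. The function and constant names are illustrative only and are not part of any NVIDIA tool; the 12 GB and 6.25% values are taken from the text.

```python
TOTAL_MEMORY_GB = 12.0   # from the text: 12 GB of GDDR5 on-board memory
ECC_OVERHEAD = 0.0625    # from the text: ~6.25% of memory holds the ECC bits


def usable_memory_gb(total_gb: float, ecc_enabled: bool,
                     overhead: float = ECC_OVERHEAD) -> float:
    """Return the approximate user-available memory in GB."""
    if not ecc_enabled:
        return total_gb
    # With ECC on, a fixed fraction of DRAM is reserved for the check bits.
    return total_gb * (1.0 - overhead)


if __name__ == "__main__":
    print(usable_memory_gb(TOTAL_MEMORY_GB, ecc_enabled=True))   # 11.25
    print(usable_memory_gb(TOTAL_MEMORY_GB, ecc_enabled=False))  # 12.0
```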
Figure 1. K40 Passive Board
KEY FEATURES
GPU
Number of processor cores: 2880
Core clocks
● Base clock: 745 MHz
● Boost clocks: 810 MHz and 875 MHz
Package size: 45 mm × 45 mm 2397-pin ball grid array (S-FCBGA)
Note: All boards ship with core clock set to the base clock value. Boost clocks can
be selected using NVML or NVSMI. Refer to the NVML/NVSMI documentation for
more details.
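As a rough illustration of selecting a clock through NVSMI, the sketch below builds an `nvidia-smi` command line that uses the `--applications-clocks=<memory>,<graphics>` option. The helper name is hypothetical, and the 3004 MHz memory clock value is an assumption (the text only says the memory clock is 3 GHz). Confirm the supported pairs on the target system with `nvidia-smi -q -d SUPPORTED_CLOCKS` before applying anything.

```python
# Core clocks listed in this document: base clock plus the two boost clocks.
SUPPORTED_CORE_CLOCKS_MHZ = (745, 810, 875)

# Assumption: memory clock value as reported by nvidia-smi on this board.
# Verify with `nvidia-smi -q -d SUPPORTED_CLOCKS`.
ASSUMED_MEMORY_CLOCK_MHZ = 3004


def applications_clocks_command(core_mhz: int, gpu_index: int = 0,
                                memory_mhz: int = ASSUMED_MEMORY_CLOCK_MHZ):
    """Build (but do not run) an nvidia-smi command that sets application clocks."""
    if core_mhz not in SUPPORTED_CORE_CLOCKS_MHZ:
        raise ValueError(
            f"{core_mhz} MHz is not one of {SUPPORTED_CORE_CLOCKS_MHZ}")
    return [
        "nvidia-smi",
        "-i", str(gpu_index),  # select the GPU by index
        f"--applications-clocks={memory_mhz},{core_mhz}",
    ]


if __name__ == "__main__":
    # Prints the command only; running it typically requires administrator rights.
    print(" ".join(applications_clocks_command(875)))
```

The command is returned as a list so that it can be passed to `subprocess.run` unchanged once the clock values have been verified for the board.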
Board
Overview
PCI Express Gen3 ×16 system interface
Physical dimensions: 111.15 mm (height) × 267 mm (length), dual-slot
Thermal Solution
Passive heat sink
Display Connectors
None
Power Connectors
One 6-pin PCI Express power connector
One 8-pin PCI Express power connector
NVIDIA GPU Boost™ is a feature available on the Tesla K40. It uses any available power
headroom to run the core clock at a higher frequency. Application workloads that have
power headroom can run at higher GPU clocks to boost application performance.
Note: The memory clock remains constant at 3 GHz. It's likely that the effective
memory bandwidth utilization will change depending on the core clock frequency.
NVIDIA GPU Boost for HPC Workloads
NVIDIA GPU Boost for Tesla K40 is optimized to deliver robust and deterministic
boost behavior for a wide range of HPC workloads.
Tesla K40 gives end users full control to select the core clock frequency that best fits their
workload. The workload may have one or more of the following characteristics:
● Problem set is spread across multiple GPUs and requires periodic synchronization.
● Problem set is spread across multiple GPUs, and each GPU runs independently of the others.
● Workload has “compute spikes.” For example, some portions of the workload are
extremely compute intensive, pushing the power higher, and some portions are
moderate.
● Workload is compute intensive throughout, without any spikes.
● Workload requires fixed clocks and is sensitive to clocks fluctuating during
execution.
● Workload runs in a cluster where all GPUs need to start, finish, and run at the same
clocks.
● Workload or end user requires predictable performance and repeatable results.
● Datacenter is used to run different types of workloads at different hours of the day to
better manage power consumption.
● Some boards in a cluster have access to better cooling than others.
By default, the Tesla K40 ships with the core clock set to the base clock. HPC workloads
can have one or more of the characteristics described above. When selecting one of the supported
boost clocks, a good strategy is to characterize the workload with the available boost
clocks. For example, DGEMM/Linpack are extremely demanding on power. Therefore,
the “base clock” may be the correct choice when running Linpack. Some workloads in
life sciences, manufacturing, CFD, CAD, etc., may have power headroom and can take
advantage of one of the boost clocks.
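One possible way to act on such a characterization is sketched below: run the workload at each supported clock, record the peak board power, and keep the highest clock that stays within a chosen power budget. The function name, the budget, and the sample numbers are all hypothetical; they are not measurements or limits taken from this document.

```python
from typing import Mapping


def pick_core_clock(peak_power_w: Mapping[int, float],
                    power_budget_w: float,
                    base_clock_mhz: int = 745) -> int:
    """Return the highest characterized clock (MHz) whose measured peak power
    fits the budget, falling back to the base clock if none does."""
    fitting = [clock for clock, watts in peak_power_w.items()
               if watts <= power_budget_w]
    return max(fitting, default=base_clock_mhz)


if __name__ == "__main__":
    # Made-up peak power readings (watts) for one workload at each clock.
    measured = {745: 180.0, 810: 205.0, 875: 232.0}
    print(pick_core_clock(measured, power_budget_w=220.0))  # 810
```

A power-hungry workload in the style of Linpack would exceed the budget at every boost clock in such a table and fall back to the base clock, which matches the guidance above.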