140X HIGHER THROUGHPUT TO KEEP UP WITH EXPLODING DATA
SIMPLIFIED OPERATIONS WITH A SINGLE TRAINING AND INFERENCE PLATFORM
REAL-TIME INFERENCE
FASTER DEPLOYMENT WITH NVIDIA DEEP LEARNING SDK
CUDA Parallel-Processing Cores 5120
NVIDIA Tensor Cores 640
FP64 Performance 7.4 TFLO
FP32 Performance 14.8 TFLOPS
FP16 Performance 29.6 TFLOPS
Tensor Performance 118.5 TFLOPS
Max Power Consumption 250 W
Graphics Bus PCI Express 3.0 x 16
Display Connectors DP 1.4 (4)
Form Factor 4.4 H x 10.5 L Dual Slot
4992 NVIDIA CUDA cores with a dual-GPU design
Up to 2.91 teraflops double-precision performance with NVIDIA GPU Boost
Up to 8.73 teraflops single-precision performance with NVIDIA GPU Boost
24 GB of GDDR5 memory
480 GB/s aggregate memory bandwidth
ECC protection for increased reliability
Server-optimised to deliver the best throughput in the data center