Based on the CUDA architecture codenamed "Fermi", the Tesla M2070 computing module enables seamless integration of GPU computing with host systems for high-performance computing and large, scale-out data center deployments. The 20-series Tesla GPUs deliver more than 10x the double-precision performance of a quad-core x86 CPU and add ECC memory. The Tesla M2070 module delivers all of the standard benefits of GPU computing while enabling maximum reliability and tight integration with system monitoring and management tools. This gives data center IT staff much greater choice in how they deploy GPUs, with a wide variety of rack-mount and blade systems and with the remote monitoring and remote management capabilities they need. Compared to CPU-only systems, servers with Tesla 20-series GPU computing modules deliver supercomputing power at 1/10th the cost and 1/20th the power consumption while providing the highest compute density.
GPUs powered by Fermi-generation CUDA architecture
Delivers cluster performance at 1/10th the cost and 1/20th the power of CPU-only systems based on quad-core CPUs.
448 CUDA cores
448 CUDA cores deliver up to 515 Gigaflops of double-precision peak performance in each GPU, enabling servers from leading OEMs to deliver a Teraflop or more of double-precision performance per 1 RU of space. Single-precision peak performance is over one Teraflop per GPU.
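The peak figures above can be reproduced with simple arithmetic. The sketch below is illustrative only: the 1.15 GHz processor clock and the assumption that double precision runs at half the single-precision rate are not stated in this description, and the function name is hypothetical.

```python
def peak_gflops(cores: int, clock_ghz: float, flops_per_core_per_cycle: float) -> float:
    """Theoretical peak = cores x clock (GHz) x floating-point ops per core per cycle."""
    return cores * clock_ghz * flops_per_core_per_cycle


CUDA_CORES = 448   # from the product description
CLOCK_GHZ = 1.15   # ASSUMPTION: processor clock, not given in the text above

# A fused multiply-add counts as 2 floating-point operations per cycle.
single = peak_gflops(CUDA_CORES, CLOCK_GHZ, 2.0)
# ASSUMPTION: double precision at half the single-precision rate.
double = peak_gflops(CUDA_CORES, CLOCK_GHZ, 1.0)

print(f"single precision peak: {single:.1f} GFLOPS")
print(f"double precision peak: {double:.1f} GFLOPS")
```

Under these assumptions the results come out at roughly 1030 and 515 Gigaflops, in line with the quoted figures. These are theoretical peaks; sustained application performance is typically lower.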
ECC memory meets a critical requirement for computing accuracy and reliability for datacenters and supercomputing centers. It offers protection of data in memory to enhance data integrity and reliability for applications. Register files, L1/L2 caches, shared memory, and DRAM all are ECC protected.
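To illustrate what an error-correcting code does, here is a minimal sketch of a textbook Hamming(7,4) code, which corrects any single flipped bit in a 7-bit codeword. It is a teaching example only: this description does not say which ECC scheme the hardware uses, and hardware ECC typically relies on wider codes that also detect double-bit errors. The function names are hypothetical.

```python
def hamming74_encode(data_bits):
    """Encode 4 data bits [d1, d2, d3, d4] into a 7-bit codeword."""
    d1, d2, d3, d4 = data_bits
    p1 = d1 ^ d2 ^ d4   # parity over codeword positions 1, 3, 5, 7
    p2 = d1 ^ d3 ^ d4   # parity over codeword positions 2, 3, 6, 7
    p3 = d2 ^ d3 ^ d4   # parity over codeword positions 4, 5, 6, 7
    # Codeword layout (positions 1..7): p1 p2 d1 p3 d2 d3 d4
    return [p1, p2, d1, p3, d2, d3, d4]


def hamming74_correct(codeword):
    """Return (data bits, error position); position 0 means no error detected."""
    c = list(codeword)
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    position = s1 + 2 * s2 + 4 * s3   # syndrome is the 1-based index of the flipped bit
    if position:
        c[position - 1] ^= 1          # flip the faulty bit back
    return [c[2], c[4], c[5], c[6]], position


word = hamming74_encode([1, 0, 1, 1])
corrupted = list(word)
corrupted[4] ^= 1                     # simulate a single-bit memory error at position 5
recovered, position = hamming74_correct(corrupted)
print(recovered, position)
```

The syndrome bits identify the flipped position, so the original data is recovered without a re-read or recomputation.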
Up to 6GB of GDDR5 memory per GPU
Maximizes performance and reduces data transfers by keeping larger data sets in local memory that is attached directly to the GPU.
System monitoring features
You can integrate the GPU subsystem with the host system's monitoring and management capabilities. This means IT staff can manage all of the critical components of the computing system through a common management interface such as IPMI or OEM-proprietary tools.
NVIDIA parallel DataCache
Accelerates algorithms such as physics solvers, ray-tracing and sparse matrix multiplication where data addresses are not known beforehand. The DataCache includes a configurable L1 cache per streaming multiprocessor block and a unified L2 cache for all of the processor cores.
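As an illustration of "data addresses not known beforehand", the sketch below multiplies a sparse matrix in compressed sparse row (CSR) form by a dense vector. It is plain Python for clarity rather than GPU code, and the function and variable names are hypothetical. The read `x[col_idx[k]]` depends on index data loaded at run time, which is the kind of irregular access a cache hierarchy is intended to help with.

```python
def csr_spmv(values, col_idx, row_ptr, x):
    """Compute y = A @ x for a sparse matrix A stored in CSR format."""
    y = [0.0] * (len(row_ptr) - 1)
    for row in range(len(y)):
        # Nonzeros of this row live in values[row_ptr[row] : row_ptr[row + 1]].
        for k in range(row_ptr[row], row_ptr[row + 1]):
            # Indirect read: the address into x is only known once col_idx[k] is loaded.
            y[row] += values[k] * x[col_idx[k]]
    return y


# A = [[4, 0, 0, 1],
#      [0, 3, 0, 0],
#      [2, 0, 5, 0]]
values = [4.0, 1.0, 3.0, 2.0, 5.0]
col_idx = [0, 3, 1, 0, 2]
row_ptr = [0, 2, 3, 5]
x = [1.0, 2.0, 3.0, 4.0]

print(csr_spmv(values, col_idx, row_ptr, x))  # [8.0, 6.0, 17.0]
```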
NVIDIA GigaThread engine
The NVIDIA GigaThread engine maximizes throughput through context switching that is 10x faster than the previous architecture, concurrent kernel execution, and improved thread block scheduling.
High speed, PCIe Gen 2.0 data transfer
High speed, PCIe Gen 2.0 data transfer maximizes bandwidth between the host system and the Tesla processors. It enables Tesla systems to work with virtually any PCIe-compliant host system with an open PCIe slot.
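For a rough sense of scale, the theoretical PCIe Gen 2.0 bandwidth can be estimated from the per-lane signalling rate (5 GT/s) and the 8b/10b line encoding. The sketch below assumes an x16 link, which this description does not state, treats 6 GB as 6e9 bytes, and ignores protocol overhead, so achievable throughput will be lower. The function name is hypothetical.

```python
def pcie_bandwidth_gb_s(transfer_rate_gt_s: float, lanes: int, encoding_efficiency: float) -> float:
    """Theoretical one-direction bandwidth in GB/s (decimal gigabytes)."""
    usable_gbit_per_lane = transfer_rate_gt_s * encoding_efficiency
    return usable_gbit_per_lane * lanes / 8.0   # 8 bits per byte


GEN2_RATE_GT_S = 5.0     # PCIe 2.0 signalling rate per lane
ENCODING_8B10B = 8 / 10  # 8 payload bits carried in every 10 transmitted bits
LANES = 16               # ASSUMPTION: x16 link width, not stated in the text above

peak = pcie_bandwidth_gb_s(GEN2_RATE_GT_S, LANES, ENCODING_8B10B)
fill_time_s = 6.0 / peak  # time to move 6 GB (taken as 6e9 bytes) at the theoretical peak

print(f"theoretical peak: {peak:.1f} GB/s per direction")
print(f"time to transfer 6 GB: {fill_time_s:.2f} s")
```

Under these assumptions the link peaks at about 8 GB/s per direction, so filling the full 6 GB of GPU memory takes on the order of three quarters of a second at best. This is one reason keeping data resident in GPU memory between kernels is worthwhile.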
In The Box
- Designed for maximum reliability
- 2-slot passively cooled second-generation Tesla module with 6 GB memory