NVIDIA’s Multi-Precision Tensor Core T4 GPU Available on Google Cloud

NVIDIA’s Multi-Precision Tensor Core T4 GPU Available on Google Cloud
0 comments, 13/11/2018, by , in News

DALLAS, TX, USA, Nov 13, 2018 – SC18 - NVIDIA announced that the new NVIDIA T4 GPU has received the fastest adoption of any server GPU.




Two months after its introduction, the T4 is featured in 57 separate server designs from the world’s leading computer makers. It is also now available in the cloud, with the first availability of the T4 for Google Cloud Platform customers.

“We have never before seen such rapid adoption of a data center processor,” said Ian Buck, vice president and general manager of Accelerated Computing at NVIDIA. “Just 60 days after the T4’s launch, it’s now available in the cloud and is supported by a worldwide network of server makers. The T4 gives today’s public and private clouds the performance and efficiency needed for compute-intensive workloads at scale.”

The T4 accelerates diverse cloud workloads, including high-performance computing, deep learning training and inference, machine learning, data analytics, and graphics. Based on the new NVIDIA Turing architecture, it features multi-precision Turing Tensor Cores and new RT Cores, which, when combined with accelerated containerized software stacks, deliver unprecedented performance at scale.

“Real-time visualization and online inference workloads need low latency for their end users. We are delighted to partner with NVIDIA to offer T4 GPU support for Google Cloud customers,” said Damion Heredia, senior director of Product Management at Google Cloud. “NVIDIA T4 GPUs for Google Cloud offers a highly scalable, cost-effective, low-latency platform for our ML and visualization customers. Google Cloud’s network capabilities together with the T4 offering enable customers to innovate in new ways, speeding up applications while reducing costs.”

Interested customers can sign up for Google Cloud’s early access program.

Consumer internet companies, including social media and online shopping sites, are among T4’s early adopters and largest end-customer base.

Among server companies featuring the T4 are Dell EMC, Hewlett Packard Enterprise, IBM, Lenovo, and Supermicro.

Flexible Design, Breakthrough Performance

Designed to meet the unique needs of scale-out public and enterprise cloud environments, T4 maximizes throughput, utilization, and user concurrency, helping customers efficiently address exploding user and data growth.

Roughly the size of a candy bar, the low-profile 70-watt T4 GPU has the flexibility to fit into a standard server or any Open Compute Project hyperscale server design. Server designs can range from a single T4 GPU all the way up to 20 GPUs in a single node.

T4’s multi-precision capabilities power breakthrough AI performance for a wide range of AI workloads at four different levels of precision, offering 8.1 TFLOPS at FP32, 65 TFLOPS at FP16 as well as 130 TOPS of INT8 and 260 TOPS of INT4. For AI inference workloads, a server with two T4 GPUs can replace 54 CPU-only servers. For AI training, a server with two T4 GPUs can replace nine dual-socket CPU-only servers.

Keep Current on NVIDIA

Subscribe to the NVIDIA blog, follow us on FacebookTwitterLinkedIn, and Instagram, and view NVIDIA videos on YouTube and images on Flickr.


NVIDIA’s (NASDAQ: NVDA) invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots and self-driving cars that can perceive and understand the world. More information at http://nvidianews.nvidia.com/.

About Guest/blogger

Guest/blogger posts belongs to respective authors. The articles/tips are summarized here, if interested in reading the complete blog post, please follow the links given under each post.