Transforming AI with NVIDIA H100 GPU

August 8, 2023


The ND H100 v5 virtual machine series from Microsoft Azure delivers next-generation performance at scale for generative AI, LLMs, and other compute-intensive workloads.

Microsoft Azure users around the world can now train and deploy their generative AI applications on the latest NVIDIA accelerated computing technology.

The Microsoft Azure ND H100 v5 VMs, which are now available, use NVIDIA H100 Tensor Core GPUs and NVIDIA Quantum-2 InfiniBand networking, so generative AI, high performance computing (HPC), and other applications can be scaled with a click from a browser.

The new instance, which is available to customers across the United States, arrives as academics and developers use large language models (LLMs) and accelerated computing to uncover new consumer and business use cases.

The NVIDIA H100 GPU delivers supercomputing-class performance through fourth-generation Tensor Cores, a new Transformer Engine for accelerating LLMs, and the latest NVLink technology, which lets GPUs communicate with each other at 900 GB/s.

NVIDIA Quantum-2 CX7 InfiniBand, with 3,200 Gbps of cross-node bandwidth, keeps performance consistent across GPUs at massive scale, matching the capabilities of the world’s top supercomputers.
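The two bandwidth figures quoted above use different units (gigabytes per second for NVLink, gigabits per second for InfiniBand), so a quick conversion makes them easier to compare. The sketch below is only illustrative arithmetic; the count of eight GPUs per VM is an assumption that this article does not state.

```python
# Illustrative unit conversion for the bandwidth figures quoted in the article.
BITS_PER_BYTE = 8


def gbps_to_gigabytes_per_sec(gbps: float) -> float:
    """Convert gigabits per second to gigabytes per second."""
    return gbps / BITS_PER_BYTE


CROSS_NODE_GBPS = 3200     # from the article: InfiniBand cross-node bandwidth
NVLINK_GB_PER_SEC = 900    # from the article: NVLink GPU-to-GPU bandwidth
GPUS_PER_VM = 8            # assumption: not stated in the article

# Aggregate cross-node bandwidth expressed in the same unit as NVLink.
cross_node_gb_per_sec = gbps_to_gigabytes_per_sec(CROSS_NODE_GBPS)

# Share of the cross-node bandwidth per GPU, under the assumed GPU count.
per_gpu_gbps = CROSS_NODE_GBPS / GPUS_PER_VM

print(f"Cross-node: {cross_node_gb_per_sec:.0f} GB/s per VM")
print(f"Per GPU (assumed {GPUS_PER_VM} GPUs): {per_gpu_gbps:.0f} Gbps")
print(f"NVLink: {NVLINK_GB_PER_SEC} GB/s")
```

Under these assumptions, the cross-node link works out to 400 GB/s per VM, which is lower than the 900 GB/s NVLink figure for communication between GPUs.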

Using v5 VMs for Scaling

ND H100 v5 VMs are well suited to training and running inference for increasingly complex LLMs and computer vision models. These neural networks power the most demanding and compute-intensive generative AI applications, including question answering, code generation, audio, video, and image generation, and speech recognition.

For inference on LLMs such as the BLOOM 175B model, the ND H100 v5 VMs deliver up to a 2x speedup over previous-generation instances, demonstrating their potential to further optimize AI applications.
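To give a sense of why a model of this size calls for multi-GPU instances, the rough sketch below estimates the memory needed just to hold the weights of a 175-billion-parameter model. The 16-bit weight format, the 80 GB of memory per GPU, and the eight GPUs per VM are all assumptions that are not stated in this article, and the estimate ignores activations, caches, and framework overhead.

```python
# Back-of-the-envelope estimate of weight memory for a 175B-parameter model.

def weights_size_gb(num_params: float, bytes_per_param: int) -> float:
    """Return the size of the model weights in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


NUM_PARAMS = 175e9       # from the article: BLOOM 175B
BYTES_PER_PARAM = 2      # assumption: 16-bit (FP16/BF16) weights
GPU_MEMORY_GB = 80       # assumption: memory per GPU, not stated in the article
GPUS_PER_VM = 8          # assumption: GPUs per VM, not stated in the article

weights_gb = weights_size_gb(NUM_PARAMS, BYTES_PER_PARAM)
vm_memory_gb = GPU_MEMORY_GB * GPUS_PER_VM

print(f"Weights: {weights_gb:.0f} GB")
print(f"Fits on a single GPU: {weights_gb <= GPU_MEMORY_GB}")
print(f"Fits across one VM ({vm_memory_gb} GB total): {weights_gb <= vm_memory_gb}")
```

Under these assumptions, the weights alone take about 350 GB, which exceeds a single GPU's memory but fits across the GPUs of one VM.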

Azure and NVIDIA

NVIDIA H100 Tensor Core GPUs on Azure give businesses the performance, versatility, and scale they need to accelerate their AI training and inference workloads. The NVIDIA AI Enterprise software suite, integrated with Azure Machine Learning for MLOps, speeds up the development and deployment of production AI, and the combination has delivered record-breaking AI performance in the widely used MLPerf benchmarks.

In addition, by integrating the NVIDIA Omniverse platform with Azure, NVIDIA and Microsoft are giving hundreds of millions of Microsoft enterprise users access to powerful industrial digitalization and AI supercomputing resources.

