HPE and NVIDIA team up to build enterprise-grade GenAI solution

Bengaluru: HPE and NVIDIA have expanded their strategic collaboration to build an enterprise computing solution for generative AI (GenAI).

According to HPE, this co-engineered, pre-configured AI tuning and inferencing solution enables enterprises of any size to quickly customise foundation models using private data and deploy production applications anywhere, from edge to cloud.

The joint offering removes the complexity of developing and deploying GenAI infrastructure by providing a full-stack AI tuning and inferencing solution from HPE and NVIDIA.

To put GenAI models to work, enterprises require a software and infrastructure stack that can be deployed quickly, wherever the business needs it. The new solution for generative AI is part of an expanded collaboration between HPE and NVIDIA that delivers full-stack, out-of-the-box AI solutions.

These solutions integrate HPE Machine Learning Development Environment, HPE Ezmeral Software, HPE ProLiant Compute and HPE Cray Supercomputers with the NVIDIA AI Enterprise software suite, including the NVIDIA NeMo framework.

“Together, HPE and NVIDIA are in a unique position to deliver a comprehensive AI-native solution that will dramatically ease the journey to develop and deploy AI models with a portfolio of pre-configured solutions,” said Antonio Neri, President and CEO of HPE. “The strategic collaboration between HPE and NVIDIA will dramatically reduce barriers for customers looking to transform their businesses with AI.”

“The generative AI era is ramping at full speed, with enterprises racing to reimagine their businesses,” said Jensen Huang, Founder and CEO of NVIDIA. “Our expanded collaboration with HPE will help enterprises drive unprecedented productivity through AI applications that connect with business data to power accurate assistants, informed chatbots and semantic search.”

Drive AI velocity with a purpose-built AI tuning and inferencing solution

The new AI tuning and inferencing data centre solution gives enterprises of all sizes an entry point to AI: an out-of-the-box solution that lets them start their AI journey quickly.

With the new enterprise computing solution for generative AI, enterprises can use pre-trained foundation models with their private data to create production applications such as AI chatbots. In addition, retrieval-augmented generation (RAG) workstreams further improve the data quality and accuracy of the application.
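As a rough illustration of the RAG idea mentioned above, the sketch below retrieves the most relevant private document for a question and places it in the prompt that would be sent to a foundation model. It is a toy example, not the HPE or NVIDIA implementation: the documents, the function names and the word-count similarity are all assumptions made for illustration, and a production system would typically use learned embeddings and a vector store instead.

```python
import math
import re
from collections import Counter

# Hypothetical private documents (illustrative only, not from the article).
DOCS = [
    "Refunds are processed within 14 days of the return request.",
    "The office is closed on public holidays.",
    "Support tickets are answered within one business day.",
]

def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def cosine(a, b):
    """Cosine similarity between two word-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, documents, k=1):
    """Return the k documents most similar to the query."""
    q = Counter(tokenize(query))
    ranked = sorted(
        documents,
        key=lambda d: cosine(q, Counter(tokenize(d))),
        reverse=True,
    )
    return ranked[:k]

def build_prompt(query, documents, k=1):
    """Prepend the retrieved context to the question before generation."""
    context = "\n".join(retrieve(query, documents, k))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

if __name__ == "__main__":
    print(build_prompt("How long do refunds take?", DOCS))
```

Because the model answers from the retrieved text rather than from its training data alone, the response can reflect the enterprise's own, current information.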

The solution comprises the following components:

Purpose-built and optimised for AI: A rack-scale architecture featuring market-leading HPE ProLiant Compute DL380a servers pre-configured with NVIDIA L40S GPUs, NVIDIA BlueField-3 DPUs and the NVIDIA Spectrum-X Ethernet Networking Platform for hyperscale AI. The solution was sized to fine-tune a 70 billion-parameter Llama 2 model and includes 16 HPE ProLiant DL380a servers and 64 L40S GPUs.

HPE AI software: HPE Machine Learning Development Environment with new generative AI studio capabilities to rapidly prototype and test models, and HPE Ezmeral Software with new GPU-aware capabilities to simplify deployment and accelerate data preparation for AI workloads across the hybrid cloud.

NVIDIA AI software: NVIDIA AI Enterprise to accelerate production AI development and deployment with security, stability, manageability and support. It offers the NVIDIA NeMo framework, guardrailing toolkits, data curation tools and pretrained models to streamline enterprise GenAI.
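To put the rack-scale configuration above in perspective, the following back-of-the-envelope sketch works through the stated figures (16 servers, 64 GPUs, a 70 billion-parameter model). The server, GPU and parameter counts come from the article; the 48 GB of memory per L40S and the bytes-per-parameter rules of thumb are assumptions added here, and the estimate ignores activation memory, so it should be read as an order-of-magnitude check rather than HPE's actual sizing method.

```python
# Figures stated in the article.
SERVERS = 16
GPUS = 64
MODEL_PARAMS = 70e9  # 70 billion-parameter Llama 2

# Assumptions (not from the article).
GPU_MEMORY_GB = 48             # memory per L40S GPU
BYTES_PER_PARAM_WEIGHTS = 2    # 16-bit weights only
BYTES_PER_PARAM_FULL_FT = 16   # rule of thumb: weights, gradients and Adam state

def sizing():
    """Return rough capacity and memory-demand figures in GB."""
    return {
        "gpus_per_server": GPUS // SERVERS,
        "total_gpu_memory_gb": GPUS * GPU_MEMORY_GB,
        "weights_gb": MODEL_PARAMS * BYTES_PER_PARAM_WEIGHTS / 1e9,
        "full_fine_tune_gb": MODEL_PARAMS * BYTES_PER_PARAM_FULL_FT / 1e9,
    }

if __name__ == "__main__":
    for name, value in sizing().items():
        print(f"{name}: {value:g}")
```

Under these assumptions the configuration works out to four GPUs per server, and the aggregate GPU memory exceeds the estimated footprint of the model state during full fine-tuning, before activations are counted.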

The enterprise computing solution for generative AI will be available to order in the first quarter of calendar year 2024 (Q1 CY24).