Home Artificial Intelligence Nvidia AI Enterprise adds generative AI microservices

by Jon Gold

Senior writer

Nvidia AI Enterprise adds generative AI microservices

News

Mar 18, 20243 mins

ContainersGenerative AIMicroservices

A broad new array of generative AI-focused tools for developers is available in Nvidia AI Enterprise 5.0.

Credit: Nvidia

Version 5.0 of Nvidia’s enterprise-spanning AI software platform will feature a smorgasbord of microservices designed to speed app development and provide quick ways to ramp up deployments, the company announced today at its GPU Technology Conference.

These microservices are provided as downloadable software containers used to deploy enterprise applications, Nvidia said in an official blog post. They’re split into two main categories — Nvidia NIM, which covers microservices related to deploying production AI models, and CUDA-X, for microservices like cuOpt, the company’s optimization engine.

For NIM microservices the focus is on deployment times for generative AI apps, which the company said can be reduced “from weeks to minutes” with its services. The microservices include Triton Inference Server for standardizing AI model deployment, and TensorRT-LLM to help optimize and define large language models, making it easier for companies to experiment with LLMs without having to delve into C++ or Nvidia CUDA. They’ll be accessible via Amazon SageMaker, Google Kubernetes Engine, and Microsoft Azure AI, and integrations with AI frameworks like Deepset, LangChain and LlamaIndex are also supported.

CUDA-X microservices, by contrast, are more focused on data preparation and model training, as well as tools to enable developers to tie their generative AI apps to business data, whether that’s numerical information, text, or images. Other microservices in this category are almost applications of their own, like Nvidia Riva for translation and speech AI, the aforementioned cuOpt for process and routing optimization and Earth-2 for climate and weather simulations.

A host of further integrations is also coming to AI Enterprise 5.0, the company said. Business data hosted on Box, Cloudera, Cohesity, Datastax and the like can be used in AI applications as of version 5.0, and Nvidia-powered hardware can be found in servers and PCs from most major vendors, including Dell, HPE and Lenovo.

Nvidia described the microservices as a new layer in its full-stack computing platform, connecting model developers with platform providers and enterprises and providing a standardized path for running custom AI models across clouds, data centers, workstations and PCs.

Nvidia’s AI Enterprise 5.0 is available for developers to tinker with for free as of now, and enterprise licenses can be purchased for $4,500 per GPU per year, or $1 per GPU per hour in the cloud.

by Jon Gold

Senior Writer

Jon Gold covers IoT and wireless networking for Network World. He can be reached at jon_gold@ifoundrycodg.com.

Africa

Americas

Asia

Europe

Oceania

Topics

About

Policies

Our Network

More

Nvidia AI Enterprise adds generative AI microservices

A broad new array of generative AI-focused tools for developers is available in Nvidia AI Enterprise 5.0.

More from this author

US government extends warrantless FISA monitoring

Certinia bakes AI into its latest professional services updates

Cisco-led consortium to spread AI expertise in the workforce

Amazon drops ‘just walk out’ technology at its US retail locations

Show me more

Oracle adds AI capabilities to its Fusion Cloud CX

What LinkedIn learned leveraging LLMs for its billion users

IBM doubles down on hybrid cloud with $6.4B HashiCorp acquisition

CIO Leadership Live Middle East with Ahmed Wattar, Group Information Technology Director at Alfa Medical Group

CIO Leadership Live Middle East with Dr. Mohammad Alshehri, CISO and Cybersecurity Consultant

CIO Leadership Live Middle East with Wissam Al Adany, Chief Information Officer, ADES Holding

3 Leadership Tips: Renate Cuneen, Vice President, Global Corporate Technology, Canada Life

GenAI and Trust: How Companies Are Thinking About the Trustworthiness of AI and GenAI Tools

CIO Leadership Live Middle East with Ahmed Wattar, Group Information Technology Director at Alfa Medical Group

Nvidia AI Enterprise adds generative AI microservices

A broad new array of generative AI-focused tools for developers is available in Nvidia AI Enterprise 5.0.

Related content

TransUnion transforms its business with IT

The 10 highest-paying industries for IT talent

M&A action is gaining momentum, are your cloud security leaders prepared?

CIOs eager to scale AI despite difficulty demonstrating ROI, survey finds

From our editors straight to your inbox

More from this author

US government extends warrantless FISA monitoring

Certinia bakes AI into its latest professional services updates

Cisco-led consortium to spread AI expertise in the workforce

Amazon drops ‘just walk out’ technology at its US retail locations

Show me more

Oracle adds AI capabilities to its Fusion Cloud CX

What LinkedIn learned leveraging LLMs for its billion users

IBM doubles down on hybrid cloud with $6.4B HashiCorp acquisition

CIO Leadership Live Middle East with Ahmed Wattar, Group Information Technology Director at Alfa Medical Group

CIO Leadership Live Middle East with Dr. Mohammad Alshehri, CISO and Cybersecurity Consultant

CIO Leadership Live Middle East with Wissam Al Adany, Chief Information Officer, ADES Holding

3 Leadership Tips: Renate Cuneen, Vice President, Global Corporate Technology, Canada Life

GenAI and Trust: How Companies Are Thinking About the Trustworthiness of AI and GenAI Tools

CIO Leadership Live Middle East with Ahmed Wattar, Group Information Technology Director at Alfa Medical Group