NetApp® (NASDAQ: NTAP) has announced it is collaborating with NVIDIA to advance retrieval-augmented generation (RAG) for generative AI applications.
According to an official release, the collaboration connects the NVIDIA NeMo Retriever microservices, coming to the NVIDIA AI Enterprise software platform for development and deployment of production-grade AI applications, including generative AI, to exabytes of data on NetApp’s intelligent data infrastructure. It’s believed that NetApp ONTAP® customers will be able to “talk to their data” to access proprietary business insights without having to compromise the privacy of their data.
Reportedly, by combining NVIDIA’s NeMo Retriever microservices with NetApp ONTAP’s footprint, enterprises, both on-premises and in the world’s largest public clouds, can access their data wherever it resides. Seemingly, this reduces the friction, cost, and time to value for RAG. From what it’s understood, the new capability allows customers to talk to their corporate data specifically for enterprise RAG complements NetApp’s portfolio of mature AI offerings, which have been leveraged by its joint customers for AI model training and inference, including solutions built on NVIDIA DGX BasePOD and that have certification for NVIDIA DGX SuperPOD, as well as the new NVIDIA OVX systems storage validation program, designed specifically for enterprise RAG.
“I believe retrieval-augmented generation pairs valuable data with the power of AI to make transformative productivity tools. Together, NVIDIA and NetApp can help enterprises build intelligent generative AI applications that let companies talk to their data,” Jensen Huang, founder and CEO, NVIDIA, said.