Cloudera and Pinecone in new tie-up to provide customers with advanced AI-powered solutions
Pinecone’s vector database is a cornerstone of critical infrastructure
Cloudera, the Santa Clara, California-headquartered data company for enterprise artificial intelligence (AI), and Pinecone, the New York-based vector database company that specialises in long-term memory for AI, have entered a strategic partnership wherein Pinecone’s AI vector database expertise seamlessly integrates into Cloudera’s open data platform.
Under the terms of the partnership, Cloudera will integrate Pinecone’s market-leading vector database into the Cloudera Data Platform (CDP), to empower organisations to build and deploy highly scalable, real-time, AI-powered applications on the CDP more easily.
The new initiative includes the release of a new Applied ML Prototype (AMP) that will allow developers to more quickly create and augment new knowledge bases from data on their website, as well as pre-built connectors that will enable customers to set up ingest pipelines in AI applications more quickly.
In the AMP, Pinecone’s vector database uses these knowledge bases to imbue context into chatbot responses, helping to ensure valuable outputs.
Customer benefits
Customers can use this same architecture to set up or improve support chatbots or internal support search systems. This will enable customers to reduce operational costs by decreasing costly human case-handling efforts and improving the customer experience with faster resolution times.
In the dynamic landscape of generative AI, Pinecone’s vector database has emerged as a cornerstone of critical infrastructure. Tailored for the unique demands of AI, Pinecone’s technology stores and efficiently searches AI representations of data known as vector embeddings. This represents a paradigm shift from traditional databases, which often struggle to perform such semantic similarity searches effectively.
The intrinsic value of Pinecone’s vector database becomes especially evident in its ability to provide much-needed context to queries within applications utilising Large Language Models (LLMs).
This additional context plays a pivotal role in mitigating the occurrence of erroneous outputs, colloquially known as “hallucinations.” In practical terms, this capability ensures that search and generative AI applications are empowered to deliver responses that are not only accurate but also highly relevant, significantly enhancing the user experience.
Abhas Ricky, Chief Strategy Officer at Cloudera, said: “We are excited to bring the power of Pinecone vector database and semantic search capabilities to our public cloud customers to accelerate generative AI use cases and significantly improve the developer experience at scale.”
Elan Dekel, Vice President of Product at Pinecone, added: “Cloudera’s extensive expertise in data management combined with Pinecone’s cutting-edge vector database creates a formidable partnership. A lot of our customers already manage their data with Cloudera. Now, it will be easier than ever for them to build AI applications using their embeddings stored with us and data stored with Cloudera. Together, we will enable organisations to deliver unparalleled personalised experiences, drive user engagement, and achieve business success.”
Sanjeev Mohan, founder of SanjMo and a former Gartner analyst, noted: “Integration of Pinecone with CDP adds critical new functionality to help clients build generative AI applications.
“In addition, the planned integration between the open source Apache NiFi-based Cloudera Data Flow (CDF) and Pinecone further bolsters CDP’s emphasis on universal data distribution for AI. CDP customers can bring AI to where their data resides – on-premises, in the cloud or on the edge,” Mohan added.
Featured image: Abhas Ricky, Chief Strategy Officer at Cloudera. Image: Cloudera