Software

Oracle launches In-Database LLMs and In-Database Vector Store with HeatWave GenAI

by VARINDIA 2024-06-29

Oracle announced the general availability of HeatWave GenAI, which includes in-database large language models (LLMs), an automated in-database vector store, scale-out vector processing, and the ability to have contextual conversations in natural language informed by unstructured content. These new capabilities enable customers to bring the power of generative AI to their enterprise data—without requiring AI expertise or having to move data to a separate vector database. HeatWave GenAI is available immediately in all Oracle Cloud regions, Oracle Cloud Infrastructure (OCI) Dedicated Region, and across clouds at no extra cost to HeatWave customers.

With HeatWave GenAI, developers can create a vector store for enterprise unstructured content with a single SQL command, using built-in embedding models. Users can perform natural language searches in a single step using either in-database or external LLMs. Data doesn’t leave the database and, due to HeatWave’s extreme scale and performance, there is no need to provision GPUs. As a result, developers can reduce application complexity, increase performance, improve data security, and lower costs.

New automated and built-in generative AI features include:

· In-database LLMs simplify the development of generative AI applications at a lower cost. Customers can benefit from generative AI without the complexity of external LLM selection and integration, and without worrying about the availability of LLMs in various cloud providers’ data centers. The in-database LLMs enable customers to search data, generate or summarize content, and perform retrieval-augmented generation (RAG) with HeatWave Vector Store. In addition, they can combine generative AI with other built-in HeatWave capabilities such as AutoML to build richer applications. HeatWave GenAI is also integrated with the OCI Generative AI service to access pre-trained, foundational models from leading LLM providers.

· Automated in-database Vector Store enables customers to use generative AI with their business documents without moving data to a separate vector database and without AI expertise. All the steps to create a vector store and vector embeddings are automated and executed inside the database, including discovering the documents in object storage, parsing them, generating embeddings in a highly parallel and optimized way, and inserting them into the vector store making HeatWave Vector Store efficient and easy to use. Using a vector store for RAG helps solve the hallucination challenge of LLMs as the models can search proprietary data with appropriate context to provide more accurate and relevant answers.

· Scale-out vector processing delivers very fast semantic search results without any loss of accuracy. HeatWave supports a new, native VECTOR data type and an optimized implementation of the distance function, enabling customers to perform semantic queries with standard SQL. In-memory hybrid columnar representation and the scale-out architecture of HeatWave enable vector processing to execute at near-memory bandwidth and parallelize across up to 512 HeatWave nodes. As a result, customers get their questions answered rapidly. Users can also combine semantic search with other SQL operators to, for example, join several tables with different documents and perform similarity searches across all documents.

· HeatWave Chat is a Visual Code plug-in for MySQL Shell which provides a graphical interface for HeatWave GenAI and enables developers to ask questions in natural language or SQL. The integrated Lakehouse Navigator enables users to select files from object storage and create a vector store. Users can search across the entire database or restrict the search to a folder. HeatWave maintains context with the history of questions asked, citations of the source documents, and the prompt to the LLM. This facilitates a contextual conversation and allows users to verify the source of answers generated by the LLM. This context is maintained in HeatWave and is available to any application using HeatWave.

“HeatWave GenAI is set to revolutionize how Indian enterprises manage and analyze their rapidly growing data. By combining the power of generative AI with real-time analytics and transaction processing within a single MySQL database service, HeatWave GenAI enables organizations to extract valuable insights more efficiently and at no additional cost. This innovative solution simplifies application complexity and ensures predictable performance, empowering Indian businesses to stay ahead in the competitive landscape through accelerated digital transformation and data-driven decision-making," said P Saravanan, Vice President of Cloud Engineering at Oracle India.

Vector Store Creation and Vector Processing Benchmarks

Creating a vector store for documents in PDF, PPT, WORD, and HTML formats is up to 23X faster with HeatWave GenAI and 1/4th the cost of using Knowledge base for Amazon Bedrock.

As demonstrated by a third-party benchmark using a variety of similarity search queries on tables ranging from 1.6GB to 300GB in size, HeatWave GenAI is 30X faster than Snowflake and costs 25 percent less, 15X faster than Databricks and costs 85 percent less, and 18X faster than Google BigQuery and costs 60 percent less.

A separate benchmark reveals that vector indexes in Amazon Aurora PostgreSQL with pgvector can have a high degree of inaccuracy and can yield incorrect results. In contrast, HeatWave similarity search processing always provides accurate results, has predictable response time, is performed at near memory speed, and is up to 10X-80X faster than Aurora using the same number of cores.