Build AI-driven search applications with Elasticsearch

Liu Xiaoguo Elastic Chief Evangelist, China Community

Currently the Chief Evangelist for the Elastic China community. Holds a Master's degree from the National University of Singapore and Bachelor's and Master's degrees from Northwestern Polytechnical University. Previously worked at companies including Singapore Technologies, Compaq, General Motors, Ericsson, Nokia, the non-profit organization Linaro (Linux for ARM), Ubuntu, and Vantiq. Has experience in computer design, automotive electronics, computer operating systems, telecommunications, and cloud real-time event processing. Has nearly 20 years of experience in community work, starting from Ericsson, then Nokia, Ubuntu, and now Elastic. Enjoys sharing the knowledge he has learned. Believes that helping others is helping oneself. Hopes to share and learn with everyone. Welcome to visit the official Elastic Chinese blog at elasticstack.blog.csdn.net .

Abstract

Elasticsearch is the world's leading search engine. Traditional keyword search cannot meet the needs of today's intelligent era. Contemporary enterprises propose semantic search, which is searching based on the semantics of text rather than simple keyword matching. Additionally, we need to search for other data types, such as images, voice, and video. Since version 8.0, Elasticsearch has provided vector search (dense vectors, sparse vectors). It can perfectly solve text semantic search and multimedia data search. However, vector search is not perfect, especially for text search. We can use hybrid search (keyword search, vector search) for multi-path recall and rank the final results. This method can improve search accuracy and recall rate. In today's AI development, combined with large models, we can integrate the search results with large models and use GenAI to obtain inference results. Since enterprise data or private data is generated every moment, and the knowledge of large models is limited to the time of model generation, and the data of large models is obtained through training on web data. Using large models to infer enterprise or private data without context often leads to hallucinations because this knowledge does not exist in the large model. By combining Elasticsearch's vector search technology to search enterprise data or private data and providing the search results as context to the large model, hallucinations can be eliminated. This technology is also known as RAG (Retrieval-Augmented Generation). This topic will provide a detailed introduction to Elasticsearch's vector search technology and how to use it for RAG application development and the latest agentic RAG.

Details

Search Needs in the Intelligent Era
- Demand for semantic search, rather than simple keyword matching
- Search for multimedia data, such as images, sounds, and videos
- Search for unstructured data
- New solutions brought by vector search
Elasticsearch vector search and latest developments
- Principles of vector search
- Types of vector search (dense vectors, sparse vectors)
- Introduction to hybrid search (multi-channel recall, comprehensive scoring)
- Hardware acceleration, parallelization, scalar quantization, search efficiency
- Relevance adjustment
- Semantic text fields
- Inference API
- AI ecosystem
- Serverless
RAG implementation principles
- How to make large models smarter
- Methods of implementing RAG
Case sharing of using Elasticsearch in enterprise search
- Advanced RAG case sharing
- Agentic RAG
Demos