Retrieval-Augmented Generation chatbot, part 1: LangChain, Hugging Face, FAISS, AWS

Описание к видео Retrieval-Augmented Generation chatbot, part 1: LangChain, Hugging Face, FAISS, AWS

In this video, I'll guide you through the process of creating a Retrieval-Augmented Generation (RAG) chatbot using open-source tools and AWS services, such as LangChain, Hugging Face, FAISS, Amazon SageMaker, and Amazon TextTract.

Part 2:    • Retrieval-Augmented Generation chatbo...   - scaling indexing and search with Amazon OpenSearch Serverless!

⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos. Follow me on Medium at   / julsimon   or Substack at https://julsimon.substack.com. ⭐️⭐️⭐️

We begin by working with PDF files in the Energy domain. Our first step involves leveraging Amazon TextTract to extract valuable information from these PDFs. Following the extraction, we break down the text into smaller, more manageable chunks. These chunks are then enriched using a Hugging Face feature extraction model before being organized and stored within a FAISS index for efficient retrieval.

To ensure a seamless workflow, we employ LangChain to orchestrate the entire process. With LangChain as our backbone, we query a Mistral Large Language Model (LLM) deployed on Amazon SageMaker. These queries include semantically relevant context retrieved from our FAISS index, enabling our chatbot to provide accurate and context-aware responses.

- Notebook: https://gitlab.com/juliensimon/huggin...
- LangChain: https://www.langchain.com/
- FAISS: https://github.com/facebookresearch/f...
- Embedding leaderboard: https://huggingface.co/spaces/mteb/le...
- Embedding model: https://huggingface.co/BAAI/bge-small...
- LLM: https://huggingface.co/mistralai/Mist...

Комментарии

Информация по комментариям в разработке