LlamaIndex Sessions: Practical Tips and Tricks for Productionizing RAG (feat. Sisil @ Jasper)

Описание к видео LlamaIndex Sessions: Practical Tips and Tricks for Productionizing RAG (feat. Sisil @ Jasper)

In this video, Sisil Mehta (ML eng @, Jasper) walks through practical tips and tricks that his team implemented for productionizing a RAG system at Jasper.ai, backed by LlamaIndex abstractions.

These tricks include the following:
1. Picking a proper PDF parser that can maintain semantic structure, parse text from tables/images, and be represented as XML or Markdown
2. Adding the right "layers" of metadata; besides global document context, also inject summary context from "sub-documents" to more precisely localize context.
3. Hybrid fusion between different retrieval methods
4. LLM-powered reranking. Reduce token usage by reranking summaries that reference underlying chunks.
5. Use XML and emotion prompting to get well-structured outputs free of hallucinations

Комментарии

Информация по комментариям в разработке