Notebooks
N
NVIDIA
05 Complexquery AdvancedRAG

05 Complexquery AdvancedRAG

gpu-accelerationretrieval-augmented-generationllm-inferencetensorrtnvidia-generative-ai-examplesoran-chatbot-multimodallarge-language-modelsevalsmicroservicetriton-inference-servercommunityLLMragnemo

Advanced RAG Testbed for ORAN chatbot

[ ]
[ ]
[ ]

Load Documents

Follow the step number 1 defined here to upload the pdf's to Milvus server.

[ ]
[ ]
[ ]
[ ]

Generate and Store Embeddings

a) Generate Embeddings

Embeddings for documents are created by vectorizing the document text; this vectorization captures the semantic meaning of the text.

[ ]
[ ]
[ ]
[ ]
[ ]
[ ]
[ ]
[ ]
[ ]
[ ]
[ ]

lets try base RAG

[ ]
[ ]

lets try multiple query based RAG

[ ]
[ ]

lets try HYDE (hypothetical llm answer) based RAG

[ ]
[ ]
[ ]