Simplified RAG

This repository contains a Python Jupyter Notebook that implements a simple Retrieval-Augmented Generation (RAG) pipeline using various state-of-the-art AI tools and APIs. This setup demonstrates how you can fetch the data from the web, use APIs to retrieve data, embed documents into a vector database, and create a chat-based language model to answer user queries with relevant information.

Structure

The notebook is structured as follows:

Environment Setup:
- Install necessary libraries and tools.
- Run Playwright for the first time if required.
Data Preparation:
- Scrape documents from the web.
- Load documents from the Perigon API.
- Split documents into chunks.
Document Embedding:
- Embed documents into a vector database using Qdrant.
- Retrieve documents based on similarity queries.
Initialize Language Models:
- Set up various LLMs such as Ollama, Groq, and OpenAI ChatGPT.
Testing Prompts:
- Query the models directly and via Retrieval-Augmented Generation (RAG).

Setup

Pre-requisites

Python 3.7 or above
Jupyter Notebook

Installation

Before Running the Notebook please make sure to set up environment variables:

Create a .env file in the root directory of the project and add the following variables:

PERIGON_API_KEY=your_perigon_api_key
GROQ_API_KEY=your_groq_api_key
OPENAI_API_KEY=your_openai_api_key

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.env.sample		.env.sample
.gitignore		.gitignore
Modelfile		Modelfile
RAG.ipynb		RAG.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Simplified RAG

Structure

Setup

Pre-requisites

Installation

About

Releases

Packages

Languages

goperigon/simplified-rag

Folders and files

Latest commit

History

Repository files navigation

Simplified RAG

Structure

Setup

Pre-requisites

Installation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages