📚 Multi-PDF RAG Streamlit App

A Streamlit-based multi-PDF document Question & Answer system using
Retrieval-Augmented Generation (RAG) powered by Llama-3 via Groq and ChromaDB.


🚀 Features

  • 📄 Upload multiple PDFs
  • 🔍 Semantic search using HuggingFace embeddings
  • 🧠 Accurate answers using Llama-3.3-70B (Groq)
  • 🧩 Vector storage with ChromaDB
  • ❓ Ask multiple questions at once
  • 🧠 Clean Question → Answer UI
  • ⚡ Fast inference via Groq API

🏗️ Project Structure

├── app.py # Streamlit app
├── rag_utility.py # PDF processing + RAG logic
├── requirements.txt # Dependencies
├── env_template.txt # Environment variable template
├── .gitignore
├── LICENSE
└── README.md

🧠 How It Works (RAG Pipeline)

  1. Upload PDFs using Streamlit
  2. PDFs are:
    • Loaded using UnstructuredPDFLoader
    • Split into chunks
    • Converted into embeddings
  3. Embeddings are stored in ChromaDB
  4. For each user question:
    • Relevant chunks are retrieved via similarity search
    • The question plus retrieved context is passed to Llama-3
  5. Model returns grounded answers
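The retrieval side of the pipeline above can be sketched in plain Python. This is an illustrative toy only: the hash-free bag-of-words "embedding" and cosine similarity stand in for the all-MiniLM-L6-v2 embeddings and ChromaDB search the app actually uses via LangChain, and the final Groq call is omitted:

```python
import math
from collections import Counter

def chunk_text(text, chunk_size=200, overlap=50):
    """Split text into overlapping character chunks (step 2)."""
    step = chunk_size - overlap
    return [text[start:start + chunk_size]
            for start in range(0, max(len(text) - overlap, 1), step)]

def embed(text):
    """Toy bag-of-words 'embedding' (the real app uses all-MiniLM-L6-v2)."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(question, chunks, k=2):
    """Similarity search over chunk embeddings (steps 3-4)."""
    q = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]

docs = "An ecosystem is a community of organisms. Deserts are one type of ecosystem."
chunks = chunk_text(docs, chunk_size=40, overlap=10)
context = retrieve("What is an ecosystem?", chunks, k=1)
# The retrieved context plus the question would then be sent to Llama-3 via Groq.
```

In the real app, `chunk_text`, `embed`, and `retrieve` are replaced by LangChain's text splitter, HuggingFace embeddings, and a ChromaDB similarity search, respectively.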

⚙️ Setup Instructions

1️⃣ Clone the repository

git clone https://github.com/your-username/multi-pdf-rag-streamlit.git
cd multi-pdf-rag-streamlit

2️⃣ Create a virtual environment (recommended)

conda create -n rag python=3.10
conda activate rag

3️⃣ Install dependencies

pip install -r requirements.txt

🔐 Environment Variables

Create a .env file in the root directory:

GROQ_API_KEY=your_groq_api_key_here

You can refer to env_template.txt for guidance.
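In code, the key can be read from the environment at startup. This is a minimal standard-library sketch; `get_groq_api_key` is a hypothetical helper name, not necessarily the one used in rag_utility.py:

```python
import os

def get_groq_api_key():
    """Read the Groq API key from the environment; fail fast if it is missing."""
    key = os.environ.get("GROQ_API_KEY")
    if not key:
        raise RuntimeError("GROQ_API_KEY is not set; see env_template.txt")
    return key
```

Failing fast here gives a clear error message instead of an opaque authentication failure on the first Groq request.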


▶️ Run the Application

streamlit run app.py

🧪 Example Usage

  1. Upload one or more PDFs
  2. Enter questions (one per line), for example:
What is an ecosystem?
What are the types of ecosystems?
Forerunners of Evo-Devo?
  3. Click Answer
  4. Get structured Question → Answer results
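Turning the multi-line question box into individual questions might look like the sketch below; `parse_questions` is a hypothetical helper, not necessarily the name used in app.py:

```python
def parse_questions(raw: str) -> list[str]:
    """One question per line; ignore blank lines and stray whitespace."""
    return [line.strip() for line in raw.splitlines() if line.strip()]

raw = """What is an ecosystem?

What are the types of ecosystems?
Forerunners of Evo-Devo?
"""
questions = parse_questions(raw)
# → ['What is an ecosystem?', 'What are the types of ecosystems?', 'Forerunners of Evo-Devo?']
```

Each parsed question can then be run through the RAG pipeline independently and paired with its answer in the UI.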

🧩 Tech Stack

  1. Frontend: Streamlit
  2. LLM: Llama-3.3-70B (Groq)
  3. Embeddings: all-MiniLM-L6-v2
  4. Vector Database: ChromaDB
  5. Framework: LangChain
  6. Language: Python

🌍 Deployment

This application can be deployed on:

  1. Streamlit Cloud
  2. Docker
  3. Any cloud VM (AWS / GCP / Azure)
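For the Docker option, a minimal Dockerfile might look like the following. This is an illustrative sketch, not a file shipped with the repo; the base image and port are assumptions (8501 is Streamlit's default):

```dockerfile
FROM python:3.10-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
EXPOSE 8501
CMD ["streamlit", "run", "app.py", "--server.port=8501", "--server.address=0.0.0.0"]
```

Pass the Groq key at run time, e.g. `docker run -p 8501:8501 -e GROQ_API_KEY=... <image>`, rather than baking it into the image.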

📜 License

This project is licensed under the MIT License. You are free to use, modify, and distribute it.


🙌 Acknowledgements

  1. Groq
  2. LangChain
  3. HuggingFace
  4. Streamlit
