AI-Powered Content Analysis Platform for Educational and Research Content
Atlas is a comprehensive platform that combines YouTube video analysis, academic paper research, and educational content generation into a unified AI-powered workflow.
- Video Search: Natural language search using YouTube Data API
- Transcript Extraction: Automatic subtitle fetching and processing
- AI Summarization: Technical content analysis with structured insights
- Comparison Analysis: Multi-video comparison with AI-powered insights
- Semantic Search: Query academic papers using natural language
- Citation Tracking: Source papers with relevance scores
- Vector Database: LanceDB-powered semantic search
- Paper Management: Automatic PDF processing and indexing
- Assignment Generation: AI-created hands-on learning exercises
- Learning Objectives: Structured educational outcomes
- Progressive Tasks: Step-by-step skill building activities
- Assessment Criteria: Clear success metrics and rubrics
- Parallel Execution: Concurrent processing for faster results
- Real-time Tracking: Progressive visualization of pipeline steps
- Professional Interface: Modern web UI with responsive design
- Configurable Workers: Adjustable concurrency for optimal performance
- Python 3.8+
- OpenRouter API key
- YouTube Data API key
git clone https://github.com/ishandutta0098/atlas
cd atlas
pip install -r requirements.txt# Create .env file
echo "OPENROUTER_API_KEY=your_openrouter_key" >> .env
echo "YOUTUBE_API_KEY=your_youtube_key" >> .envpython app.pyAccess the web interface at http://localhost:7860
- Enter a search query (e.g., "Python machine learning tutorial")
- Configure max videos and workers
- Click "Start Pipeline" to begin processing
- View results: search → transcripts → summaries → comparison → assignments
- Ensure papers are in
papers/agents/folder - Enter natural language query
- Get AI responses with paper citations and excerpts
Key settings in src/configs/config.yaml:
- Model: OpenRouter model selection (using OpenAI-compatible models)
- Workers: Parallel processing configuration
- API: Timeout and retry settings
- Paths: Output directories and file locations
atlas/
├── app.py # Main Gradio web interface
├── src/
│ ├── youtube_pipeline.py # YouTube processing pipeline
│ ├── papers_rag.py # Academic papers RAG system
│ ├── assignment_generator.py # Educational content generator
│ ├── compare_youtube_outputs.py # Video comparison analysis
│ └── configs/config.yaml # Configuration settings
├── papers/agents/ # Academic papers directory
└── requirements.txt # Python dependencies
MIT License - See LICENSE file for details