🧪 Outskill AI Lab

Production-grade AI Agent projects built with the OpenAI Agents SDK
Multi-agent pipelines · Tool orchestration · Guardrails · Real-world APIs

About

Outskill AI Lab is a collection of production-grade AI Agent projects, each designed as a complete, end-to-end multi-agent system. Every project follows battle-tested agentic design patterns — prompt chaining, routing, tool use, guardrails, human-in-the-loop escalation, and observability — implemented with the OpenAI Agents SDK and powered by OpenRouter.

Each project is self-contained with its own agents, tools, models, guardrails, simulators (or real APIs), documentation, and a working main.py you can run immediately.

Contributing — Want to add a new project or improve an existing one? Open an issue and let's discuss!

Projects

#	Project	Agents	Tools	Description
1	AI Ops Incident Response	6	12	Autonomous incident response — anomaly detection, root-cause analysis, remediation proposals
2	Cybersecurity Threat Detection	6	12	Autonomous SOC analyst — SIEM event correlation, MITRE ATT&CK mapping, containment actions
3	Customer Support	6	18	Autonomous support rep — intent classification, order/billing/technical support, CSAT prediction
4	Deep Research	7	24	Autonomous researcher — multi-source web search, source evaluation, cited reports (real APIs)
5	Browser Automation	6	5	Autonomous browser controller — web scraping, form automation, structured data extraction (Stagehand)

Project Details

1. AI Ops Incident Response Agent

A multi-agent incident response system that autonomously detects anomalies, performs root-cause analysis, correlates signals across observability data, and proposes remediation actions.

Pipeline: Triage → Log Analyzer → Metrics Analyzer → Root Cause Analyzer → Remediation → Incident Reporter
Simulators: Alert, log, metrics, and trace simulators with 5 pre-built scenarios
Guardrails: Input validation + remediation safety (blocks destructive actions in production)
Scenarios: cpu_spike, memory_leak, error_rate_surge, latency_degradation, cascading_failure

📖 README · Architecture · Code Guide

2. Cybersecurity Threat Detection Agent

An autonomous SOC analyst that ingests SIEM-style security events, correlates threats across authentication, network, API, endpoint, and cloud data, assigns threat scores, maps to MITRE ATT&CK, and proposes containment actions.

Pipeline: Alert Intake → Auth Analyzer / Network Analyzer → Threat Intel → Containment → SOC Reporter
Simulators: Auth log, network log, API access, cloud audit, and endpoint simulators with 5 threat scenarios
Guardrails: Input validation + containment safety (validates severity thresholds before action)
Scenarios: brute_force_attack, insider_threat, api_key_compromise, malware_lateral_movement, cloud_misconfiguration

📖 README · Architecture · Code Guide

3. Customer Support Agent

An autonomous customer support representative that classifies intents, looks up orders and billing, searches knowledge bases, processes refunds/returns, escalates complex cases, and predicts customer satisfaction.

Pipeline: Intake & Router → Order / Billing / Technical Support → Escalation → Resolution & CSAT
Simulators: Customer, order, billing, and knowledge base simulators with 5 support scenarios
Guardrails: Input validation + response safety (PII leakage prevention, refund caps, language filtering)
Scenarios: delayed_order, refund_request, billing_dispute, technical_issue, complex_escalation

📖 README · Architecture · Code Guide

4. Deep Research Agent

An autonomous research system that searches the live web, retrieves academic papers, extracts content, evaluates sources, and synthesizes long-form research reports with citations and confidence scores.

Pipeline: Research Planner → Web / Academic / News Researcher → Content Extractor → Synthesizer → Report Writer
Real APIs: Tavily, DuckDuckGo, Wikipedia, arXiv, Semantic Scholar, Jina Reader, YouTube, Google News, Reddit, GitHub, StackExchange
Guardrails: Input validation (query substantiveness) + output quality (citations, structure, no hallucinated URLs)
Capabilities: APA citations, cross-referencing, credibility scoring, confidence assessment, structured markdown reports

📖 README · Architecture · Code Guide

5. Browser Automation Agent

An autonomous browser controller that navigates web pages, interacts with elements, and extracts structured data using Stagehand (local Chrome) for browser control and OpenAI Agents SDK for LLM orchestration.

Pipeline: Task Planner → Navigator → Interactor → Extractor → Validator → Reporter
Browser Backend: Stagehand Python SDK with local headless Chrome (no cloud browser service needed)
Guardrails: Input validation (task + API keys) + output quality (report structure, data presence)
Scenarios: Web scraping (Hacker News posts), form automation (Google Search + extract results)

📖 README · Architecture · Code Guide

Tech Stack

Component	Technology
Agent Framework	OpenAI Agents SDK (`openai-agents`)
LLM Provider	OpenRouter (access to GPT-4.1, GPT-5, Claude, etc.)
Language	Python 3.12+
Package Manager	uv
Type System	Modern Python annotations, dataclasses, Literal types, Protocol classes
Web Search	Tavily, DuckDuckGo
Academic Search	arXiv, Semantic Scholar, Wikipedia
Content Extraction	Jina Reader, BeautifulSoup, YouTube Transcript API
News & Community	Google News RSS, Reddit, GitHub, StackExchange
Browser Automation	Stagehand Python SDK (local Chrome)

Quick Start

Prerequisites

Python 3.12+
uv — fast Python package manager

# Install uv (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh

1. Clone the Repository

git clone https://github.com/ishandutta0098/outskill-ai-lab.git
cd outskill-ai-lab

2. Install Dependencies

uv sync

This installs all dependencies from pyproject.toml into a virtual environment managed by uv.

3. Configure Environment Variables

Create a .env file in the project root (refer to dummy.env for the template):

# Required for all projects — LLM access via OpenRouter
OPENROUTER_API_KEY=your_openrouter_key_here

# Required for Deep Research Agent — Tavily web search
TAVILY_API_KEY=your_tavily_key_here

# Required for Browser Automation Agent — Stagehand's internal LLM
MODEL_API_KEY=your_model_api_key_here

Where to get API keys:

Key	Provider	Free Tier
`OPENROUTER_API_KEY`	openrouter.ai	Free credits on signup
`TAVILY_API_KEY`	tavily.com	1,000 searches/month
`MODEL_API_KEY`	openrouter.ai (or OpenAI)	For Stagehand's browser AI

Note: All other external APIs used by the Deep Research Agent (DuckDuckGo, Wikipedia, arXiv, Semantic Scholar, Jina Reader, YouTube, Google News, Reddit, GitHub, StackExchange) require no API key.

4. Run a Project

Each project has its own main.py. Run from the repository root:

# AI Ops Incident Response Agent
PYTHONPATH=projects uv run python -m aiops_incident_response_agent.main

# Cybersecurity Threat Detection Agent
PYTHONPATH=projects uv run python -m cybersecurity_threat_detection_agent.main

# Customer Support Agent
PYTHONPATH=projects uv run python -m customer_support_agent.main

# Deep Research Agent
PYTHONPATH=projects uv run python -m deep_research_agent.main

# Browser Automation Agent
PYTHONPATH=projects uv run python -m browser_automation_agent.main

Or run programmatically:

import asyncio
from deep_research_agent.main import run_deep_research

async def main():
    report = await run_deep_research("What are the latest advancements in quantum computing?")
    print(report)

asyncio.run(main())

Repository Structure

outskill-ai-lab/
├── .env                          # API keys (create from dummy.env)
├── dummy.env                     # Environment variable template
├── pyproject.toml                # Dependencies and project metadata
├── .cursor/rules/                # Agentic design pattern rules
│   └── agentic_design_patterns.mdc
│
└── projects/
    ├── aiops_incident_response_agent/
    │   ├── main.py               # Pipeline entry point
    │   ├── agents/               # 6 agents (triage → reporter)
    │   ├── tools/                # 12 function tools
    │   ├── models/               # Dataclass models
    │   ├── simulators/           # Event simulators + scenario engine
    │   ├── guardrails/           # Input + output guardrails
    │   ├── utils/                # Config loader
    │   ├── README.md
    │   ├── ARCHITECTURE.md
    │   └── CODE.md
    │
    ├── cybersecurity_threat_detection_agent/
    │   ├── main.py               # Pipeline entry point
    │   ├── agents/               # 6 agents (alert intake → SOC reporter)
    │   ├── tools/                # 12 function tools
    │   ├── models/               # Dataclass models
    │   ├── simulators/           # Security event simulators + scenario engine
    │   ├── guardrails/           # Input + containment safety guardrails
    │   ├── utils/                # Config loader
    │   ├── README.md
    │   ├── ARCHITECTURE.md
    │   └── CODE.md
    │
    ├── customer_support_agent/
    │   ├── main.py               # Pipeline entry point
    │   ├── agents/               # 6 agents (intake → resolution)
    │   ├── tools/                # 18 function tools
    │   ├── models/               # Dataclass models
    │   ├── simulators/           # Customer data simulators + scenario engine
    │   ├── guardrails/           # Input + response safety guardrails
    │   ├── utils/                # Config loader
    │   ├── README.md
    │   ├── ARCHITECTURE.md
    │   └── CODE.md
    │
    ├── deep_research_agent/
    │   ├── main.py               # Pipeline entry point
    │   ├── agents/               # 7 agents (planner → report writer)
    │   ├── tools/                # 24 function tools (real external APIs)
    │   ├── models/               # Dataclass models
    │   ├── guardrails/           # Input + output quality guardrails
    │   ├── utils/                # Config loader
    │   ├── README.md
    │   ├── ARCHITECTURE.md
    │   └── CODE.md
    │
    └── browser_automation_agent/
        ├── main.py               # Pipeline entry point
        ├── agents/               # 6 agents (planner → reporter)
        ├── tools/                # 5 function tools (Stagehand wrappers)
        ├── models/               # Dataclass models
        ├── guardrails/           # Input + output quality guardrails
        ├── utils/                # Config loader
        ├── README.md
        ├── ARCHITECTURE.md
        └── CODE.md

Architecture & Design Patterns

Every project in this lab follows a consistent set of agentic design patterns, distilled from Agentic Design Patterns by Antonio Gulli:

Pattern	Description	Applied In
Default Agent Loop	Goal → Context → Plan → Act → Reflect	All agents
Prompt Chaining	Sequential handoff chain where each agent's output feeds the next	All pipelines
Routing	Entry agent classifies input and routes to specialist	All entry agents
Tool Use	Typed function tools with JSON schemas, strict I/O	71 tools total
Multi-Agent	Single-responsibility agents communicating via handoffs	31 agents total
Guardrails & Safety	Input validation + output safety on entry/terminal agents	All projects
Human-in-the-Loop	Escalation paths for cases beyond automated resolution	Customer Support, Cybersecurity
Reflection	Self-evaluation of source quality and cross-referencing	Deep Research
Evaluation & Monitoring	AgentHooks for real-time observability of all actions	All projects
Exception Handling	Graceful degradation with error-as-return-values in tools	All tools

Shared Architectural Principles

Functional core, imperative shell — Pure dataclasses and tool functions internally; orchestration in main.py
Composition over inheritance — Agents composed from tools, handoffs, and guardrails
Errors as values — Tools return JSON error objects, never raise exceptions
Immutable data models — Frozen dataclasses for all domain types
Structured logging — All tools and agents log via Python's logging module
OpenRouter as LLM provider — Drop-in replacement for OpenAI API, supporting any model

Documentation

Each project includes three documentation files:

File	Purpose
`README.md`	Setup instructions, features, usage examples, project structure
`ARCHITECTURE.md`	System design, agent responsibilities, handoff topology, data flow, guardrails
`CODE.md`	OpenAI Agents SDK constructs, tool patterns, model explanations, scoring algorithms

Contributing

Contributions are welcome! If you'd like to:

Add a new project — Open an issue describing the agent system you'd like to build
Improve an existing project — Open a PR with your changes
Report a bug — Open an issue with reproduction steps

Please follow the coding conventions established in the workspace rules:

Modern Python 3.12+ type annotations
Dataclasses for domain models (prefer frozen)
Literal types over string enums
Global imports, early returns, match-case over if-elif-else
Structured logging (no f-strings in log calls)
Errors as return values (no try-except)

License

This project is licensed under the MIT License.

Built with ❤️ by Ishan Dutta

Name		Name	Last commit message	Last commit date
Latest commit History 160 Commits
.cursor/rules		.cursor/rules
projects		projects
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
dummy.env		dummy.env
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧪 Outskill AI Lab

About

Projects

Project Details

Tech Stack

Quick Start

Prerequisites

1. Clone the Repository

2. Install Dependencies

3. Configure Environment Variables

4. Run a Project

Repository Structure

Architecture & Design Patterns

Shared Architectural Principles

Documentation

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

eng-accelerator/outskill-ai-lab

Folders and files

Latest commit

History

Repository files navigation

🧪 Outskill AI Lab

About

Projects

Project Details

Tech Stack

Quick Start

Prerequisites

1. Clone the Repository

2. Install Dependencies

3. Configure Environment Variables

4. Run a Project

Repository Structure

Architecture & Design Patterns

Shared Architectural Principles

Documentation

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages