Ethos — Autonomous Developer Onboarding Agent

Problem Statement: PS03 – Autonomous Developer Onboarding Agent

Team Name: Ruby

Team Members:

Hritik Pandey (Leader)
Sruti Baliga
Harsh Chaudhari
Kushal Naik

The Three Access Patterns

Think of it like three tiers:

Tier 1 — RAG Ingestion (Always Searchable)
These are chunked, embedded, and stored in a vector DB. The agent retrieves relevant chunks when the user asks questions.
Tier 2 — Agent Logic Files (Read On-Demand by Agent, Not User)
The agent reads these at specific trigger points during its workflow — like Claude Skills. The user never queries these directly.
Tier 3 — Templates (Read Once at Generation Time)
The agent loads these only when it needs to produce a specific output.

Run Backend (Python 3.11)

python -m uvicorn backend.main:app --reload

File-by-File Breakdown

Quick Links

File	Description
company_overview.md	Company info, products, tech stack
engineering_standards.md	Coding rules, PR process, API design
architecture_documentation.md	Services & system architecture
setup_guides.md	Environment setup instructions
policies.md	Security, leave, VPN, compliance
org_structure.md	Teams, contacts, channels
onboarding_faq.md	Common onboarding questions
employee_personas.md	Role-based onboarding personas
onboarding_checklists.md	Step-by-step onboarding checklists
starter_tickets.md	First task tickets by role
email_templates.md	HR email templates
guidelines.md	PS03 guidelines

📚 Tier 1 — RAG Ingestion (Vector DB)

These go into your knowledge base because users will ask questions about this content.

File	Why Ingest	Example User Query
`company_overview.md`	User asks about company, products, tech stack	"What does NovaByte build?"
`engineering_standards.md`	User asks about coding rules, PR process, API design	"What's the branching strategy?"
`architecture_documentation.md`	User asks about services, how things connect	"Which service handles notifications?"
`setup_guides.md`	User asks for help setting up their environment	"How do I install Python for this project?"
`policies.md`	User asks about security, leave, VPN, compliance	"Do I need VPN to access staging?"
`org_structure.md`	User asks about teams, contacts, channels	"Who do I contact for GitHub access?"
`onboarding_faq.md`	User asks common onboarding questions	"How many PR approvals do I need?"

Total: 7 files → All in:

knowledge-base/
company-structure/
faq/

🧠 Tier 2 — Agent Logic Files (Read by Agent, Not by User)

These are not for the user to query.
The agent reads them internally at specific decision points in its workflow — exactly like Claude Skills.

`personas/employee_personas.md`

When to read:
Right after the user introduces themselves

"Hi, I'm Riya, a Backend Intern working on Node.js"
Purpose:
The agent matches the user's input (role, level, stack) against these personas to determine the correct onboarding path.
How it works:
1. Agent extracts → role, experience level, tech stack
2. Matches to closest persona
3. Uses the "Expected Onboarding Focus" section to plan the flow
❗ NOT ingested into RAG — the user should never see or query other employees' profiles

`checklists/onboarding_checklists.md`

When to read:
After persona is identified
Purpose:
Agent loads the Common Checklist + the matching role-specific checklist and guides the user step-by-step.
How it works:
1. Agent identifies persona → e.g., Backend Intern (Node.js)
2. Loads Common Checklist (C-01 to C-28)
3. Loads Backend Intern Checklist (BI-01 to BI-20)
4. Merges them into one ordered onboarding plan
5. Tracks completion as the user progresses
❗ NOT ingested into RAG — the agent uses this as a structured state machine, not searchable content

`starter-tickets/starter_tickets.md`

When to read:
When the user reaches the "First Task" phase of onboarding
Purpose:
Agent selects and assigns the correct starter ticket based on role.
How it works:
1. Agent matches role
2. Selects appropriate ticket
  - Example: FLOW-INTERN-001 for Backend Intern
3. Presents it to the user with full context
❗ NOT ingested into RAG — agent reads this only at a specific workflow step

📝 Tier 3 — Templates (Read Once at Output Time)

`hr-templates/email_templates.md`

When to read:
Only when the agent is about to generate the HR completion email
Purpose:
Agent loads the template and fills placeholder variables using tracked state data (checklist completion, employee info, timestamps).
How it works:
1. User completes all checklist items
2. Agent detects → onboarding_status = COMPLETED
3. Agent reads email_templates.md
4. Picks Template 1 (Completion Report)
5. Fills all {placeholders} with real data
6. Sends/displays the structured email
❗ NOT ingested into RAG — this is a generation template, not knowledge

Visual Summary

flowchart TD
    START([USER MESSAGE ARRIVES]) --> Q1{Is it a question?}

    Q1 -->|YES| T1["🔍 Search RAG — Tier 1
    knowledge-base/all
    org_structure.md
    onboarding_faq.md"]

    Q1 -->|NO — action/flow| Q2{Is it a self-introduction?}

    Q2 -->|YES| T2["👤 Read Personas — Tier 2
    Match role/level/stack
    Load matching checklist"]

    Q2 -->|NO| Q3{"Is it a checklist action?
    done / next step"}

    Q3 -->|YES| T3["✅ Update Checklist State — Tier 2
    Mark item complete
    Serve next item"]

    Q3 -->|NO| Q4{Ready for first task?}

    Q4 -->|YES| T4["🎫 Read Starter Tickets — Tier 2
    Assign matching ticket"]

    Q4 -->|NO| Q5{All items completed?}

    Q5 -->|YES| T5["📧 Read Email Template — Tier 3
    Fill placeholders
    Generate HR email"]

In Code Terms

# Tier 1 — Loaded into vector DB at startup
rag.ingest("knowledge-base/*.md")
rag.ingest("company-structure/org_structure.md")
rag.ingest("faq/onboarding_faq.md")


# Tier 2 — Read by agent at specific triggers
def on_user_introduction(user_input):
    personas = read_file("personas/employee_personas.md")  # Skill-like read
    matched = match_persona(user_input, personas)

    checklist = read_file("checklists/onboarding_checklists.md")  # Skill-like read
    plan = build_onboarding_plan(matched, checklist)

    return plan


def on_first_task_phase():
    tickets = read_file("starter-tickets/starter_tickets.md")  # Skill-like read
    return assign_ticket(current_persona, tickets)


# Tier 3 — Read only at generation time
def on_onboarding_complete(state):
    template = read_file("hr-templates/email_templates.md")  # Template read
    email = fill_template(template, state)
    send_email(email)

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
.agents/workflows		.agents/workflows
backend		backend
data		data
frontend		frontend
.env.example		.env.example
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
README.md		README.md
ethos.db		ethos.db
pyrightconfig.json		pyrightconfig.json
requirements.txt		requirements.txt
run_app.sh		run_app.sh
test_neo4j.py		test_neo4j.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ethos — Autonomous Developer Onboarding Agent

The Three Access Patterns

Run Backend (Python 3.11)

File-by-File Breakdown

Quick Links

📚 Tier 1 — RAG Ingestion (Vector DB)

🧠 Tier 2 — Agent Logic Files (Read by Agent, Not by User)

`personas/employee_personas.md`

`checklists/onboarding_checklists.md`

`starter-tickets/starter_tickets.md`

📝 Tier 3 — Templates (Read Once at Output Time)

`hr-templates/email_templates.md`

Visual Summary

In Code Terms

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Ethos — Autonomous Developer Onboarding Agent

The Three Access Patterns

Run Backend (Python 3.11)

File-by-File Breakdown

Quick Links

📚 Tier 1 — RAG Ingestion (Vector DB)

🧠 Tier 2 — Agent Logic Files (Read by Agent, Not by User)

personas/employee_personas.md

checklists/onboarding_checklists.md

starter-tickets/starter_tickets.md

📝 Tier 3 — Templates (Read Once at Output Time)

hr-templates/email_templates.md

Visual Summary

In Code Terms

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`personas/employee_personas.md`

`checklists/onboarding_checklists.md`

`starter-tickets/starter_tickets.md`

`hr-templates/email_templates.md`

Packages