Skip to content
View sarahal-said's full-sized avatar

Block or report sarahal-said

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SarahAl-Said/README.md

About

sarah = {
    "role"       : "Software Engineer",
    "focus"      : ["Prompt Engineering", "RAG Architecture", "LLM Evals", "AI Safety"],
    "stack"      : ["Python", "LangChain", "LangGraph", "FastAPI", "AWS"],
    "experience" : "software engineering · healthcare operations",
    "education"  : "BS + MS Computer Science",
    "location"   : "South Florida · Remote",
    "status"     : "Open to Software Engineering roles",
}

I build machine learning systems that are reliable by design — eval-first development, guardrails from day one, observability as a first-class concern. With software engineering experience across financial services, healthcare, and marketing technology, and a background in regulated healthcare operations, I build AI that works in the real world.


What I Build

🔍 RAG Pipelines

Hybrid retrieval with BM25 + dense search, reranking, citation tracing, and hallucination guardrails. Not toy demos — production systems with measurable accuracy improvements.

🧪 LLM Evaluation

Eval frameworks that treat prompts like code. Offline regression tests, online A/B experiments, CI/CD gates that block deploys on faithfulness regression.

🛡️ AI Safety Infrastructure

Drop-in LLM proxies with PII redaction, toxicity filtering, red-team test suites, and real-time observability dashboards for production LLM I/O.

⚡ Prompt Systems at Scale

Prompt architecture that's versioned, measured, and production-hardened. System prompts, few-shot templates, dynamic context injection — all treated as engineered artifacts.

Featured Projects

Modular RAG framework for any document corpus with hybrid retrieval and reranking.

55% improvement in answer relevance over naive RAG

Python LangChain Pinecone FastAPI AWS

🧪 EvalKit

Open-source LLMOps eval harness with prompt versioning and CI/CD regression gating.

CI/CD gate blocks deploys on faithfulness regression

Python RAGAS MLflow DeepEval GitHub Actions

🛡️ SafeLayer

Drop-in LLM safety proxy with PII redaction, adversarial testing, and live observability.

200+ red-team prompt templates across 12 risk categories

Python FastAPI Presidio OpenTelemetry Docker


Tech Stack

LLMs & Orchestration

Python LangChain LlamaIndex LangGraph OpenAI Hugging Face AWS Bedrock

Vector Databases

Pinecone FAISS pgvector Weaviate Chroma

Cloud & DevOps

AWS Docker Kubernetes GitHub Actions Terraform

LLMOps & Observability

MLflow FastAPI OpenTelemetry PostgreSQL Redis


GitHub Stats


Currently

  • 🔨   Building open-source LLM eval and safety tooling in public
  • 📖   Completing MS in Computer Science at Western Governors University
  • 🔍   Open to AI Engineering · Prompt Engineering · LLM Engineer roles
  • ✍️   Portfolio → (https://sarahalsaid.com/)

Good AI isn't clever — it's reliable, measured, and built to last.

Pinned Loading

  1. dev-draw-app dev-draw-app Public

    Forked from DrummerDee/DevDrawApp

    CSS 2

  2. HealthierMindApp HealthierMindApp Public

    Forked from Nafisa-Huda/HealthierMindApp

    A mindfulness-based web app for kids that provides an interactive journey to being more in-tune with their emotions.

    HTML 4

  3. MediTracker MediTracker Public

    Track all your medical appointments in one place

    CSS