jailbreak-research

Here are 2 public repositories matching this topic...

VesperArch / epistemic-frame-probe

A classifier that separates epistemic frame from semantic content before LLM evaluation. Documents and mitigates authority-framing as a safety filter bypass vector.

hri dialog-systems hallucination adversarial-prompting safety-evaluation frame-detection jailbreak-research llm-security-epistemic-framing

Updated May 15, 2026
Python

ndpvt-web / aristotelian-compliance-test

Star

When Aristotle gets a LinkedIn account and starts red-teaming LLMs. System-prompt attack surface testing using first-principles axiom framework. Load it. Ask something terrible. Watch what happens.

Updated Mar 16, 2026

Improve this page

Add a description, image, and links to the jailbreak-research topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the jailbreak-research topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly