Add embedding attack paper to Sentence-level Attack by WhymustIhaveaname · Pull Request #59 · thunlp/TAADpapers

WhymustIhaveaname · 2026-04-04T16:13:49Z

Hi, this PR adds one paper to the Sentence-level Attack section:

Jailbreaking LLMs' Safeguard with Universal Magic Words for Text Embedding Models (arXiv 2501.18280)

The paper discovers universal adversarial suffixes ("magic words") by exploiting bias in text embedding models, enabling both black-box and white-box attacks on LLM safeguards. Tagged as blind and gradient per the repo convention.

Also updated PaperNumber from 155 to 156.

Add embedding attack paper to Sentence-level Attack

f2e3170

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add embedding attack paper to Sentence-level Attack#59

Add embedding attack paper to Sentence-level Attack#59
WhymustIhaveaname wants to merge 1 commit intothunlp:masterfrom
WhymustIhaveaname:add-magic-words-paper

WhymustIhaveaname commented Apr 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

WhymustIhaveaname commented Apr 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant