Submitted by
DongGeon Lee
AI & ML interests
AI Safety & AI Security
Recent Activity
View all activity
Papers
COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs
X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates