The models of the paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usability".
Xiaoya Lu
Ursulalala
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
9 days ago
DeepSight: An All-in-One LM Safety Toolkit
upvoted
a
paper
26 days ago
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security