Xinfeng Li
LetterJohn
AI & ML interests
Trustworthy AI, AI for Security & Privacy
Recent Activity
upvoted a paper 10 days ago
Internal Safety Collapse in Frontier Large Language Models
upvoted a paper about 1 month ago
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning
upvoted a paper 2 months ago
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security