Chenxiao Yu

Chenxiao Yu

Ph.D. Student in Computer Science

University of Southern California

Research Interests

Human-Centered AINLPInterpretability and ControllabilityAI Safety and Robustness

About

Chenxiao Yu is a Ph.D. student in Computer Science at the University of Southern California, where he is advised by Prof. Yue Zhao. He also works closely with Prof. Morteza Dehghani. He is from Hangzhou, China.

Latest Publications

View All →

Someone Hid It": Query-Agnostic Black-Box Attacks on LLM-Based Retrieval

Jiate Li, Defu Cao, Li Li, Wei Yang, Yuehan Qin, Chenxiao Yu, Tiannuo Yang, Ryan A Rossi, Yan Liu, Xiyang Hu, others

arXiv preprint arXiv:2602.00364

A query-agnostic black-box attack framework that manipulates LLM-based retrieval systems without knowledge of user queries.

Defenses Against Prompt Attacks Learn Surface Heuristics

Shawn Li, Chenxiao Yu, Zhiyu Ni, Hao Li, Charith Peris, Chaowei Xiao, Yue Zhao

arXiv preprint arXiv:2601.07185

Current defenses against prompt injection attacks rely on surface-level heuristics rather than robust understanding of attack patterns.

Tracing Moral Foundations in Large Language Models

Chenxiao Yu, Bowen Yi, Farzan Karimi-Malekabadi, Suhaib Abdurahman, Jinyi Ye, Shrikanth Narayanan, Yue Zhao, Morteza Dehghani

Do LLMs have human-like moral cognition?

Realistic threat perception drives intergroup conflict: A causal, dynamic analysis using generative-agent simulations

Suhaib Abdurahman, Farzan Karimi-Malekabadi, Chenxiao Yu, Nour S. Kteily, Morteza Dehghani

A causal analysis of how realistic threat perception drives intergroup conflict using generative-agent simulations.

Mitigating Hallucinations in Large Language Models via Causal Reasoning

Yuangang Li, Yiqing Shen, Yi Nian, Jiechao Gao, Ziyi Wang, Chenxiao Yu, Shawn Li, Jie Wang, Xiyang Hu, Yue Zhao

Proceedings of the AAAI Conference on Artificial Intelligence

A causal intervention framework that reduces hallucinations in LLMs by identifying and adjusting unstable internal pathways.

News

Feb 2026
Award

Received the USC 2026 CCLS Seed Funding Award! 🏆

Dec 2025
Conference

See U Guys @ NeurIPS 2025🎉