Xiangchen Song (宋祥辰)


I am a PhD student in the Machine Learning Department at Carnegie Mellon University, advised by Prof. Kun Zhang (CMU-CLeaR Group). Previously, I studied Computer Science at UIUC with Prof. Jiawei Han.

research

Large language models are sequence models, and my work aims to make their internal representations provably identifiable so we can interpret and steer model behavior with principled guarantees. I build on causal representation learning to recover latent structure in temporal data (time series, video, and text), and bring this lens to mechanistic interpretability of LLMs: designing identifiable sparse autoencoders with feature consistency for LLM activations, analyzing internal reasoning mechanisms, and enabling targeted control for more reliable and efficient model behavior.
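To make the sparse-autoencoder direction concrete: an SAE maps an LLM's activation vectors into a wider, sparse feature space and reconstructs them from the few active features. The sketch below is purely illustrative, not the method from my papers; the top-k sparsity rule, dimensions, and all names are assumptions for the example.

```python
import numpy as np

def sae_forward(x, W_enc, b_enc, W_dec, b_dec, k=4):
    """One forward pass of a toy top-k sparse autoencoder.

    x: (batch, d_model) activation vectors (e.g. from an LLM residual stream)
    Returns (reconstruction, sparse_codes).
    """
    # Encode: affine map into an overcomplete dictionary, then ReLU
    codes = np.maximum(x @ W_enc + b_enc, 0.0)
    # Hard sparsity: keep only the k largest features per example
    if k < codes.shape[1]:
        thresh = np.partition(codes, -k, axis=1)[:, -k][:, None]
        codes = np.where(codes >= thresh, codes, 0.0)
    # Decode the sparse codes back into activation space
    recon = codes @ W_dec + b_dec
    return recon, codes

# Tiny synthetic demo: 8-dim "activations", 32-feature dictionary
rng = np.random.default_rng(0)
d_model, d_dict, batch = 8, 32, 5
x = rng.normal(size=(batch, d_model))
W_enc = rng.normal(size=(d_model, d_dict)) * 0.1
W_dec = rng.normal(size=(d_dict, d_model)) * 0.1
recon, codes = sae_forward(x, W_enc, np.zeros(d_dict), W_dec, np.zeros(d_model), k=4)
```

Identifiability questions then ask when such learned features are recoverable up to benign ambiguities, so that two training runs yield consistent, comparable features rather than arbitrary rotations of one another.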

contact

Email: xiangchs [at] cs [dot] cmu [dot] edu

news

Apr 08, 2026 I received a Modal for Academics compute grant to support my research on LLM test-time training. Many thanks to Modal for their generous support!
Apr 06, 2026 One paper on mechanistic interpretability and one paper on diffusion large language models have been accepted to The 64th Annual Meeting of the Association for Computational Linguistics (ACL'2026)!
Sep 23, 2025 Two papers on efficient LLM reasoning have been accepted to the NeurIPS 2025 Workshop on Efficient Reasoning (ER@NeurIPS'2025)!
Sep 22, 2025 Two papers on mechanistic interpretability have been accepted to the Mechanistic Interpretability Workshop at NeurIPS 2025 (MechInterp@NeurIPS'2025)!
Sep 18, 2025 One paper, "LLM Interpretability with Identifiable Temporal-Instantaneous Representation," has been accepted to The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS'2025)!

selected publications

  1. Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs
    In The 64th Annual Meeting of the Association for Computational Linguistics, Jul 2026
    Earlier version appeared at the Mechanistic Interpretability Workshop at NeurIPS (Spotlight)
  2. LLM Interpretability with Identifiable Temporal-Instantaneous Representation
    Xiangchen Song*, Jiaqi Sun*, Zijian Li, Yujia Zheng, and Kun Zhang
    In The Thirty-ninth Annual Conference on Neural Information Processing Systems, Dec 2025
  3. Causal Temporal Representation Learning with Nonstationary Sparse Transition
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, Dec 2024
  4. Temporally Disentangled Representation Learning under Unknown Nonstationarity
    In The Thirty-seventh Annual Conference on Neural Information Processing Systems, Dec 2023