Publications

Google Scholar DBLP Semantic Scholar

2025

  • Xiangchen Song*, Jiaqi Sun*, Zijian Li, Yujia Zheng, Kun Zhang, “LLM Interpretability with Identifiable Temporal-Instantaneous Representation”, to appear The Thirty-ninth Conference on Neural Information Processing Systems (NeurIPS’25), Dec. 2025.

  • Xiangchen Song*, Aashiq Muhamed*, Yujia Zheng, Lingjing Kong, Zeyu Tang, Mona T. Diab, Virginia Smith, Kun Zhang, “Position: Mechanistic Interpretability Should Prioritize Feature Consistency in SAEs”, to appear Mechanistic Interpretability Workshop at NeurIPS 2025 (MechInterp@NeurIPS’25). Spotlight

  • Lingjing Kong, Shaoan Xie, Guangyi Chen, Yuewen Sun, Xiangchen Song, Eric P. Xing, Kun Zhang, “Beyond the Black Box: Identifiable Interpretation and Control in Generative Models via Causal Minimality”, to appear Mechanistic Interpretability Workshop at NeurIPS 2025 (MechInterp@NeurIPS’25).

  • Xiangchen Song, Saket Dingliwal, Sai Muralidhar Jayanthi, Vivek Govindan, Khiem Pham, Shobha Vasudevan, Beidi Chen, Sravan Babu Bodapati, Aram Galstyan, “Accelerating Speculative Reasoning with Internal Probing”, to appear NeurIPS 2025 Workshop on Efficient Reasoning (ER@NeurIPS’25).

  • Khiem Pham, Sai Muralidhar Jayanthi, Saket Dingliwal, Bhavana Ganesh, Karthik Valmeekam, Xiangchen Song, Vivek Govindan, Beidi Chen, Sravan Babu Bodapati, Aram Galstyan, “Internal Value Functions: Leveraging Hidden States for Efficient Test-Time Scaling in Large Reasoning Models”, to appear NeurIPS 2025 Workshop on Efficient Reasoning (ER@NeurIPS’25).

  • Zeyu Tang, Zhenhao Chen,Xiangchen Song, Loka Li, Yunlong Deng, Yifan Shen, Guangyi Chen, Peter Spirtes, Kun Zhang , “Reflection-Window Decoding: Text Generation with Selective Refinement”, in Proc. of The Forty-Second International Conference on Machine Learning (ICML’25), Jul. 2025.

  • Zijian Li, Yifan Shen, Kaitao Zheng, Ruichu Cai, Xiangchen Song, Mingming Gong, Guangyi Chen, Kun Zhang, “On the Identification of Temporal Causal Representation with Instantaneous Dependence”, in Proc. of The Thirteenth International Conference on Learning Representations (ICLR’25), May 2025. Oral

2024

2023

2022

2021

2020