Yixuan Xiao

I am currently a PhD student at the University of Stuttgart, supervised by Prof. Dr. Thang Vu. My research interests lie in speech processing tasks such as audio deepfake detection and speech synthesis. My contact info can be found here. I received my M.Sc. in Computational Linguistics, also from the University of Stuttgart. My Master’s thesis, titled “Mitigating Text Domain Mismatch in ASR Systems through Prompt-based Learning”, was also supervised by Prof. Dr. Thang Vu.

Prior to this, I worked as a senior algorithm engineer on Baidu’s Speech Team (specializing in high-performance computing) and on NetEase Youdao’s AI Team (specializing in ASR and computer-aided pronunciation training). Earlier, I completed a taught Master’s programme in Artificial Intelligence at the University of Edinburgh and a B.Sc. in Computer Science and Technology at the Beijing Institute of Technology, supervised by Prof. Dr. Xianling Mao.

Teaching

I am or have been a (co-)lecturer for the following courses:

  1. Advanced Deep Learning (2024WS)
  2. Current Topics in Speech Technology (2024WS)
  3. Computational Linguistics Team Laboratory: Phonetics (2025SS)
  4. Current Topics in Speech Technology (2025WS)
  5. Introduction to Deep Learning for Speech and Text Processing (2025WS)

Publications

  1. Yixuan Xiao and Thang Vu. 2025. “What Affects the Performance of Fake Audio Detection? Analyzing Factors in a Continual Learning Setting”. ICASSP. code.
  2. Yixuan Xiao and Thang Vu. 2025. “Layer-Wise Decision Fusion for Fake Audio Detection Using XLS-R”. To appear at Interspeech.