Xiulong Liu

I am a ML Researcher at Apple AI/ML. Before joining Apple, I obtained my Ph.D. degree from the University of Washington, where I conducted research in the NeuroAI Lab under the supervision of Prof. Eli Shlizerman. My research interests broadly lie in computer vision, audio generation and multi-modal learning. Prior to that, I received my B.S. degree in Electrical Engineering at Shanghai Jiaotong University.
News
Jul 16, 2025 | My Ph.D. dissertation titled “Towards Multi-modal Interactive Systems that Connect Vision, Audio and Beyond” is formally published, please check out here |
---|---|
Jun 30, 2025 | I am excited to share that I’m joining Apple as a ML Researcher! |
May 22, 2025 | I have successfully defended my PhD thesis titled “Towards Multi-modal Interactive Systems that Connects Audio, Vision and Beyond” and become Dr. Dragon! |
Feb 26, 2025 | My first authored paper “Hearing Anywhere in Any Environment” has been accepted to CVPR 2025! Code, Dataset has been released. Please check out here and here! |
Feb 10, 2025 | I pass my PhD General Exam, and become a Ph.D. candidate! |
selected publications
-
AAAICAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy EnvironmentsProceedings of the AAAI Conference on Artificial Intelligence Mar 2024
-
WACVLet the Beat Follow You - Creating Interactive Drum Sounds From Body RhythmIn Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Jan 2024
-
WACVTackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-AnsweringIn Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision 2024