Xiulong Liu

I am Xiulong Liu, a Ph.D. graduate from the University of Washington, where I conducted research in the NeuroAI Lab under the supervision of Prof. Eli Shlizerman. My research interests broadly lie in computer vision, audio generation and multi-modal learning. Prior to that, I received my B.S. degree in Electrical Engineering at Shanghai Jiaotong University.
News
May 22, 2025 | I have successfully defended my PhD thesis titled “Towards Multi-modal Interactive Systems that Connects Audio, Vision and Beyond” and become Dr. Dragon! |
---|---|
Feb 26, 2025 | My first authored paper “Hearing Anywhere in Any Environment” has been accepted to CVPR 2025! |
Feb 10, 2025 | I pass my PhD General Exam, and become a Ph.D. candidate! |
Sep 25, 2024 | My first authored paper “Tell What You Hear From What You See - Video to Audio Generation Through Text” has been accepted by NeurIPS 2024! |
Feb 26, 2024 | My first co-authored paper “MuseChat: A Conversational Music Recommendation System for Videos” has been accepted by CVPR 2024 as Highlight Poster (Top 2.8%)! |
selected publications
-
CVPRHearing Anywhere in Any EnvironmentIn Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) Jun 2025
-
NEURIPSTell What You Hear From What You See - Video to Audio Generation Through TextIn Advances in Neural Information Processing Systems 2024
-
CVPRMuseChat: A Conversational Music Recommendation System for VideosIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Jun 2024
-
AAAICAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy EnvironmentsProceedings of the AAAI Conference on Artificial Intelligence Mar 2024
-
WACVLet the Beat Follow You - Creating Interactive Drum Sounds From Body RhythmIn Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Jan 2024
-
WACVTackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-AnsweringIn Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision 2024
-
NEURIPSHow Does it Sound? Generation of Rhythmic Soundtracks for Human Movement VideosAdvances in Neural Information Processing Systems 2021
-
NEURIPSAudeo: Audio generation for a silent performance videoAdvances in Neural Information Processing Systems 2020
-
CVPRPredict & cluster: Unsupervised skeleton based action recognitionIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2020