publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. CVPR
    Hearing Anywhere in Any Environment
    Liu, Xiulong, Kumar, Anurag, Calamia, Paul, Amengual, Sebastia V., Murdock, Calvin, Ananthabhotla, Ishwarya, Robinson, Philip, Shlizerman, Eli, Ithapu, Vamsi Krishna, and Gao, Ruohan
    In Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) Jun 2025

2024

  1. NEURIPS
    Tell What You Hear From What You See - Video to Audio Generation Through Text
    Liu, Xiulong, Su, Kun, and Shlizerman, Eli
    In Advances in Neural Information Processing Systems 2024
  2. ML4PHYS
    Calo-VQ: Vector-quantized two-stage generative model in calorimeter simulation
    Liu, Qibin, Shimmin, Chase, Liu, Xiulong, Shlizerman, Eli, Li, Shu, and Hsu, Shih-Chieh
    arXiv preprint arXiv:2405.06605 2024
  3. ICML
    From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation
    Su, Kun, Liu, Xiulong, and Shlizerman, Eli
    In Proceedings of the 41st International Conference on Machine Learning 21–27 jul 2024
  4. CVPR
    MuseChat: A Conversational Music Recommendation System for Videos
    Dong, Zhikang, Liu, Xiulong, Chen, Bin, Polak, Pawel, and Zhang, Peng
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Jun 2024
  5. AAAI
    CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments
    Liu, Xiulong, Paul, Sudipta, Chatterjee, Moitreya, and Cherian, Anoop
    Proceedings of the AAAI Conference on Artificial Intelligence Mar 2024
  6. WACV
    Let the Beat Follow You - Creating Interactive Drum Sounds From Body Rhythm
    Liu, Xiulong, Su, Kun, and Shlizerman, Eli
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Jan 2024
  7. WACV
    Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering
    Liu, Xiulong, Dong, Zhikang, and Zhang, Peng
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision 2024

2021

  1. NEURIPS
    How Does it Sound? Generation of Rhythmic Soundtracks for Human Movement Videos
    Su, Kun*, Liu, Xiulong*, and Shlizerman, Eli
    Advances in Neural Information Processing Systems 2021

2020

  1. Arxiv
    Multi-instrumentalist net: Unsupervised generation of music from body movements
    Su, Kun, Liu, Xiulong, and Shlizerman, Eli
    arXiv preprint arXiv:2012.03478 2020
  2. NEURIPS
    Audeo: Audio generation for a silent performance video
    Su, Kun, Liu, Xiulong, and Shlizerman, Eli
    Advances in Neural Information Processing Systems 2020
  3. CVPR
    Predict & cluster: Unsupervised skeleton based action recognition
    Su, Kun, Liu, Xiulong, and Shlizerman, Eli
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2020