publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
-
CVPRHearing Anywhere in Any EnvironmentIn Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) Jun 2025
2024
-
NEURIPSTell What You Hear From What You See - Video to Audio Generation Through TextIn Advances in Neural Information Processing Systems 2024
-
ML4PHYSCalo-VQ: Vector-quantized two-stage generative model in calorimeter simulationarXiv preprint arXiv:2405.06605 2024
-
CVPRMuseChat: A Conversational Music Recommendation System for VideosIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Jun 2024
-
AAAICAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy EnvironmentsProceedings of the AAAI Conference on Artificial Intelligence Mar 2024
-
WACVLet the Beat Follow You - Creating Interactive Drum Sounds From Body RhythmIn Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Jan 2024
-
WACVTackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-AnsweringIn Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision 2024
2021
-
NEURIPSHow Does it Sound? Generation of Rhythmic Soundtracks for Human Movement VideosAdvances in Neural Information Processing Systems 2021
2020
-
ArxivMulti-instrumentalist net: Unsupervised generation of music from body movementsarXiv preprint arXiv:2012.03478 2020
-
NEURIPSAudeo: Audio generation for a silent performance videoAdvances in Neural Information Processing Systems 2020
-
CVPRPredict & cluster: Unsupervised skeleton based action recognitionIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2020