top of page
Vamsi K Ithapu
[C43] EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception
S Chowdhury, S. Biswas, S. Nag, T. Nagarajan, C. Murdock, I. Ananthabhotla, Y. Qian, V. K. Ithapu, D. Manocha, Ruohan Gao
International Conference on Computer Vision (ICCV) 2025
PDF
[C42] Modulating state space model with slowfast framework for compute-efficient ultra low-latency speech enhancement
L. Cheng, A. Pandey, B. Xu, T. Delbruck, V. K. Ithapu, S. C. Liu
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025
PDF
[C41] Hearing anywhere in any environment
X. Liu, A. Kumar, P. Calamia, S. Amengual, C. Murdock, I. Ananthabhotla, P. Robinson, E. Shlizerman, V. K. Ithapu, R. Gao
Computer Vision and Pattern Recognition (CVPR) 2025
PDF
[C40] Spherical world-locking for audio-visual localization in egocentric videos
H.Yun, R. Gao, I. Ananthabhotla, A. Kumar, J. Donley, C. Li, G. Kim, V. K. Ithapu, C. Murdock
European conference on Computer Vision (ECCV) 2024
PDF
[C39] Self motion as supervision for egocentric audio visual localization
C. Murdock, I. Ananthabhotla, H. Lu, V. K. Ithapu
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024
PDF
[C38] Hearing loss detection from facial expressions in 1-1 conversations
Y. Yin, I. Ananthabhotla, V. K. Ithapu, S. Petridis, Y. Wu, C. Miller
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024
PDF
[C37] The audio-visual Conversation graph: From an egocentric perspective
W. Jia, M. Liu, H. Jiang, I. Ananthabhotla, J. Rehg, V. K. Ithapu, R. Gao
Computer Vision and Pattern Recognition (CVPR) 2024
PDF
[C36] Learning to personalize equalization for high-fidelity spatial audio reproduction
A. Gupta, P. Hoffmann, S. Prepelitǎ, P. Robinson, V. K. Ithapu, D. Alon
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023
Link
[C35] LA-VocE: Low-SNR audio-visual speech enhancement using neural vocoders
R. Mira, B. Xu, J. Donley, A. Kumar, S. Petridis, V. K. Ithapu, M. Pantic
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023
Link
[C34] Leveraging heteroscedastic uncertainty in learning complex spectral mapping for single-channel speech enhancement
K. Chen, D. Wong, K. Tan, B. Xu, A. Kumar, V. K. Ithapu
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023
Link
[C33] Towards improved room impulse response estimation for speech recognition
A. Ratnarajah, I. Ananthabhotla, V. K. Ithapu, P. Hoffmann, D. Manocha, P. Calamia
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023
Link
[C32] Egocentric auditory attention localization in conversations
F. Ryan, H. Jiang, A. Shukla, J. M. Rehg, V. K. Ithapu
Computer Vision and Pattern Recognition (CVPR) 2023
PDF
[C31] Novel view Acoustic Synthesis
C. Chen, A. Richard, R. Shapovalov, V. K. Ithapu, N. Neverova, K. Grauman, A. Vedaldi
Computer Vision and Pattern Recognition (CVPR) 2023
PDF
[C30] Chat2Map : Efficient scene mapping from multi-ego conversations
S. Majumder, H. Jiang, P. Moulon, E. Henderson, P. Calamia, K. Grauman, V. K. Ithapu
Computer Vision and Pattern Recognition (CVPR) 2023
PDF
[C29] HRTF personalization based on ear morphology
M. Warnecke, S. Jamison, S. Prepelita, P. Calamia, V. K. Ithapu
Audio Engineering Society Conference 2022
Link
[C28] SAQAM: Spatial Audio Quality Assessment Metric
P. Manocha, A. Kumar, B. Xu, A. Menon, I. Gebru, V. K. Ithapu, P. Calamia
Interspeech 2022
PDF
[C27] Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization,
H. Jiang, C. Murdock, V. K. Ithapu
Computer Vision and Pattern Recognition (CVPR) 2022
PDF
[C26] Ego4D: Around the World in 3,000 Hours of Egocentric Video,
K. Grauman, [many-authors], V. K. Ithapu, [many-authors], J. Malik
Computer Vision and Pattern Recognition (CVPR) 2022
PDF
[C25] Continual self-training with bootstrapped remixing for speech enhancement,
E. Tzinis, Y. Adi, V. K. Ithapu, B. Xu, A. Kumar
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
PDF
[C24] Deep Impulse Responses: Estimating and Parameterizing Filters with Deep Networks,
A. Richard, P. Dodds, V. K. Ithapu
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
PDF
[C23] DPLM: A Deep Perceptual Spatial-Audio Localization Metric,
P. Manocha, A. Kumar, B. Xu, A. Menon, I. Gebru, V. K. Ithapu, P. Calamia
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2021
PDF
[C22] Filtered Noise Shaping for Time Domain Room Impulse Response Estimation From Reverberant Speech,
C. Steinmetz, V. K. Ithapu, P. Calamia
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2021
PDF
Best Student Paper Award (News Link)
[C21] Audio Visual Floor Plan Reconstruction,
C. Chen, U. Jain, C. Schissler, S. V. A. Gari, Z. Al-Halah, V. K. Ithapu, P. Robinson, K. Grauman
International Conference on Computer Vision (ICCV) 2021
PDF
[C20] Egocentric Pose Estimation from Human Vision Span,
H. Jiang, V. K. Ithapu
International Conference on Computer Vision (ICCV) 2021
PDF
[C19] Do sound event representations generalize to other audio tasks? A case study in audio transfer learning,
A. Kumar, Y. Wang, V. K. Ithapu, C. Fuegen
InterSpeech 2021
PDF
[C18] On the predictability of HRTFs from ear shapes using deep networks,
Y. Zhou, H. Jiang, V. K. Ithapu
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
PDF
[C17] A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition,
A. Kumar, V. K. Ithapu
International Conference on Machine Learning (ICML) 2020
[C16] SeCoST: Sequential Co-Supervision for Weakly Labeled Audio Event Detection,
A. Kumar, V. K. Ithapu
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
PDF
[C15] SoundSpaces: Audio-Visual Navigation in 3D Environments,
C. Chen, U. Jain, C. Schissler, S. V. A. Gari, Z. Al-Halah, V. K. Ithapu, P. Robinson, K. Grauman
European Conference on Computer Vision (ECCV) 2020
PDF
[C14] Decoding the Deep: Exploring Class Hierarchies of Deep Representations using Multiresolution Matrix Factorization,
V. K. Ithapu
Explainable Computer Vision Workshop, Computer Vision and Pattern Recognition (CVPR) 2017
[C13] When can Multi-site Datasets be Pooled for Regression: Hypothesis Tests, L2-consistency and Neuroscience Applications,
H. Hao, Y. Zhang, V. K. Ithapu, G. Wahba, S. C. Johnson, V. Singh
International Conference on Machine Learning (ICML) 2017
[C12] The Incremental Multiresolution Matrix Factorization Algorithm,
V. K. Ithapu, R. Kondor, S. C. Johnson, V. Singh
Computer Vision and Pattern Recognition (CVPR) 2017
[C11] On the Interplay of Network Structure and Gradient Convergence in Deep Learning,
V. K. Ithapu, S. Ravi, V. Singh
54th Allerton Conference on Communication, Control and Computing (Allerton) 2016
[C10] Hypothesis Testing in Unsupervised Domain Adaptation with Applications in Alzheimer's Disease,
H. Hao, V. K. Ithapu, S. Ravi, V. Singh, G. Wahba, S. C. Johnson
Neural Information Processing Systems (NeurIPS) 2016
[C9] Experimental Design on a Budget for Sparse Linear Models and Applications,
S. Ravi, V. K. Ithapu, S. C. Johnson, V. Singh
International Conference on Machine Learning (ICML), 2016
[C8] An NMF perspective on Binary Hashing,
L. Mukherjee, S. Ravi, V. K. Ithapu, T. Holmes, V. Singh
International Conference on Computer Vision (ICCV), 2015
[C7] A Projection Free Method for Generalized Eigenvalue Problem with a Nonsmooth Regularizer,
S. J. Hwang, M. Collins, S. Ravi, V. K. Ithapu, N. Adluru, S. C. Johnson, V. Singh,
International Conference on Computer Vision (ICCV), 2015
[C6] Randomized Denoising Autoencoders for Smaller and Efficient Imaging based AD Clinical Trials,
V. K. Ithapu, V. Singh, O. Okonkwo, S. C. Johnson,
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2014
[C5] Speeding up Permutation Testing in Neuroimaging,
V. K. Ithapu*, C. Hinrichs*, Q. Sun, S. C. Johnson, V. Singh,
Neural Information Processing Systems (NeurIPS), 2013
* - equal contribution
PDF (Oral Spotlight)
[C4] GOSUS: Grassmannian Online Subspace Updates with Structured-sparsity,
J. Xu, V. K. Ithapu, L. Mukherjee, J. Rehg, V. Singh,
International Conference on Computer Vision (ICCV), 2013
[C3] Fundus Image Registration for Vestibularis Research,
V. K. Ithapu, A. Fritsche, A. Oppelt, M. Westhofen, T. M. Deserno,
Proceedings of SPIE Medical Imaging, 2010
[C2] Diversity Employment into Target plus Clutter SAR Imaging using MIMO Configuration,
V. K. Ithapu, A. K. Mishra, R. K. Panigrahi,
Indian Antenna Week, 2010
[C1] Hybrid Diversity Strategy using MIMO Radar for Target Tracking,
V. K. Ithapu, A. K. Mishra,
IEEE Applied Electromagnetics Conference (AEMC), 2009
bottom of page