My group in Illinois, former students at IU, and collaborators have had a strong presence at WASPAA 2025, which was one of the best conference experiences I've had so far.…
ISMIR 2025 in Daejeon, Korea was really fun. Learned a lot from the nice papers, how things are organized there, and enjoyed the music so much. Yutong Wen presented a…
Darius Petermann successfully defended his dissertation on "Efficient Native Neural Sub-band Coding through Residual Feature Representation within Hyper-Autoencoded Reconstruction Propagation Networks."
The visit to Fraunhofer IIS and International Audio Labs in Erlangen was so inspiring and heartwarming. Introducing my neural audio coding works to the leading experts in audio coding was…
It was such a nice visit to Timo Gerkmann's signal processing group at the University of Hamburg. I had a great time speaking with the bright students and researchers there,…
Anastasia Kuznetsova successfully defended her dissertation on "Data Efficiency and Model Complexity Reduction for Speech Processing Systems." Congratulations!
ICASSP 2025 was fun! I organized the Generative Data Augmentation (GenDA) workshop along with my colleagues (Dinesh Manocha at U. of Maryland, Johan Hershey at Google, and Trausti Kristjansson at…
Jackie participated in the Generative Data Augmentation (GenDA) workshop as the task captain for the challenge, "Room Acoustics and Speaker Distance Estimation." She also chaired a few sessions.
Jaesung participated in the Generative Data Augmentation (GenDA) workshop as the task captain for the challenge, "Zero-Shot TTS and personalized speech enhancement." He also chaired a few sessions.
My research revolves around making audio and speech AI more practical and useful. I aim to introduce concepts such as efficiency, personalization, scalability, and collaboration into the AI systems I develop. With those goals in mind and by combining signal processing, generative modeling, and machine learning, I develop adaptive systems for learning efficient data representations (e.g., neural audio coding), intelligent signal processing (e.g., speech enhancement and source separation), and generative modeling of audio.
Darius Petermann successfully defended his dissertation on “Efficient Native Neural Sub-band Coding through Residual Feature Representation within Hyper-Autoencoded Reconstruction Propagation…