Skip to content
SPS webinar

Generative Audio Restoration in Multimodal Applications

22 April 2025, 11:00 AM - 12:00 PM (ET)
 
Julius-Richter-Photo-May-2023           
Presented by Mr. Julius Richter

 

About this Topic:

The demand for high sound quality is increasing in both entertainment and communications. Consequently, audio restoration algorithms play a critical role in mitigating distortions and interferences that originate from recording processes or arise from imperfect transmission pipelines. This webinar offers an in-depth examination of generative audio restoration algorithms, with a particular focus on diffusion-based techniques for speech enhancement. The presenter will examine how diffusion models can be effectively employed in audio restoration tasks, including methods for conditioning them on visual modalities to improve performance in challenging acoustic scenarios. Additionally, he will explore various diffusion-based approaches, such as flow matching and the Schrödinger bridge, underscoring their significance in the context of audio restoration. The goal is to offer valuable insights into the theoretical underpinnings and practical applications of these advanced techniques.

About the Presenter:

Julius Richter received the B.Sc. and M.Sc. degrees in electrical engineering from the Technical University of Berlin, Germany in 2017 and 2019 respectively. He is currently a Ph.D. student in the Signal Processing group at the University of Hamburg, Germany.

His research interests include deep generative models and multimodal learning with applications to audio–visual speech processing.

Mr. Richter was the recipient of the VDE ITG Award 2024 for his work on speech enhancement with diffusion-based generative models.

Want to learn more about upcoming events & webinars?

Visit the events section of the Signal Processing website to see all upcoming lectures, workshops, webinars, and more.