Bac Nguyen

Stuttgart, Germany


I am a Research Scientist at Sony AI specializing in Generative AI and Foundation Models. My work spans multiple domains, including computer vision, speech, and natural language processing, with a current focus on improving the efficiency and scalability of deep generative models. I am particularly interested in reducing training costs, accelerating inference, and optimizing large-scale foundation models for real-world impact.

Previously, I obtained my PhD from Ghent University in 2019, co-advised by Carlos Morell and Bernard De Baets. My PhD research focused on metric learning, a supervised learning problem in which, given some supervision information, the goal is to learn a distance function from examples that measures how similar or related two objects are. During my PhD, I developed various large-scale optimization techniques for distance metric learning under different types of supervision.

selected publications

  1. Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment
    Bac Nguyen, Yuhta Takida, Naoki Murata, Chieh-Hsin Lai, Toshimitsu Uesaka, Stefano Ermon, and Yuki Mitsufuji
    In International Conference on Learning Representations, 2026
  2. Improving vector-quantized image modeling with latent consistency-matching diffusion
    Bac Nguyen, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Stefano Ermon, and Yuki Mitsufuji
    In International Joint Conference on Neural Networks, 2025
  3. SAFT: Towards out-of-distribution generalization in fine-tuning
    Bac Nguyen, Stefan Uhlich, Fabien Cardinaux, Lukas Mauch, Marzieh Edraki, and Aaron Courville
    In European Conference on Computer Vision, 2024
  4. AutoTTS: End-to-end text-to-speech synthesis through differentiable duration modeling
    Bac Nguyen, Fabien Cardinaux, and Stefan Uhlich
    In International Conference on Acoustics, Speech and Signal Processing, 2023
  5. NVC-Net: End-to-end adversarial voice conversion
    Bac Nguyen and Fabien Cardinaux
    In International Conference on Acoustics, Speech and Signal Processing, 2022