Research

2015-2017 Postdoctoral position, Machine Intelligence Lab, University of Cambridge, UK

High-Quality voice model for STatistical parametric speech Synthesis

Parametric speech synthesis [1] [2] [3] [4]

2014-2015 Postdoctoral position, Ircam, CNRS-UPMC, Paris, France

ChaNTeR National project

Singing voice synthesis [5] [6] [7]

2011-2013 Postdoctoral position, University of Crete and FORTH, Heraklion, Greece

High-Resolution and Multi-Frame Speech Transformation

Adaptive Harmonic model (aHM) [8] [9] [10] [11]
Voice source modeling: Phase Distortion Deviation (PDD), HMPD vocoder [12] [13] [14] [15]

2006-2010 Ph.D. studies, Ircam, Paris, France

Glottal source and vocal-tract filter separation [16]

Estimation of shape parameters of glottal models by phase minimization [17] [18] [19] [20] [21]
Voice transformation/synthesis, SVLN vocoder [22] [23] [24]
Glottal Closure Instant detection [25]
High Speed Videoendoscopy [26]

2002-2003 Undergraduate, UniNE, Neuchatel, Switzerland

Estimation of fundamental frequency for music instruments in mono and polyphonic context (See FMIT)

[1] G. Degottex and Mark Gales. A spectrally weighted mixture of least square error and wasserstein discriminator loss for generative spss. In Proc. Workshop on Spoken Language Technology (SLT). Athens, Greece, December 2018. PDF

[2] G. Degottex, P. Lanchantin, and M. Gales. A log domain pulse model for parametric speech synthesis. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26(1):57–70, 2018. PDF DOI

[3] G. Degottex, Pierre Lanchantin, and Mark Gales. A pulse model in log-domain for a uniform synthesizer. In Proc. 9th Speech Synthesis Workshop (SSW9). Sunnyvale, CA, USA, September 2016. PDF

[4] G. Degottex, P. Lanchantin, and M. Gales. Light supervised data selection, voice quality normalized training and log domain pulse synthesis. In in Proc. Blizzard Challenge 2017 - EH1. Stockholm, Sweden, 2017. PDF

[5] L. Ardaillon, G. Degottex, and A. Roebel. A multi-layer f0 model for singing voice synthesis using a b-spline representation with intuitive controls. In Proc. Interspeech. Dresden, Germany, September 2015. PDF

[6] G. Degottex, L. Ardaillon, and A. Roebel. Simple multi frame analysis methods for estimation of amplitude spectral envelope estimation in singing voice. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Shanghai, China, March 2016. PDF

[7] G. Degottex, L. Ardaillon, and A. Roebel. Multi-frame amplitude envelope estimation for modification of singing voice. IEEE Transactions on Audio, Speech, and Language Processing, Accepted 2016. PDF DOI

[8] G. Degottex and Y. Stylianou. Analysis and synthesis of speech using an adaptive full-band harmonic model. IEEE Transactions on Acoustics, Speech and Language Processing, 21(10):2085–2095, 2013. PDF DOI

[9] G. Kafentzis, G. Degottex, O. Rosec, and Y. Stylianou. Time-scale modifications based on a full-band adaptive harmonic model. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Vancouver, Canada, August 2013. PDF

[10] G. Kafentzis, G. Degottex, O. Rosec, and Y. Stylianou. Pitch modifications of speech based on an adaptive harmonic model. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Florence, Italy, May 2014. PDF

[11] V. Morfi, G. Degottex, and A. Mouchtaris. A computationally efficient refinement of the fundamental frequency estimate for the adaptive harmonic model. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Florence, Italy, May 2014. PDF

[12] G. Degottex and N. Obin. Phase distortion statistics as a representation of the glottal source: application to the classification of voice qualities. In Proc. Interspeech. Singapore, September 2014. International Speech Communication Association (ISCA). PDF

[13] M. Koutsogiannaki, O. Simantiraki, G. Degottex, and Y. Stylianou. The importance of phase on voice quality assessment. In Proc. Interspeech. Singapore, September 2014. International Speech Communication Association (ISCA). PDF

[14] G. Degottex and D. Erro. A uniform phase representation for the harmonic model in speech synthesis applications. EURASIP, Journal on Audio, Speech, and Music Processing - Special Issue: Models of Speech - In Search of Better Representations, 2014(1):38, 2014. PDF DOI

[15] G. Degottex and D. Erro. A measure of phase randomness for the harmonic model in speech synthesis. In Proc. Interspeech. Singapore, September 2014. International Speech Communication Association (ISCA). PDF

[16] G. Degottex. Glottal source and vocal tract separation. PhD thesis, UPMC-Ircam-UMR9912-STMS, Paris, France, Paris, France, 2010. PDF

[17] G. Degottex, A. Roebel, and X. Rodet. Phase minimization for glottal model estimation. IEEE Transactions on Acoustics, Speech and Language Processing, 19(5):1080–1090, July 2011. PDF DOI

[18] G. Degottex, A. Roebel, and X. Rodet. Joint estimate of shape and time-synchronization of a glottal source model by phase flatness. In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 5058–5061. Dallas, USA, 2010. PDF

[19] G. Degottex, A. Roebel, and X. Rodet. Function of phase-distortion for glottal model estimation. In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 4608–4611. Prague, Czeck Republic, May 2011. PDF DOI

[20] M. Tahon, G. Degottex, and L. Devillers. Usual voice quality features and glottal features for emotional valence detection. In Proc. International Conference on Speech Prosody. Shanghai, China, May 2012. PDF

[21] S. Huber, A. Roebel, and G. Degottex. Glottal source shape parameter estimation using phase minimization variants. In Proc. Interspeech. Portland, USA, September 2012. International Speech Communication Association (ISCA). PDF

[22] G. Degottex, P. Lanchantin, A. Roebel, and X. Rodet. Mixed source model and its adapted vocal tract filter estimate for voice transformation and synthesis. Speech Communication, 55(2):278–294, 2013. PDF DOI

[23] G. Degottex, A. Roebel, and X. Rodet. Pitch transposition and breathiness modification using a glottal source model and its adapted vocal-tract filter. In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 5128–5131. Prague, Czeck Republic, May 2011. PDF DOI

[24] P. Lanchantin, G. Degottex, and X. Rodet. A HMM-based speech synthesis system using a new glottal source and vocal-tract separation method. In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 4630–4633. Dallas, USA, 2010. PDF

[25] G. Degottex, A. Roebel, and X. Rodet. Glottal closure instant detection from a glottal shape estimate. In 13th International Conference on Speech and Computer (SPECOM), 226–231. St-Petersburg, Russia, 2009. PDF

[26] G. Degottex, E. Bianco, and X. Rodet. Usual to particular phonatory situations studied with high-speed videoendoscopy. In The 6th International Conference on Voice Physiology and Biomechanics, ICVPB, 19–26. Tempere, Finland, August 2008. PDF