Publications

2020 2019 2018

2020

 

Journal Papers

  • P. Gimeno, I. Viñals, A. Ortega, A. Miguel, E. Lleida "Multiclass audio segmentation based on recurrent neural networks for broadcast domain data". EURASIP Journal on Audio, Speech, and Music Processing, 5, March. 2020. DOI.
  • V. Mingote, A. Miguel, A. Ortega, E. Lleida "Optimization of the area under the ROC curve using neural network supervectors for text-dependent speaker verification". Computer, Speech & Language , vol. 63, Sept. 2020. DOI.

Conferences

  • V. Mingote, A. Miguel, A. Ortega, E. Lleida "Training Speaker Enrollment Models by Network Optimization" 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2019.
  • P. Gimeno, V. Mingote, A. Ortega, A. Miguel, E. Lleida "Partial AUC Optimisation using Recurrent Neural Networks for Music Detection with Limited Training Data" 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2019.
  • S. Prieto, A. Ortega, I. López-Espejo, E. Lleida "Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions" 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2019.
  • V. Mingote, A. Miguel, D. Ribas, A. Ortega, E. Lleida "Knowledge Distillation and Random Erasing Data Augmentation for Text-Dependent Speaker Verification" ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pp. 6824-6828, DOI.

2019

 

Journal Papers

  • I. Viñals, A. Ortega, Jesus Villalba, A. Miguel, E. Lleida "Unsupervised adaptation of PLDA models for broadcast diarization". EURASIP Journal on Audio, Speech, and Music Processing, 24, Dec. 2019. DOI
  • E. Lleida, A. Ortega, A. Miguel, V. Bazán-Gil, C. Pérez, M. Gómez, A. de Prada "Albayzin 2018 Evaluation: The IberSpeech-RTVE Challenge on Speech Technologies for Spanish Broadcast Media". Applied Sciences, vol. 9, no. 24, p. 5412, Dec. 2019. DOI
  • I. Viñals, A. Ortega, A. Miguel, E. Lleida "An Analysis of the Short Utterance Problem for Speaker Characterization". Applied Sciences, vol. 9, no. 18, p. 3697, Sep. 2019. DOI
  • V. Mingote, A. Miguel, A. Ortega, E. Lleida "Supervector Extraction for Encoding Speaker and Phrase Information with Neural Networks for Text-Dependent Speaker Verification". Applied Sciences, vol. 9, no. 16, p. 3295, Aug. 2019. DOI

Conferences

  • I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida "ViVoLAB Speaker Diarization System for the DIHARD 2019 Challenge" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.
  • V. Mingote, A. Miguel, D. Ribas, A. Ortega, E. Lleida "Optimization of False Acceptance/Rejection Rates and Decision Threshold for End-to-End Text-Dependent Speaker Verification Systems" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.
  • V. Mingote, D. Castan, M. McLaren, M. Kumar Nandwana, A. Ortega, E. Lleida, Antonio Miguel"Language Recognition using Triplet Neural Networks" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.
  • I. Viñals, D. Ribas, V. Mingote, J. Llombart, P. Gimeno, A. Miguel, A. Ortega, E. Lleida "Phonetically-aware embeddings, Wide Residual Networks with Time-Delay Neural Networks and Self Attention models for the 2018 NIST Speaker Recognition Evaluation" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.
  • J. Llombart, D. Ribas Gonzalez, A. Miguel, L. Vicente, A. Ortega, E. Lleida "Speech Enhancement with Wide Residual Networks in Reverberant Environments" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.
  • J. Llombart, D. Ribas Gonzalez, A. Miguel, L. Vicente, A. Ortega, E. Lleida "Progressive Speech Enhancement with Residual Connections" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.

2018

 

Conferences

  • I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida "Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge" 19th Annual Conference of the International Speech Communication Association, INTERSPEECH 2018. Hyderabad, India. September 2018.
  • I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida "In-domain Adaptation Solutions for the RTVE 2018 Diarization Challenge" Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018.
  • V. Mingote, A. Miguel, A. Ortega, E. Lleida "Differentiable Supervector Extraction for Encoding Speaker and Phrase Information in Text Dependent Speaker" Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018.
  • J. Llombart, A. Miguel, A. Ortega, E. Lleida "Wide Residual Networks 1D for Automatic Text Punctuation" Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018.
  • I. Viñals, A. Ortega, A. Miguel, E. Lleida "Phonetic Variability Influence on Short Utterances in Speaker Verification" Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018.
  • P. Gimeno, I. Viñals, A. Ortega, A. Miguel, E. Lleida "A Recurrent Neural Network Approach to Audio Segmentation for Broadcast Domain Data" Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018.