Publications

2021 2020 2019 2018

2021

 

Journal Papers

  • Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E. "Generalising AUC Optimisation to Multiclass Classification for Audio Segmentation with Limited Training Data". IEEE Signal Processing Letters, 28, pp. 1135-1139, 2021. DOI.
  • Llombart, J.; Ribas, D.; Miguel, A.; Vicente, L.; Ortega, A.; Lleida, E. "Progressive Loss Functions for Speech Enhancement with Deep Neural Networks". EURASIP Journal on Audio, Speech, and Music Processing, 2021 (1), pp. 1-16, 2021. DOI.
  • Viñals, I.; Ortega, A.; Miguel, A.; Lleida, E. "The Domain Mismatch Problem in the Broadcast Speaker Attribution Task". Applied Sciences, vol. 11, no. 18, p. 8521, Sept. 2021. DOI.

Conferences

  • Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E. "Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021" 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. Brno, Czech Republic. September 2021. DOI.
  • Mingote, V.; Miguel, A.; Ortega, A.; Lleida, E. "Log-Likelihood-Ratio Cost Function as Objective Loss for Speaker Verification Systems" 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. Brno, Czech Republic. September 2021. DOI.
  • Gimeno, P.; Ribas, D.; Ortega, A.; Miguel, A.; Lleida, E. "Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions" Fifth International Conference, IberSPEECH 2020. Valladolid, Spain. March 2021. DOI.
  • Mingote, V.; Viñals, I.; Gimeno, P.; Miguel, A.; Ortega, A.; Lleida, E. "ViVoLAB Multimodal Diarization System for RTVE 2020 Challenge" Fifth International Conference, IberSPEECH 2020. Valladolid, Spain. March 2021. DOI.
  • Viñals, I.; Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E. "Diarization and Identity Attribution Compatibility in the Albayzin 2020 Challenge" Fifth International Conference, IberSPEECH 2020. Valladolid, Spain. March 2021. DOI.
  • Mingote, V.; Miguel, A.; Ortega, A.; Lleida, E. "Memory Layers with Multi-Head Attention Mechanisms for Text-Dependent Speaker Verification" Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021-June, 2021. DOI.

2020

 

Journal Papers

  • P. Gimeno, I. Viñals, A. Ortega, A. Miguel, E. Lleida "Multiclass audio segmentation based on recurrent neural networks for broadcast domain data". EURASIP Journal on Audio, Speech, and Music Processing, 5, March 2020. DOI.
  • V. Mingote, A. Miguel, A. Ortega, E. Lleida "Optimization of the area under the ROC curve using neural network supervectors for text-dependent speaker verification". Computer Speech & Language, vol. 63, Sept. 2020. DOI.

Conferences

  • V. Mingote, A. Miguel, A. Ortega, E. Lleida "Training Speaker Enrollment Models by Network Optimization" 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2020.
  • P. Gimeno, V. Mingote, A. Ortega, A. Miguel, E. Lleida "Partial AUC Optimisation using Recurrent Neural Networks for Music Detection with Limited Training Data" 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2020.
  • S. Prieto, A. Ortega, I. López-Espejo, E. Lleida "Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions" 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2020.
  • V. Mingote, A. Miguel, D. Ribas, A. Ortega, E. Lleida "Knowledge Distillation and Random Erasing Data Augmentation for Text-Dependent Speaker Verification" ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pp. 6824-6828, DOI.

2019

 

Journal Papers

  • I. Viñals, A. Ortega, J. Villalba, A. Miguel, E. Lleida "Unsupervised adaptation of PLDA models for broadcast diarization". EURASIP Journal on Audio, Speech, and Music Processing, 24, Dec. 2019.
  • E. Lleida, A. Ortega, A. Miguel, V. Bazán-Gil, C. Pérez, M. Gómez, A. de Prada "Albayzin 2018 Evaluation: The IberSpeech-RTVE Challenge on Speech Technologies for Spanish Broadcast Media". Applied Sciences, vol. 9, no. 24, p. 5412, Dec. 2019.
  • I. Viñals, A. Ortega, A. Miguel, E. Lleida "An Analysis of the Short Utterance Problem for Speaker Characterization". Applied Sciences, vol. 9, no. 18, p. 3697, Sep. 2019.
  • V. Mingote, A. Miguel, A. Ortega, E. Lleida "Supervector Extraction for Encoding Speaker and Phrase Information with Neural Networks for Text-Dependent Speaker Verification". Applied Sciences, vol. 9, no. 16, p. 3295, Aug. 2019.

Conferences

  • I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida "ViVoLAB Speaker Diarization System for the DIHARD 2019 Challenge" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.
  • V. Mingote, A. Miguel, D. Ribas, A. Ortega, E. Lleida "Optimization of False Acceptance/Rejection Rates and Decision Threshold for End-to-End Text-Dependent Speaker Verification Systems" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.
  • V. Mingote, D. Castan, M. McLaren, M. Kumar Nandwana, A. Ortega, E. Lleida, A. Miguel "Language Recognition using Triplet Neural Networks" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.
  • I. Viñals, D. Ribas, V. Mingote, J. Llombart, P. Gimeno, A. Miguel, A. Ortega, E. Lleida "Phonetically-aware embeddings, Wide Residual Networks with Time-Delay Neural Networks and Self Attention models for the 2018 NIST Speaker Recognition Evaluation" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.
  • J. Llombart, D. Ribas Gonzalez, A. Miguel, L. Vicente, A. Ortega, E. Lleida "Speech Enhancement with Wide Residual Networks in Reverberant Environments" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.
  • J. Llombart, D. Ribas Gonzalez, A. Miguel, L. Vicente, A. Ortega, E. Lleida "Progressive Speech Enhancement with Residual Connections" 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019.

2018

 

Conferences

  • I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida "Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge" 19th Annual Conference of the International Speech Communication Association, INTERSPEECH 2018. Hyderabad, India. September 2018.
  • I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida "In-domain Adaptation Solutions for the RTVE 2018 Diarization Challenge" Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018.
  • V. Mingote, A. Miguel, A. Ortega, E. Lleida "Differentiable Supervector Extraction for Encoding Speaker and Phrase Information in Text Dependent Speaker Verification" Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018.
  • J. Llombart, A. Miguel, A. Ortega, E. Lleida "Wide Residual Networks 1D for Automatic Text Punctuation" Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018.
  • I. Viñals, A. Ortega, A. Miguel, E. Lleida "Phonetic Variability Influence on Short Utterances in Speaker Verification" Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018.
  • P. Gimeno, I. Viñals, A. Ortega, A. Miguel, E. Lleida "A Recurrent Neural Network Approach to Audio Segmentation for Broadcast Domain Data" Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018.