← Back to home

Publications

Complete List

Explore journal papers, conferences, preprints, and books grouped by year.

Publications

Curated selection of peer-reviewed articles, conference papers, and preprints.

Journal Papers

  • Estevez, M.; Bonomi, C.; Ribas, D.; Ortega, A.; Ferrer, L.

    Beyond Global Metrics: A Fairness Analysis for Interpretable Voice Disorder Detection Systems

    JOURNAL OF VOICE, 2025

Conferences

  • Almudévar, A.; Hernández-Lobato, J. M.; Khurana, S.; Marxer, R.; Ortega, A.

    Aligning Multimodal Representations through an Information Bottleneck

    International Conference on Machine Learning (pp. 1250-1270). PMLR

Preprints

  • Mingote, V.; Ortega, A.; Miguel, A.; Lleida, E.

    Audio-visual speaker diarization: Current databases, approaches and challenges

    arXiv preprint arXiv:2409.05659

Journal Papers

  • Vidal, J.; Ribas, D.; Bonomi, C.; Lleida, E.; Ferrer, L.; Ortega, A.

    Automatic voice disorder detection from a practical perspective

    JOURNAL OF VOICE, 2024

Conferences

  • Lebourdais, M.; Gimeno, P.; Mariotte, T.; Tahon, M.; Ortega, A.; Larcher, A.

    3MAS: a multitask, multilabel, multidataset semi-supervised audio segmentation model

    Proc. odyssey 2024 (pp. 232-239)

  • Gimeno, P.; Ortega, A.

    Advances in Binary and Multiclass Audio Segmentation with Deep Learning Techniques: A PhD Thesis Overview

    Proc. IberSPEECH 2024 (pp. 237-241)

  • Mariotte, T.; Almudévar, A.; Tahon, M.; Ortega, A.

    An explainable proxy model for Multilabel audio segmentation

    ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 531-535). IEEE

  • Miguel A Pastor, Alfonso Ortega, Dayana Ribas

    Analysis of the domain mismatch problem in the Speech Emotion Recognition Task

    Proc. IberSPEECH 2024

  • Rubio Felipo, S.; Ribas González, D.; Lleida Solano, E.; Ortega Giménez, A.; Artiaga, A. M.

    Assessing the Impact and Potential of TTS for Pathological Voice Data Augmentation on Pathology Detection Systems

    Proc. IberSPEECH 2024 (pp. 41-45)

  • Mingote, V.; Ortega, A.; Miguel, A.; Lleida, E.

    Encouraging Internal Representations with Speaker Information in End-to-End Neural Diarization by Adding Speaker Loss

    Proc. IberSPEECH 2024 (pp. 191-195)

  • Lebourdais, M.; Mariotte, T.; Almudévar, A.; Tahon, M.; Ortega, A.

    Explainable by-design Audio Segmentation through Non-Negative Matrix Factorization and Probing.

    In Proc. Interspeech 2024

  • Almudévar, A.; Mariotte, T.; Ortega, A.; Tahon, M.; Vicente, L.; Miguel, A.; Lleida, E.

    Predefined Prototypes for Intra-Class Separation and Disentanglement

    Proc. Interspeech 2024 (pp. 3809-3813)

  • María García Cutando, Eduardo Lleida Solano, Virginia Bazán Gil, Alfonso Ortega Giménez, Antonio Miguel Artiaga

    Semantic Information Retrieval through Autonomous Agents

    Proc. IberSPEECH 2024

  • Pastor, M. Á.; Ortega, A.; Miguel, A.; Ribas, D.

    The ViVoLab System for the Odyssey Emotion Recognition Challenge 2024 Evaluation

    Proc. odyssey 2024 (pp. 274-280)

  • Almudévar, A.; Mariotte, T.; Ortega, A.; Tahon, M.

    Unsupervised multiple domain translation through controlled disentanglement in variational autoencoder

    ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 7010-7014). IEEE

Journal Papers

  • Lleida, E.; Rodriguez-Fuentes, L. J.; Tejedor, J.; Ortega, A.; Miguel, A.; Bazán, V.; Arzelus, H.

    An Overview of the IberSpeech-RTVE 2022 Challenges on Speech Technologies

    Applied Sciences, 13(15), 8577

  • Ribas, D.; Pastor, M. A.; Miguel, A.; Martínez, D.; Ortega, A.; Lleida, E.

    Automatic voice disorder detection using self-supervised representations

    Ieee Access, 11, 14915-14927

  • Mingote, V.; Miguel, A.; Ortega, A.; Lleida, E

    Class token and knowledge distillation for multi-head self-attention speaker verification systems

    Digital Signal Processing, vol. 133, 2023

  • Pastor, M. A.; Ribas, D.; Ortega, A.; Miguel, A.; Lleida, E.

    Cross-corpus training strategy for speech emotion recognition using self-supervised representations

    Applied Sciences, 13(16), 9062

  • Barrio, R.; Lozano, Á.; Mayora-Cebollero, A.; Mayora-Cebollero, C.; Miguel, A.; Ortega, A.; Vigara, R.

    Deep learning for chaos detection

    Chaos: An Interdisciplinary Journal of Nonlinear Science, 33(7)

Conferences

  • López-Espejo, I.; Prieto, S.; Ortega, A.; Lleida, E.

    Improved Vocal Effort Transfer Vector Estimation For Vocal Effort-Robust Speaker Verification

    2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP) (pp. 1-6). IEEE

  • Ribas D., Miguel A.

    On the Problem of Data Availability in Automatic Voice Disorder Detection

    Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023) - Volume 5: HEALTHINF; ISBN 978-989-758-631-6, SciTePress, pages 330-337. DOI: 10.5220/0011669300003414

  • Almudévar, A.; Ortega, A.; Vicente, L.; Miguel, A.; Lleida, E.

    Variational Classifier for Unsupervised Anomalous Sound Detection under Domain Generalization

    Proc. Interspeech 2023 (pp. 2823-2827)

Journal Papers

  • Mingote, V.; Miguel, A.; Ribas, D.; Ortega, A.; Lleida, E

    aDCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems

    IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 30, pp. 772-784, 2022

  • Mingote, V.; Viñals, I.; Gimeno, P.; Miguel, A.; Ortega, A.; Lleida, E

    Multimodal Diarization Systems by Training Enrollment Models as Identity Representations

    Applied Sciences, vol. 12, no. 3, pp. 1141, 2022

  • Prieto, S.; Ortega, A.; López-Espejo, I.; Lleida, E

    Shouted and whispered speech compensation for speaker verification systems

    Digital Signal Processing, vol. 127, pp. 103536, 2022

  • Gimeno, P.; Ribas, D.; Ortega, A.; Miguel, A.; Lleida, E

    Unsupervised Adaptation of Deep Speech Activity Detection Models to Unseen Domains

    Applied Sciences, vol. 12, no. 4, pp. 1832, 2022

  • Almudévar, A.; Sevillano, P.; Vicente, L.; Preciado-Garbayo, J.; Ortega, A

    Unsupervised Anomaly Detection Applied to Φ-OTDR

    Sensors, vol. 22, no. 17, pp. 6515, 2022

  • Ribas, D.; Miguel, A.; Ortega, A.; Lleida, E

    Wiener Filter and Deep Neural Networks: A Well-Balanced Pair for Speech Enhancement

    Applied Sciences, vol. 12, no. 18, pp. 9000, 2022

Conferences

  • Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E

    A Study on the Use of wav2vec Representations for Multiclass Audio Segmentation

    Iberspeech 2022. Granada, Spain. Novemberr 2021

  • Pastor, M.; Ribas, D.; Ortega, A.; Miguel, A.; Lleida, E

    Cross-Corpus Speech Emotion Recognition with HuBERT Self-Supervised Representation

    Iberspeech 2022. Granada, Spain. Novemberr 2021

  • Ribas, D.; Pastor, M.; Miguel, A.; Martínez, D.; Ortega, A.; Lleida, E

    S3prl-Disorder: Open-Source Voice Disorder Detection System based in the Framework of S3PRL-toolkit

    Iberspeech 2022. Granada, Spain. Novemberr 2021

  • Miguel, A.; Ortega, A.; Lleida, E

    ViVoLAB System Description for the S2TC IberSPEECH-RTVE 2022 challenge

    Iberspeech 2022. Granada, Spain. Novemberr 2021

Journal Papers

  • Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E

    Generalising AUC Optimisation to Multiclass Classification for Audio Segmentation with Limited Training Data

    IEEE Signal Processing Letters, 28 , pp. 1135-1139, 2021

  • Llombart, J.; Ribas, D.; Miguel, A.; Vicente, L.; Ortega, A.; Lleida, E

    Progressive Loss Functions for Speech Enhancement with Deep Neural Networks

    EURASIP Journal on Audio, Speech, and Music Processing, 2021 (1), pp. 1-16, 2021

  • Viñals, I.; Ortega, A.; Miguel, A.; Lleida, E

    The Domain Mismatch Problem in the Broadcast Speaker Attribution Task

    Applied Sciences, vol. 11, no. 18, p. 8521, Sept. 2021

Conferences

  • Mingote, V.; Miguel, A.; Ortega, A.; Lleida, E

    Memory Layers with Multi-Head Attention Mechanisms for Text-Dependent Speaker Verification

    Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021-June , 2021

  • Gimeno, P.; Ribas, D.; Ortega, A.; Miguel, A.; Lleida, E

    Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions

    Fifth International Conference, IberSPEECH 2020. Valladolid, Spain. March 2021

  • Viñals, I.; Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E

    Diarization and Identity Attribution Compatibility in the Albayzin 2020 Challenge

    Fifth International Conference, IberSPEECH 2020. Valladolid, Spain. March 2021

  • Mingote, V.; Miguel, A.; Ortega, A.; Lleida, E

    Log-Likelihood-Ratio Cost Function as Objective Loss for Speaker Verification Systems

    22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. Brno, Czech Republic. September 2021

  • Gimeno, P; Ortega, A.; Miguel, A.; Lleida, E

    Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021

    22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. Brno, Czech Republic. September 2021

  • Mingote, V.; Viñals, I.; Gimeno, P.; Miguel, A.; Ortega, A.; Lleida, E

    ViVoLAB Multimodal Diarization System for RTVE 2020 Challenge

    Fifth International Conference, IberSPEECH 2020. Valladolid, Spain. March 2021

Journal Papers

  • P. Gimeno, I. Viñals, A. Ortega, A. Miguel, E. Lleida

    Multiclass audio segmentation based on recurrent neural networks for broadcast domain data

    EURASIP Journal on Audio, Speech, and Music Processing, 5, March. 2020

  • V. Mingote, A. Miguel, A. Ortega, E. Lleida

    Optimization of the area under the ROC curve using neural network supervectors for text-dependent speaker verification

    Computer, Speech & Language , vol. 63, Sept. 2020

Conferences

  • V. Mingote, A. Miguel, D. Ribas, A. Ortega, E. Lleida

    Knowledge Distillation and Random Erasing Data Augmentation for Text-Dependent Speaker Verification

    ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pp. 6824-6828

  • P. Gimeno, V. Mingote, A. Ortega, A. Miguel, E. Lleida

    Partial AUC Optimisation using Recurrent Neural Networks for Music Detection with Limited Training Data

    21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2020

  • S. Prieto, A. Ortega, I. López-Espejo, E. Lleida

    Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions

    21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2020

  • V. Mingote, A. Miguel, A. Ortega, E. Lleida

    Training Speaker Enrollment Models by Network Optimization

    21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2020

Journal Papers

  • E. Lleida, A. Ortega, A. Miguel, V. Bazán-Gil, C. Pérez, M. Gómez, A. de Prada

    Albayzin 2018 Evaluation: The IberSpeech-RTVE Challenge on Speech Technologies for Spanish Broadcast Media

    Applied Sciences, vol. 9, no. 24, p. 5412, Dec. 2019

  • I. Viñals, A. Ortega, A. Miguel, E. Lleida

    An Analysis of the Short Utterance Problem for Speaker Characterization

    Applied Sciences, vol. 9, no. 18, p. 3697, Sep. 2019

  • V. Mingote, A. Miguel, A. Ortega, E. Lleida

    Supervector Extraction for Encoding Speaker and Phrase Information with Neural Networks for Text-Dependent Speaker Verification

    Applied Sciences, vol. 9, no. 16, p. 3295, Aug. 2019

  • I. Viñals, A. Ortega, Jesus Villalba, A. Miguel, E. Lleida

    Unsupervised adaptation of PLDA models for broadcast diarization

    EURASIP Journal on Audio, Speech, and Music Processing, 24, Dec. 2019

Conferences

  • V. Mingote, D. Castan, M. McLaren, M. Kumar Nandwana, A. Ortega, E. Lleida, Antonio Miguel

    Language Recognition using Triplet Neural Networks

    20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019

  • V. Mingote, A. Miguel, D. Ribas, A. Ortega, E. Lleida

    Optimization of False Acceptance/Rejection Rates and Decision Threshold for End-to-End Text-Dependent Speaker Verification Systems

    20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019

  • I. Viñals, D. Ribas, V. Mingote, J. Llombart, P. Gimeno, A. Miguel, A. Ortega, E. Lleida

    Phonetically-aware embeddings, Wide Residual Networks with Time-Delay Neural Networks and Self Attention models for the 2018 NIST Speaker Recognition Evaluation

    20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019

  • J. Llombart, D. Ribas Gonzalez, A. Miguel, L. Vicente, A. Ortega, E. Lleida

    Progressive Speech Enhancement with Residual Connections

    20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019

  • J. Llombart, D. Ribas Gonzalez, A. Miguel, L. Vicente, A. Ortega, E. Lleida

    Speech Enhancement with Wide Residual Networks in Reverberant Environments

    20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019

  • I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida

    ViVoLAB Speaker Diarization System for the DIHARD 2019 Challenge

    20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019

Conferences

  • P. Gimeno, I. Viñals, A. Ortega, A. Miguel, E. Lleida

    A Recurrent Neural Network Approach to Audio Segmentation for Broadcast Domain Data

    Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018

  • V. Mingote, A. Miguel, A. Ortega, E. Lleida

    Differentiable Supervector Extraction for Encoding Speaker and Phrase Information in Text Dependent Speaker

    Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018

  • I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida

    Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge

    19th Annual Conference of the International Speech Communication Association, INTERSPEECH 2018. Hyderabad, India. September 2018

  • I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida

    In-domain Adaptation Solutions for the RTVE 2018 Diarization Challenge

    Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018

  • I. Viñals, A. Ortega, A. Miguel, E. Lleida

    Phonetic Variability Influence on Short Utterances in Speaker Verification

    Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018

  • J. Llombart, A. Miguel, A. Ortega, E. Lleida

    Wide Residual Networks 1D for Automatic Text Punctuation

    Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018

Conferences

  • I. Viñals, J. Villalba, A. Ortega, A. Miguel, E. Lleida

    Domain Adaptation of PLDA models in Broadcast Diarization by means of Unsupervised Speaker Clustering

    18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. Stockholm, Sweden. August 2017

  • A. Miguel, J. Llombart, E. Lleida, A. Ortega

    Tied Hidden Factors in Neural Networks for End-to-End Speaker Recognition

    18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. Stockholm, Sweden. August 2017

Journal Papers

  • J. Villalba, A. Ortega, A. Miguel, E. Lleida

    Analysis of Speech Quality Measures for the Task of Estimating the Reliability of Speaker Verification Decisions

    Speech Communication, Elsevier. April 2016

  • J. Villalba, A. Miguel, A. Ortega, E. Lleida

    Bayesian Networks to Model the Variability of Speaker Verification Scores in Adverse Environments

    IEEE/ACM Transactions on Audio, Speech, and Language Processing. December 2016

Conferences

  • J. Olcoz, P. Gimeno, A. Ortega, A. Arguedas, A. Miguel, E. Lleida

    Automatic Text-to-Audio Alignment of Multimedia Broadcast Content

    Third International Conference, IberSPEECH 2016. Lisbon, Portugal. November 2016

  • I. Viñals, J. Villalba, A. Ortega, A. Miguel, E. Lleida

    Bottleneck Based Front-End for Diarization Systems

    Third International Conference, IberSPEECH 2016. Lisbon, Portugal. November 2016

  • J. Llombart, A. Miguel, E. Lleida, A. Ortega

    Character Sequence to Sequence Applications: Subtitle Segmentation and Part-of-Speech Tagging

    Third International Conference, IberSPEECH 2016. Lisbon, Portugal. November 2016

  • J. Olcoz, J. Llombart, A. Miguel, A. Ortega, E. Lleida

    The ViVoLab-I3A-UZ System for Albayzin 2016 Search-on-Speech Evaluation

    Third International Conference, IberSPEECH 2016. Lisbon, Portugal. November 2016

Journal Papers

  • D. Castan, D. Tavarez, P. Lopez-Otero, J. Franco-Pedroso, H. Delgado, E. Navas, L. Docio-Fernández, D. Ramos, J. Serrano, A. Ortega, E. Lleida

    Albayzín-2014 evaluation: audio segmentation and classification in broadcast news domains

    EURASIP Journal on Audio, Speech, and Music Processing. December 2015

  • D. Martinez, E. Lleida, P. Green, H. Christensen, A. Ortega, A. Miguel

    Intelligibility Assessment and Speech Recognizer Word Accuracy Rate Prediction for Dysarthric Speakers in a Factor Analysis Subspace

    ACM Transactions on Accessible Computing 6(3):1-21 June 2015

Conferences

  • J. Villalba, A. Miguel, A. Ortega, E. Lleida

    Spoofing Detection with DNN and One-class SVM for the ASVspoof 2015 Challenge

    16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015. Dresden, Germany. September 2015

  • J. Villalba, A. Miguel, A. Ortega, E. Lleida

    Variational Bayesian PLDA for Speaker Diarization in the MGB Challenge

    2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015). Scottsdale, Arizona, USA. December 2015

Journal Papers

  • D. Castan, A. Ortega, A. Miguel, E. Lleida

    Audio Segmentation-by-Classification Approach Based on Factor Analysis in Broadcast News Domain

    EURASIP Journal on Audio, Speech, and Music Processing. Vol. 34. August 2014

  • J. E. García, A. Ortega, A. Miguel, E. Lleida

    Low Bit Rate Compression Methods of Feature Vectors for Distributed Speech Recognition

    Speech Communication, Elsevier. Vol. 58. pp. 111-123. March 2014

Conferences

  • D. Castán, A. Ortega, A. Miguel, E. Lleida

    A preliminary study of Acoustic Events Classification with Factor Analysis in Meeting Rooms

    Iberspeech 2014. Las Palmas de Gran Canaria (Spain) November 2014

  • A. Miguel, J. Olcoz, J. Villalba, A. Ortega, E. Lleida

    Albayzin 2014 Search on Speech @ ViVolab UZ

    Iberspeech 2014. Las Palmas de Gran Canaria (Spain) November 2014

  • J. Olcoz, A. Ortega, A. Miguel, E. Lleida

    Confidence Measures in Automatic Speech Recognition for Error Detection in Restricted Domains

    Iberspeech 2014. Las Palmas de Gran Canaria (Spain) November 2014

  • A. Miguel, J. Villalba, A. Ortega, E. Lleida, C. Vaquero

    Factor Analysis with Sampling Methods for Text Dependent Speaker Recognition

    15th Annual Conference of the International Speech Communication Association, INTERSPEECH 2014. Singapore September 2014

  • A. Arguedas, E. Lleida, A. Ortega, A. Miguel, J. E. García

    Subtitling Tools Based On Automatic Speech Recognition

    Iberspeech 2014. Las Palmas de Gran Canaria (Spain) November 2014

  • D. Martínez, J. Villalba, E. Lleida, A. Ortega

    Unsupervised Accent Modeling for Language Identification

    Iberspeech 2014. Las Palmas de Gran Canaria (Spain) November 2014

Journal Papers

  • C. Vaquero, A. Ortega, A. Miguel, E. Lleida

    Quality Assessment for Speaker Diarization and its Application in Speaker Characterization

    IEEE Transactions on Audio, Speech, and Language Processing. Vol. 21. pp. 816-827. April 2013

Conferences

  • J. Villalba, E. Lleida, A. Ortega, A. Miguel

    A New Bayesian Network to Assess the Reliability of Speaker Verification Decisions

    14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013. Lyon (France) August 2013

  • D. Castán, A. Ortega, A. Miguel, E. Lleida

    Broadcast News Segmentation with Factor Analysis System

    SLAM 2013 Speech, Language and Audio in Multimedia. Marseille (France) August 2013

  • D. Martinez, E. Lleida, A. Ortega, A. Miguel

    Prosodic features and formant modeling for an ivector-based language recognition system

    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Vancouver (Canada) May 2013

  • D. Castán, A. Ortega, J. Villalba, A. Miguel, E. Lleida

    Segmentation-by-classification system based on factor analysis

    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Vancouver (Canada) May 2013

  • D. Martínez, D. Ribas, E. Lleida, A. Ortega, A. Miguel

    Suprasegmental Information Modelling for Autism Disorder Spectrum and Specific Language Impairment Classification

    14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013. Lyon (France) August 2013

  • J. Villalba, E. Lleida, A. Ortega, A. Miguel

    The I3A Speaker Recognition System for NIST SRE12: Post-evaluation Analysis

    14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013. Lyon (France) August 2013

Conferences

  • A. Ortega, C. Vaquero, A. Miguel, E. Lleida

    Diarization for Speaker Characterization

    VI Jornadas de Reconocimiento Biométrico de Personas JRBP 2012. Las Palmas de Gran Canaria (Spain) February 2012

  • D. Ribas, J. E. Garcia, A. Miguel, A. Ortega, E. Lleida, J. R. Calvo

    Evaluation of a New Beam-Search Formant Tracking Algorithm in Noisy Environments

    Iberspeech 2012. Madrid (Spain) November 2012

  • D. Castán, A. Ortega, E. Lleida

    Factor Analysis Segmentation and Classification in Broadcast News Domain

    Iberspeech 2012. Madrid (Spain) November 2012

  • J. Villalba, E. Lleida, A. Ortega, A. Miguel

    Reliability Estimation of the Speaker Verification Decisions Using Bayesian Networks to Combine Information from Multiple Speech Quality Measures

    Iberspeech 2012. Madrid (Spain) November 2012

  • D. Martínez, E. Lleida, A. Ortega, A. Miguel, J. Villalba

    Score Level versus Audio Level Fusion for Voice Pathology Detection on the Saarbrücken Voice Database

    Iberspeech 2012. Madrid (Spain) November 2012

  • L. J. Rodriguez, M. Penagarikano, A. Varona, M. Diez, G. Bordel, A. Abad, D. Martinez, J. Villalba, A. Ortega, E. Lleida

    The BLZ Submission to the NIST 2011 LRE: Data Collection, System Development and Performance

    13th Annual Conference of the International Speech Communication Association. Interspeech 2012. Portland (Oregon, USA) September 2012

  • D. Martínez, E. Lleida, A. Ortega, A. Miguel, J. Villalba

    Voice Pathology Detection on the Saarbrücken Voice Database with Calibration and Fusion of Scores Using Mul-tiFocal Toolkit

    Iberspeech 2012. Madrid (Spain) November 2012

Journal Papers

  • A. Miguel , A. Ortega, L. Buera, E. Lleida

    Bayesian Networks for Discrete Observation Distributions in Speech Recognition

    IEEE Transactions on Audio, Speech, and Language Processing. Vol. 19. No. 6. pp. 1476-1489. August 2011

Conferences

  • D. Castán, Carlos Vaquero, A. Ortega, D. Martínez, J. Villalba, E. Lleida

    Hierarchical Auido Segmentation with HMM and Factor Analysis in Broadcast News Domain

    Interspeech 2011. Florence (Italy) August 2011

  • D. Martínez, J. Villalba, A. Miguel, A. Ortega, E. Lleida

    I3A Language Recognition System for Albayzin 2010 LRE

    Interspeech 2011, Florence (Italy). August 2011

  • C. Vaquero, A. Ortega, E. Lleida

    Intra-session variability compensation and hypothesis generation and selection strategy for speaker segmentation

    International Conference on Acoustics, Speech and Signal Processing ICASSP 2011. Prague (CZech Republic). May 2011

  • C. Vaquero, A. Ortega, E. Lleida

    Partitioning of Two-Speaker Conversation Datasets

    Interspeech 2011. Florence (Italy) August 2011

Journal Papers

  • L. Buera, A. Miguel, O. Saz, A. Ortega, E. Lleida

    Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition

    IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 18, NO. 2, Februrary 2010. pp. 296-309

Conferences

  • O. Saz, E. Lleida, J. E. Garcia, A. Ortega

    A Prototype of Distributed Speech Technologies for the Development of Websites Accessible to the Blind Community

    FALA 2010 "VI Jornadas en Tecnología del Habla" and II Iberian SLTech Workshop. November. 2010

  • C. Vaquero, A. Ortega, J. A. Villalba, A. Miguel, E. Lleida

    Confidence Measures for Speaker Segmentation and their Relation to Speaker Verification

    Interspeech 2010, Makuhari (Japan). September 2010

  • C. Vaquero, A. Ortega, E. Lleida

    Intra-session variability compensation for speaker segmentation

    FALA 2010 "VI Jornadas en Tecnología del Habla" and II Iberian SLTech Workshop. November 2010

  • J. E. García, A. Ortega, A. Miguel, E. Lleida

    Non-Linear Predictive Vector Quantization of Feature Vectors for Distributed Speech Recognition

    Interspeech 2010, Makuhari (Japan). September 2010

  • J. E. García, A. Ortega, A. Miguel, E. Lleida. "

    Predictive vector quantization using the M-algorithm for distributed speech recognition

    FALA 2010 "VI Jornadas en Tecnología del Habla" and II Iberian SLTech Workshop. November 2010

  • D. Becerril, Oscar Saz, C. Vaquero, A. Ortega, E. Lleida

    Speaker Tree Generation for Model Selection in Automatic Speech Recognition

    FALA 2010 "VI Jornadas en Tecnología del Habla" and II Iberian SLTech Workshop. November 2010

  • D. Castán, A. Ortega, E. Lleida

    Speech/Music classification by using the C4.5 decision tree algorithm

    FALA 2010 "VI Jornadas en Tecnología del Habla" and II Iberian SLTech Workshop. November 2010

Conferences

  • J.E. García, A. Ortega, E. Lleida, T. Lozano, E. Bernués, D. Sánchez

    Audio and Text Synchronization for TV news Subtitling based on Automatic Speech Recognition

    IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB). May. 2009

  • J.E. García, A. Ortega, A. Miguel, E. Lleida

    Differential Vector Quantization of Feature Vectors for Distributed Speech Recognition

    10th Annual Conference of the International Speech Communication Association. INTERSPEECH 2009. September. 2009

  • A. Miguel, A. Ortega, L. Buera, E. Lleida

    Graphical Models for Discrete Hidden Markov Models in Speech Recognition

    10th Annual Conference of the International Speech Communication Association. INTERSPEECH 2009. September. 2009

  • A. Miguel, A. Ortega, Luis Buera, E. Lleida

    Local Projections and Support Vector Based Feature Selection in Speech Recognition

    10th Annual Conference of the International Speech Communication Association. INTERSPEECH 2009. September. 2009

  • A. Ortega, J.E. García, A. Miguel, E. Lleida

    Real-Time Live Broadcast News Subtitling System for Spanish

    10th Annual Conference of the International Speech Communication Association. INTERSPEECH 2009. September. 2009

  • A. Ortega, J.E. García, E. Lleida, E. Bernués, D. Sánchez, M. Ferrer

    Subtitulado en Tiempo Real de Informativos en Directo para la Televisión Mediante Reconocimiento Automático del Habla

    IV Congreso de Accesibilidad a los Medios Audiovisuales para Personas con Discapacidad. AMADIS 09. June 2009

  • L. Buera,, A. Miguel, A. Ortega, E. Lleida, R. M. Stern

    Unsupervised Training Scheme with Non-Stereo Data for Empirical Feature Vector Compensation

    10th Annual Conference of the International Speech Communication Association. INTERSPEECH 2009. September. 2009

Book Chapters

  • L. Buera, A. Miguel, E. Lleida, A. Ortega, O. Saz

    Cross-Probability Model Based on GMM for Feature Vector Normalization

    Chapter 14 in "In-Vehicle Corpus and Signal Processing for Driver Behavior" H. Abut, J.H.L. Hansen, Hakan Erdogan and K. Takeda (Eds.), Springer Science, New York, NY. 2009

Journal Papers

  • A. Miguel, E. Lleida, R. Rose, L. Buera, O. Saz, A. Ortega

    Capturing local variability for speaker normalization in speech recognition

    IEEE Trans on Audio, Speech and Language Processing, Vol 16, No 3, pp 578-593 . March 2008

Conferences

  • J.E. García, A. Ortega, A. Miguel, E. Lleida

    Arquitectura distrubuida para el desarrollo de sistemas de diálogo hablado, edecán

    V jornadas en tecnología del habla. November 2008

  • J. E. García, A. Ortega, A. Miguel, E. Lleida

    Cuantificación vectorial diferencial para la transmisión eficiente de parámetros acústicos en sistemas de reconocimiento automático del habla distribuido

    V Jornadas en tecnología del habla. November 2008

  • J. A. Villalba, C. Vaquero, E. Lleida, A. Ortega, A. Miguel, J. E. García, L. Buera, O. Saz

    Experiencia del I3A en la Evaluación de Reconocimiento de Locutor NIST 2008

    Jornadas de Reconocimiento Biométrico de Personas, Valladolid, España. September 2008

  • L. Buera, A. Miguel, O. Saz, A. Ortega, E. Lleida

    Feature Vector Normalization with Combined Standard and Throat Microphones for Robust ASR

    Interspeech 2008. September 2008

  • A. Miguel, E. Lleida, A. Ortega

    Generalized gaussians for continuous observation distributions in speech recognition

    V Jornadas en tecnología del habla. November 2008

  • A. Miguel, E. Lleida, A. Ortega

    Graphical models for discrete observation distributions in speech recognition

    V jornadas en tecnología del habla. November 2008

  • J.E. García, A. Ortega, A. Miguel, E. Lleida

    Sistema de reconocimiento automático del habla distribuido aplicado a entornos logísticos

    V jornadas en tecnología del habla. November 2008

Journal Papers

  • L. Buera, E. Lleida, A. Miguel, A. Ortega, O. Saz

    Cepstral Vector Normalization Based on Stereo Data for Robust Speech Recognition

    IEEE Trans. On Audio Speech and Language Processing, vol.15, pp.1098-1113. March 2007

Conferences

  • L. Buera, A. Miguel, E. Lleida, A. Ortega, O. Saz

    Cross-Probability Model based on GMM for Feature Vector Normalization in Car Environments

    Biennial on DSP for in-Vehicle and Mobile Systems, Istanbul, Turkey. June. 2007

  • P. García, A. Hernández, J. P. Martínez, I. Martinez, E. Mayordomo, A. Ortega, I. Salinas, J. R. Solera, L. Vicente

    Distribución de la Carga Discente: Estudio sobre las titulaciones del Centro Politécnico Superior de la Universidad de Zaragoza.

    II Jornadas de Innovación Educativa de la Escuela Politécnica Superior de Zamora. June 2007

  • L. Buera, A. Miguel, O. Saz, E. Lleida, A. Ortega

    Evaluation of the Combined Use of MEMLIN and MLLR on the Non-native Adaptation Task of Hiwire Project Database

    Interspeech, August. 2007

  • L. Buera, A. Miguel, E. Lleida, O. Saz, A. Ortega

    On the Jointly Unsupervised Feature Vector Normalization and Acoustic Model Compensation for Robust Speech Recognition

    Interspeech, August. 2007

  • A. Miguel, L. Buera, E. Lleida, A. Ortega, O. Saz

    On-Line Feature and Acoustic Model Space Compensation for Robust Speech Recognition in Car Environment

    IEEE Intelligent Vehicles Symposium. June. 2007

  • L. Buera, A. Miguel, E. Lleida, O. Saz, A. Ortega

    Robust Speech Recognition with on-line Unsupervised Acoustic Feature

    IEEE Automatic Speech Recognition and Understanding Workshop, ASRU, December 2007

Conferences

  • A. Uría, A. Ortega, M. I. Torres, A. Miguel, V. Guijarrubia, L. Buera, J. Garmendia, E. Lleida, O. Aizpuru, O. Saz

    A virtual butler controlled by speech

    IV Jornadas en Tecnología del Habla, Zaragoza, Spain. November 2006

  • J. P. Martínez, A. Ortega, A. Hernández, I. Salinas, P. García, L. Vicente, I. Martinez, J. Fernández

    Estudio de los perfiles y competencias profesionales en la titulación de Ingeniería de Telecomunicación

    IV Congreso Internacional de Docencia Universitaria e Innovación (CIDUI), pp.535 (ISBN: 84-8458-244-4), Barcelona (Spain). July 2006

  • P. García, J. P. Martínez, E. Mayordomo, A. Ortega , I. Salinas, J. R. Solera, L. Vicente

    Estudio sobre la carga de trabajo del estudiante en las titulaciones del Centro Politécnico Superior

    I Jornadas de Innovación Docente, Tecnologías de la Información y la Comunicación e Investigación Educativa en la Universidad de Zaragoza. November 2006

  • J. P. Martínez, A. Ortega, A. Hernández, I. Salinas, P. García, L. Vicente, I. Martinez, J. Fernández

    Evaluación de la carga discente de la titulación de Ingeniería de Telecomunicación: asignación de créditos ECTS

    IV Congreso Internacional de Docencia Universitaria e Innovación (CIDUI), pp. 288 (ISBN: 84-8458-244-4), Barcelona (Spain). July 2006

  • A. Miguel, E. Lleida, A. Juan, L. Buera, A. Ortega, O. Saz

    Local Transformation Models for Speech Recognition

    in Interspeech - International Conference on Spoken Language Processing, ICSLP. Pittsburgh, USA, Sept 2006, pp. 1598–1601. September 2006

  • A. Ortega, E. Lleida, E. J. Masgrau, L. Buera, A. Miguel

    Stability Control in a Two-Channel Speech Reinforcement System for Vehicles

    International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006). May 2006

  • O. Saz, A. Miguel, E. Lleida, A. Ortega, L. Buera

    Study of Time and Frequency Variability in Pathological Speech and Error Reduction Methods for Automatic Speech Recognition

    International Conference on Spoken Language Processing, (ICSLP 2006). September 2006

  • L. Buera, E. Lleida, A. Miguel, A. Ortega, O. Saz

    Time-dependent Cross-Probability Model for Feature Vector Normalization

    IV Jornadas en Tecnología del Habla, Zaragoza, Spain. November 2006

  • L. Buera, E. Lleida, J. A. Nolazco-Flores, A. Miguel, A. Ortega

    Time-dependent cross-probability model for Multi-Environment Model based LInear Normalization

    International Conference on Spoken Language Processing, (ICSLP 2006). September 2006

  • L. Buera, E. Lleida, J. D. Rosas, J. Villalba, A. Miguel , A. Ortega , O. Saz

    Verificación e Identificación de Locutor con Normalización de Vectores de Características en Entornos Acústicos Adversos

    Terceras Jornadas de Reconocimiento Biométrico de Personas, Sevilla, Spain. November 2006

Book Chapters

  • A. Ortega, E. Lleida, E. Masgrau, L. Buera, A. Miguel

    Acoustic Echo Reduction in a Two-Channel Speech Reinforcement System for Vehicles

    Chapter 15 in "Digital Signal Processing for In-Vehicle and Mobile Systems 2", H. Abut, J.H.L. Hansen and K. Takeda (Eds.), Springer Science, New York, NY. May 2006

Journal Papers

  • P. García, J. de Mingo, A. Valdovinos, A. Ortega

    An Adaptive digital method of imbalances cancellation in LINC transmitters

    IEEE Transactions on Vehicular Technology, vol. 54, no. 3, pp. 879-888. May 2005

  • P. García, A. Ortega, J. de Mingo, A. Valdovinos

    Nonlinear Distortion Cancellation using LINC Transmitters in OFDM Systems

    IEEE Transactions on Broadcasting, vol. 51, no. 1, pp. 84-93. March 2005

  • A. Ortega, E. Lleida, E. Masgrau

    Speech reinforcement system for car cabin communications

    IEEE Transactions on Speech and Audio Processing. vol. 13 no. 5. pp. 917-929. September 2005

Conferences

  • A. Ortega, E. Lleida, E. Masgrau, L. Buera, A. Miguel

    Acoustic Echo Reduction in a Two-Channel Speech Reinforcement System for Vehicles

    Biennial on DSP for in-Vehicle and Mobile Systems Sesimbra, Portugal, September 2-3. Septiembre 2005

  • A. Ortega, E. Lleida, E. Masgrau, L. Buera, A. Miguel

    Acoustic Feedback Cancellation in Speech Reinforcement System for Vehicles

    Interspeech'2005 - Eurospeech — 9th European Conference on Speech Communication and Technology. September 2005

  • A. Miguel, E. Lleida, R. Rose, L. Buera, A. Ortega

    Augmented State Space Acoustic Decoding for Modeling Local Variability in Speech

    Interspeech'2005 - Eurospeech — 9th European Conference on Speech Communication and Technology. September 2005

  • L. Buera, E. Lleida, A. Miguel, A. Ortega

    Multi-Environment Linear Normalization for robust speech analysis in cars

    Biennial on DSP for in-Vehicle and Mobile Systems Sesimbra, Portugal, September 2005

  • L. Buera, E. Lleida, A. Miguel, A. Ortega

    Recent Advances in PD-MEMLIN for Speech Recognition in Car Conditions

    IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2005. San Juan, Puerto Rico. November 2005

  • L. Buera, E. Lleida, A. Miguel, A. Ortega

    Robust Speech Recognition in Cars using Phoneme Dependent Multi-Environment Linear Normalization

    Interspeech'2005 - Eurospeech — 9th European Conference on Speech Communication and Technology. September 2005

  • L. Buera , E. Lleida, J. D. Rosas, J. Villalba, A. Miguel , A. Ortega, O. Saz

    Speaker verification and identification using Phoneme Dependent Multi-Environment Models based LInear Normalization in adverse and dynamic acoustic environments

    In Proc. "Summer School for Advanced studies on Biometrics for Secure Authentication: Multimodality ans System Integration", Alghero, Italy. June 2005

  • E. Masgrau, A. Ortega, P. Ramos, L. Vicente, E. Lleida

    Tratamiento Robusto del Sonido en el Interior de Vehículos

    XX Simposium Nacional de la Unión Científica Internacional de Radio URSI 2005. Gandía (Valencia). September 2005

Conferences

  • P. García, J. de Mingo, A. Valdovinos, A. Ortega

    A Novel Digital Imbalances Cancellation Method in LINC Transmitters

    The Seventh International Symposium on WIRELESS PERSONAL MULTIMEDIA COMMUNICATIONS. September 2004

  • P. García, J. de Mingo, A. Valdovinos, A. Ortega

    Adaptive Digital Correction of Gain and Phase Imbalances in LINC Transmitters

    2004 IEEE 59TH Vehicular Technolgy Conference, VTC2004-Spring. Milan (Italy),CD Procedings. May. 2004

  • P. García, J. de Mingo, A. Valdovinos, A. Ortega

    Adaptive Imbalances Correction in LINC Transmitters

    The 5th European Wireless Conference: Mobile and Wireless Systems beyond 3G, Barcelona (Spain) Procedings, Pp. 235-239. February 2004

  • O. Saz, L. Buera, E. Lleida, A. Miguel, A. Ortega

    Algoritmos de Compensacion de Caracteristicas Cepstrales para Reconocimiento Automatico del Habla Robusto

    III Jornadas en Tecnologias del Habla, Valencia, España. November 2004

  • O. Saz, L. Buera , E. Lleida, A. Miguel , A. Ortega

    Algoritmos de Compensación de Características Cepstrales para Reconocimiento Automático del Habla Robusto

    III Jornadas en Tecnología del Habla. November 2004

  • A. Ortega, F. Sukno, E. Lleida, A. Frangi, A. Miguel, L. Buera

    AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition

    4th International Conferencece on Language Resources and Evaluation, Lisboa, Portugal. May 2004

  • L. Buera , E. Lleida, A. Miguel , A. Ortega, O. Saz

    Avances en la Normalizacion Cepstral con Señal Estereo para el Reconocimiento Robusto de Voz en el Entorno del Vehiculo

    III Jornadas de Tecnologías del Habla, Valencia, España. November 2004

  • L. Buera , E. Lleida , A. Ortega , A. Miguel , O. Saz

    Avances en la Normalización Cepstral con Señal Estéreo para el Reconocimiento Robusto de Voz en el Entorno del Vehículo

    III Jornadas en Tecnología del Habla,. November 2004

  • A. Ortega , F. Sukno, E. Lleida , A. Frangi , A. Miguel , L. Buera , E. Zacur

    Base de Datos Audiovisual y Multicanal en Castellano para Reconocimiento Automático del Habla Multimodal en el Automóvil

    III Jornadas en Tecnología del Habla. November 2004

  • A. Miguel , Richard , E. Lleida , L. Buera , A. Ortega , O. Saz

    Decodificador Eficiente para Normalización del Tracto Vocal en Reconocimiento Automático del Habla en Tiempo Real

    III Jornadas en Tecnología del Habla. November 2004

  • L. Buera, E. Lleida, A. Miguel, A. Ortega

    Multi-environment models based linear normalization for robust speech recognition

    Proceedings of the International Conference "Speech and Computer", SPECOM-2004, St. Petersburg, Russia. September 2004

  • L. Buera, E. Lleida, A. Miguel, A. Ortega

    Multi-Environments Model Based Linear Normalization for speech recognition in Car Conditions

    Proceedings of the International Conference on Audio, Speech and Signal Processing, ICASSP-2004, Montreal, Canada. May 2004

  • P. García, A. Ortega, J. de Mingo, A. Valdovinos

    Nonlinear Distortion Cancellation in Ofdm Systems using an Adaptive Linc Structure

    2004 15th International Symposium on Personal, Indoor and Mobile Radio Communications (pimrc 2004). September 2004

  • E. A. Viruete , C. Hernández , J. Ruiz , J. Fernández , A. Alesanco , E. Lleida , A. Ortega , A. Hernández , A. Valdovinos , J. García

    Sistema de telemonitorización en vehículos de emergencias médicas sobre UMTS

    Actas del XXII Congreso Anual de la Sociedad Española de Ingeniería Biomédica CASEIB 2004, Santiago de Compostela, pp. 111-114. November 2004

Conferences

  • P. García , J. de Mingo , A. Valdovinos , A. Ortega

    Método adaptativo para el equilibrio de las ramas de un transmisor LINC

    XVIII Simposiun Nacional de U.R.S.I. (Union of Radio Science International).Libro de Actas: CD-ROM, ISBN-84-9749-081-9. September 2003

  • A. Ortega , E. Lleida , E. Masgrau

    Residual Echo Power Estimation for Speech Reinforcement Systems in Vehicles

    Proceedings de EUROSPEECH’03. Ginebra (Suiza). September 2003

Conferences

  • A. Ortega , E. Lleida , E. Masgrau , F. Gallego

    Cabin car communication system to improve communication inside a car

    Proccedings of IEEE Int. Conf. on Acoustics, Speech and Signal Processing ICASSP'02. Orlando (USA), vol. 4, pp. 386-389. May 2002

  • A. Ortega , E. Lleida , E. Masgrau

    DSP to improve oral communications inside vehicles

    Proceedings of European Signal Processing Conference EUSIPCO'02. Toulouse (France). September 2002

  • E. Lleida , E. Masgrau , A. Ortega , A. Miguel , L. Buera

    Reconocimiento Automático del Habla en vehículos, resultados con Speech-Dat Car

    Speech-Dat Car database results. December 2002

  • E. Lleida , E. Masgrau , A. Ortega , A. Miguel

    Reconocimiento Automático del Habla en Vehículos, Resultados con Speech-Dat Car

    Libro de Actas Jornadas en Tecnologías del Habla. Granada 2002

  • A. Ortega , E. Lleida , E. Masgrau

    Speech reinforce inside vehicles

    Proceedings of the 21st International Conference of the Audio Engineering Society. AES 2002. St. Petersburg (Russia). pp. 91-99. June 2002

Conferences

  • E. Lleida , E. J. Masgrau , A. Ortega

    Acoustic Echo Control and Noise Reduction for Cabin Car Communication

    Proccedings of European Conference on Speech Communication and Technology EUROSPEECH'01. Aalborg (Denmark), vol. 3, pp 1585-1588. September 2001

  • A. Ortega , E. Lleida , E. Masgrau

    Sistema de Comunicación oral para el interior de automóviles

    Libro de Actas Simposium Nacional de la Unión Internacional de Radio URSI 01. Madrid. 541-542. September 2001

Conferences

  • A. Ortega, E. J. Masgrau , E. Lleida

    Control activo de ruido con ecualización del camino secundario

    Libro de Actas Simposium Nacional de la Unión Internacional de Radio URSI 00. Zaragoza. 55-56. September 2000