Publications

Complete List

Explore journal papers, conferences, preprints, and books grouped by year.

Curated selection of peer-reviewed articles, conference papers, and preprints.

2025

Journal Papers

Estevez, M.; Bonomi, C.; Ribas, D.; Ortega, A.; Ferrer, L.
Beyond Global Metrics: A Fairness Analysis for Interpretable Voice Disorder Detection Systems
Journal of Voice, 2025
DOI / VIEW PAPER

Conferences

Almudévar, A.; Hernández-Lobato, J. M.; Khurana, S.; Marxer, R.; Ortega, A.
Aligning Multimodal Representations through an Information Bottleneck
International Conference on Machine Learning (pp. 1250-1270). PMLR
DOI / VIEW PAPER

2024

Preprints

Mingote, V.; Ortega, A.; Miguel, A.; Lleida, E.
Audio-visual speaker diarization: Current databases, approaches and challenges
arXiv preprint arXiv:2409.05659
VIEW ARXIV

Journal Papers

Vidal, J.; Ribas, D.; Bonomi, C.; Lleida, E.; Ferrer, L.; Ortega, A.
Automatic voice disorder detection from a practical perspective
Journal of Voice, 2024
DOI / VIEW PAPER

Conferences

Lebourdais, M.; Gimeno, P.; Mariotte, T.; Tahon, M.; Ortega, A.; Larcher, A.
3MAS: a multitask, multilabel, multidataset semi-supervised audio segmentation model
Proc. Odyssey 2024 (pp. 232-239)
DOI / VIEW PAPER
Gimeno, P.; Ortega, A.
Advances in Binary and Multiclass Audio Segmentation with Deep Learning Techniques: A PhD Thesis Overview
Proc. IberSPEECH 2024 (pp. 237-241)
DOI / VIEW PAPER
Mariotte, T.; Almudévar, A.; Tahon, M.; Ortega, A.
An explainable proxy model for Multilabel audio segmentation
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 531-535). IEEE
DOI / VIEW PAPER
Pastor, M. A.; Ortega, A.; Ribas, D.
Analysis of the domain mismatch problem in the Speech Emotion Recognition Task
Proc. IberSPEECH 2024
DOI / VIEW PAPER
Rubio Felipo, S.; Ribas González, D.; Lleida Solano, E.; Ortega Giménez, A.; Miguel Artiaga, A.
Assessing the Impact and Potential of TTS for Pathological Voice Data Augmentation on Pathology Detection Systems
Proc. IberSPEECH 2024 (pp. 41-45)
DOI / VIEW PAPER
Mingote, V.; Ortega, A.; Miguel, A.; Lleida, E.
Encouraging Internal Representations with Speaker Information in End-to-End Neural Diarization by Adding Speaker Loss
Proc. IberSPEECH 2024 (pp. 191-195)
DOI / VIEW PAPER
Lebourdais, M.; Mariotte, T.; Almudévar, A.; Tahon, M.; Ortega, A.
Explainable by-design Audio Segmentation through Non-Negative Matrix Factorization and Probing
Proc. Interspeech 2024
DOI / VIEW PAPER
Almudévar, A.; Mariotte, T.; Ortega, A.; Tahon, M.; Vicente, L.; Miguel, A.; Lleida, E.
Predefined Prototypes for Intra-Class Separation and Disentanglement
Proc. Interspeech 2024 (pp. 3809-3813)
DOI / VIEW PAPER
García Cutando, M.; Lleida Solano, E.; Bazán Gil, V.; Ortega Giménez, A.; Miguel Artiaga, A.
Semantic Information Retrieval through Autonomous Agents
Proc. IberSPEECH 2024
DOI / VIEW PAPER
Pastor, M. Á.; Ortega, A.; Miguel, A.; Ribas, D.
The ViVoLab System for the Odyssey Emotion Recognition Challenge 2024 Evaluation
Proc. Odyssey 2024 (pp. 274-280)
DOI / VIEW PAPER
Almudévar, A.; Mariotte, T.; Ortega, A.; Tahon, M.
Unsupervised multiple domain translation through controlled disentanglement in variational autoencoder
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 7010-7014). IEEE
DOI / VIEW PAPER

2023

Journal Papers

Lleida, E.; Rodriguez-Fuentes, L. J.; Tejedor, J.; Ortega, A.; Miguel, A.; Bazán, V.; Arzelus, H.
An Overview of the IberSpeech-RTVE 2022 Challenges on Speech Technologies
Applied Sciences, 13(15), 8577
DOI / VIEW PAPER
Ribas, D.; Pastor, M. A.; Miguel, A.; Martínez, D.; Ortega, A.; Lleida, E.
Automatic voice disorder detection using self-supervised representations
IEEE Access, 11, 14915-14927
DOI / VIEW PAPER
Mingote, V.; Miguel, A.; Ortega, A.; Lleida, E.
Class token and knowledge distillation for multi-head self-attention speaker verification systems
Digital Signal Processing, vol. 133, 2023
DOI / VIEW PAPER
Pastor, M. A.; Ribas, D.; Ortega, A.; Miguel, A.; Lleida, E.
Cross-corpus training strategy for speech emotion recognition using self-supervised representations
Applied Sciences, 13(16), 9062
DOI / VIEW PAPER
Barrio, R.; Lozano, Á.; Mayora-Cebollero, A.; Mayora-Cebollero, C.; Miguel, A.; Ortega, A.; Vigara, R.
Deep learning for chaos detection
Chaos: An Interdisciplinary Journal of Nonlinear Science, 33(7)
DOI / VIEW PAPER

Conferences

López-Espejo, I.; Prieto, S.; Ortega, A.; Lleida, E.
Improved Vocal Effort Transfer Vector Estimation For Vocal Effort-Robust Speaker Verification
2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP) (pp. 1-6). IEEE
DOI / VIEW PAPER
Ribas, D.; Miguel, A.
On the Problem of Data Availability in Automatic Voice Disorder Detection
Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023) - Volume 5: HEALTHINF; ISBN 978-989-758-631-6, SciTePress, pages 330-337. DOI: 10.5220/0011669300003414
DOI / VIEW PAPER
Almudévar, A.; Ortega, A.; Vicente, L.; Miguel, A.; Lleida, E.
Variational Classifier for Unsupervised Anomalous Sound Detection under Domain Generalization
Proc. Interspeech 2023 (pp. 2823-2827)
DOI / VIEW PAPER

2022

Journal Papers

Mingote, V.; Miguel, A.; Ribas, D.; Ortega, A.; Lleida, E.
aDCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems
IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 30, pp. 772-784, 2022
DOI / VIEW PAPER
Mingote, V.; Viñals, I.; Gimeno, P.; Miguel, A.; Ortega, A.; Lleida, E.
Multimodal Diarization Systems by Training Enrollment Models as Identity Representations
Applied Sciences, vol. 12, no. 3, pp. 1141, 2022
DOI / VIEW PAPER
Prieto, S.; Ortega, A.; López-Espejo, I.; Lleida, E.
Shouted and whispered speech compensation for speaker verification systems
Digital Signal Processing, vol. 127, pp. 103536, 2022
DOI / VIEW PAPER
Gimeno, P.; Ribas, D.; Ortega, A.; Miguel, A.; Lleida, E.
Unsupervised Adaptation of Deep Speech Activity Detection Models to Unseen Domains
Applied Sciences, vol. 12, no. 4, pp. 1832, 2022
DOI / VIEW PAPER
Almudévar, A.; Sevillano, P.; Vicente, L.; Preciado-Garbayo, J.; Ortega, A.
Unsupervised Anomaly Detection Applied to Φ-OTDR
Sensors, vol. 22, no. 17, pp. 6515, 2022
DOI / VIEW PAPER
Ribas, D.; Miguel, A.; Ortega, A.; Lleida, E.
Wiener Filter and Deep Neural Networks: A Well-Balanced Pair for Speech Enhancement
Applied Sciences, vol. 12, no. 18, pp. 9000, 2022
DOI / VIEW PAPER

Conferences

Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E.
A Study on the Use of wav2vec Representations for Multiclass Audio Segmentation
IberSPEECH 2022. Granada, Spain. November 2022
DOI / VIEW PAPER
Pastor, M.; Ribas, D.; Ortega, A.; Miguel, A.; Lleida, E.
Cross-Corpus Speech Emotion Recognition with HuBERT Self-Supervised Representation
IberSPEECH 2022. Granada, Spain. November 2022
DOI / VIEW PAPER
Ribas, D.; Pastor, M.; Miguel, A.; Martínez, D.; Ortega, A.; Lleida, E.
S3prl-Disorder: Open-Source Voice Disorder Detection System based in the Framework of S3PRL-toolkit
IberSPEECH 2022. Granada, Spain. November 2022
DOI / VIEW PAPER
Miguel, A.; Ortega, A.; Lleida, E.
ViVoLAB System Description for the S2TC IberSPEECH-RTVE 2022 challenge
IberSPEECH 2022. Granada, Spain. November 2022

2021

Journal Papers

Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E.
Generalising AUC Optimisation to Multiclass Classification for Audio Segmentation with Limited Training Data
IEEE Signal Processing Letters, 28, pp. 1135-1139, 2021
DOI / VIEW PAPER
Llombart, J.; Ribas, D.; Miguel, A.; Vicente, L.; Ortega, A.; Lleida, E.
Progressive Loss Functions for Speech Enhancement with Deep Neural Networks
EURASIP Journal on Audio, Speech, and Music Processing, 2021 (1), pp. 1-16, 2021
DOI / VIEW PAPER
Viñals, I.; Ortega, A.; Miguel, A.; Lleida, E.
The Domain Mismatch Problem in the Broadcast Speaker Attribution Task
Applied Sciences, vol. 11, no. 18, p. 8521, Sept. 2021
DOI / VIEW PAPER

Conferences

Mingote, V.; Miguel, A.; Ortega, A.; Lleida, E.
Memory Layers with Multi-Head Attention Mechanisms for Text-Dependent Speaker Verification
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021-June, 2021
DOI / VIEW PAPER
Gimeno, P.; Ribas, D.; Ortega, A.; Miguel, A.; Lleida, E.
Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions
Fifth International Conference, IberSPEECH 2020. Valladolid, Spain. March 2021
DOI / VIEW PAPER
Viñals, I.; Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E.
Diarization and Identity Attribution Compatibility in the Albayzin 2020 Challenge
Fifth International Conference, IberSPEECH 2020. Valladolid, Spain. March 2021
DOI / VIEW PAPER
Mingote, V.; Miguel, A.; Ortega, A.; Lleida, E.
Log-Likelihood-Ratio Cost Function as Objective Loss for Speaker Verification Systems
22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. Brno, Czech Republic. September 2021
DOI / VIEW PAPER
Gimeno, P.; Ortega, A.; Miguel, A.; Lleida, E.
Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021
22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. Brno, Czech Republic. September 2021
DOI / VIEW PAPER
Mingote, V.; Viñals, I.; Gimeno, P.; Miguel, A.; Ortega, A.; Lleida, E.
ViVoLAB Multimodal Diarization System for RTVE 2020 Challenge
Fifth International Conference, IberSPEECH 2020. Valladolid, Spain. March 2021
DOI / VIEW PAPER

2020

Journal Papers

P. Gimeno, I. Viñals, A. Ortega, A. Miguel, E. Lleida
Multiclass audio segmentation based on recurrent neural networks for broadcast domain data
EURASIP Journal on Audio, Speech, and Music Processing, 5, March 2020
DOI / VIEW PAPER
V. Mingote, A. Miguel, A. Ortega, E. Lleida
Optimization of the area under the ROC curve using neural network supervectors for text-dependent speaker verification
Computer Speech & Language, vol. 63, Sept. 2020
DOI / VIEW PAPER

Conferences

V. Mingote, A. Miguel, D. Ribas, A. Ortega, E. Lleida
Knowledge Distillation and Random Erasing Data Augmentation for Text-Dependent Speaker Verification
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2020, pp. 6824-6828
DOI / VIEW PAPER
P. Gimeno, V. Mingote, A. Ortega, A. Miguel, E. Lleida
Partial AUC Optimisation using Recurrent Neural Networks for Music Detection with Limited Training Data
21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2020
S. Prieto, A. Ortega, I. López-Espejo, E. Lleida
Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions
21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2020
V. Mingote, A. Miguel, A. Ortega, E. Lleida
Training Speaker Enrollment Models by Network Optimization
21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020. Shanghai, China. October 2020

2019

Journal Papers

E. Lleida, A. Ortega, A. Miguel, V. Bazán-Gil, C. Pérez, M. Gómez, A. de Prada
Albayzin 2018 Evaluation: The IberSpeech-RTVE Challenge on Speech Technologies for Spanish Broadcast Media
Applied Sciences, vol. 9, no. 24, p. 5412, Dec. 2019
DOI / VIEW PAPER
I. Viñals, A. Ortega, A. Miguel, E. Lleida
An Analysis of the Short Utterance Problem for Speaker Characterization
Applied Sciences, vol. 9, no. 18, p. 3697, Sep. 2019
DOI / VIEW PAPER
V. Mingote, A. Miguel, A. Ortega, E. Lleida
Supervector Extraction for Encoding Speaker and Phrase Information with Neural Networks for Text-Dependent Speaker Verification
Applied Sciences, vol. 9, no. 16, p. 3295, Aug. 2019
DOI / VIEW PAPER
I. Viñals, A. Ortega, J. Villalba, A. Miguel, E. Lleida
Unsupervised adaptation of PLDA models for broadcast diarization
EURASIP Journal on Audio, Speech, and Music Processing, 24, Dec. 2019
DOI / VIEW PAPER

Conferences

V. Mingote, D. Castan, M. McLaren, M. Kumar Nandwana, A. Ortega, E. Lleida, A. Miguel
Language Recognition using Triplet Neural Networks
20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019
V. Mingote, A. Miguel, D. Ribas, A. Ortega, E. Lleida
Optimization of False Acceptance/Rejection Rates and Decision Threshold for End-to-End Text-Dependent Speaker Verification Systems
20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019
I. Viñals, D. Ribas, V. Mingote, J. Llombart, P. Gimeno, A. Miguel, A. Ortega, E. Lleida
Phonetically-aware embeddings, Wide Residual Networks with Time-Delay Neural Networks and Self Attention models for the 2018 NIST Speaker Recognition Evaluation
20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019
J. Llombart, D. Ribas Gonzalez, A. Miguel, L. Vicente, A. Ortega, E. Lleida
Progressive Speech Enhancement with Residual Connections
20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019
J. Llombart, D. Ribas Gonzalez, A. Miguel, L. Vicente, A. Ortega, E. Lleida
Speech Enhancement with Wide Residual Networks in Reverberant Environments
20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019
I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida
ViVoLAB Speaker Diarization System for the DIHARD 2019 Challenge
20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. Graz, Austria. September 2019

2018

Conferences

P. Gimeno, I. Viñals, A. Ortega, A. Miguel, E. Lleida
A Recurrent Neural Network Approach to Audio Segmentation for Broadcast Domain Data
Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018
V. Mingote, A. Miguel, A. Ortega, E. Lleida
Differentiable Supervector Extraction for Encoding Speaker and Phrase Information in Text Dependent Speaker Verification
Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018
I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida
Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge
19th Annual Conference of the International Speech Communication Association, INTERSPEECH 2018. Hyderabad, India. September 2018
I. Viñals, P. Gimeno, A. Ortega, A. Miguel, E. Lleida
In-domain Adaptation Solutions for the RTVE 2018 Diarization Challenge
Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018
I. Viñals, A. Ortega, A. Miguel, E. Lleida
Phonetic Variability Influence on Short Utterances in Speaker Verification
Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018
J. Llombart, A. Miguel, A. Ortega, E. Lleida
Wide Residual Networks 1D for Automatic Text Punctuation
Fourth International Conference, IberSPEECH 2018. Barcelona, Spain. November 2018

2017

Conferences

I. Viñals, J. Villalba, A. Ortega, A. Miguel, E. Lleida
Domain Adaptation of PLDA models in Broadcast Diarization by means of Unsupervised Speaker Clustering
18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. Stockholm, Sweden. August 2017
A. Miguel, J. Llombart, E. Lleida, A. Ortega
Tied Hidden Factors in Neural Networks for End-to-End Speaker Recognition
18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. Stockholm, Sweden. August 2017

2016

Journal Papers

J. Villalba, A. Ortega, A. Miguel, E. Lleida
Analysis of Speech Quality Measures for the Task of Estimating the Reliability of Speaker Verification Decisions
Speech Communication, Elsevier. April 2016
J. Villalba, A. Miguel, A. Ortega, E. Lleida
Bayesian Networks to Model the Variability of Speaker Verification Scores in Adverse Environments
IEEE/ACM Transactions on Audio, Speech, and Language Processing. December 2016

Conferences

J. Olcoz, P. Gimeno, A. Ortega, A. Arguedas, A. Miguel, E. Lleida
Automatic Text-to-Audio Alignment of Multimedia Broadcast Content
Third International Conference, IberSPEECH 2016. Lisbon, Portugal. November 2016
I. Viñals, J. Villalba, A. Ortega, A. Miguel, E. Lleida
Bottleneck Based Front-End for Diarization Systems
Third International Conference, IberSPEECH 2016. Lisbon, Portugal. November 2016
J. Llombart, A. Miguel, E. Lleida, A. Ortega
Character Sequence to Sequence Applications: Subtitle Segmentation and Part-of-Speech Tagging
Third International Conference, IberSPEECH 2016. Lisbon, Portugal. November 2016
J. Olcoz, J. Llombart, A. Miguel, A. Ortega, E. Lleida
The ViVoLab-I3A-UZ System for Albayzin 2016 Search-on-Speech Evaluation
Third International Conference, IberSPEECH 2016. Lisbon, Portugal. November 2016

2015

Journal Papers

D. Castan, D. Tavarez, P. Lopez-Otero, J. Franco-Pedroso, H. Delgado, E. Navas, L. Docio-Fernández, D. Ramos, J. Serrano, A. Ortega, E. Lleida
Albayzín-2014 evaluation: audio segmentation and classification in broadcast news domains
EURASIP Journal on Audio, Speech, and Music Processing. December 2015
D. Martinez, E. Lleida, P. Green, H. Christensen, A. Ortega, A. Miguel
Intelligibility Assessment and Speech Recognizer Word Accuracy Rate Prediction for Dysarthric Speakers in a Factor Analysis Subspace
ACM Transactions on Accessible Computing 6(3):1-21 June 2015

Conferences

J. Villalba, A. Miguel, A. Ortega, E. Lleida
Spoofing Detection with DNN and One-class SVM for the ASVspoof 2015 Challenge
16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015. Dresden, Germany. September 2015
J. Villalba, A. Miguel, A. Ortega, E. Lleida
Variational Bayesian PLDA for Speaker Diarization in the MGB Challenge
2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015). Scottsdale, Arizona, USA. December 2015

2014

Journal Papers

D. Castan, A. Ortega, A. Miguel, E. Lleida
Audio Segmentation-by-Classification Approach Based on Factor Analysis in Broadcast News Domain
EURASIP Journal on Audio, Speech, and Music Processing. Vol. 34. August 2014
J. E. García, A. Ortega, A. Miguel, E. Lleida
Low Bit Rate Compression Methods of Feature Vectors for Distributed Speech Recognition
Speech Communication, Elsevier. Vol. 58. pp. 111-123. March 2014

Conferences

D. Castán, A. Ortega, A. Miguel, E. Lleida
A preliminary study of Acoustic Events Classification with Factor Analysis in Meeting Rooms
Iberspeech 2014. Las Palmas de Gran Canaria (Spain) November 2014
A. Miguel, J. Olcoz, J. Villalba, A. Ortega, E. Lleida
Albayzin 2014 Search on Speech @ ViVolab UZ
Iberspeech 2014. Las Palmas de Gran Canaria (Spain) November 2014
J. Olcoz, A. Ortega, A. Miguel, E. Lleida
Confidence Measures in Automatic Speech Recognition for Error Detection in Restricted Domains
Iberspeech 2014. Las Palmas de Gran Canaria (Spain) November 2014
A. Miguel, J. Villalba, A. Ortega, E. Lleida, C. Vaquero
Factor Analysis with Sampling Methods for Text Dependent Speaker Recognition
15th Annual Conference of the International Speech Communication Association, INTERSPEECH 2014. Singapore. September 2014
A. Arguedas, E. Lleida, A. Ortega, A. Miguel, J. E. García
Subtitling Tools Based On Automatic Speech Recognition
Iberspeech 2014. Las Palmas de Gran Canaria (Spain) November 2014
D. Martínez, J. Villalba, E. Lleida, A. Ortega
Unsupervised Accent Modeling for Language Identification
Iberspeech 2014. Las Palmas de Gran Canaria (Spain) November 2014

2013

Journal Papers

C. Vaquero, A. Ortega, A. Miguel, E. Lleida
Quality Assessment for Speaker Diarization and its Application in Speaker Characterization
IEEE Transactions on Audio, Speech, and Language Processing. Vol. 21. pp. 816-827. April 2013

Conferences

J. Villalba, E. Lleida, A. Ortega, A. Miguel
A New Bayesian Network to Assess the Reliability of Speaker Verification Decisions
14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013. Lyon (France) August 2013
D. Castán, A. Ortega, A. Miguel, E. Lleida
Broadcast News Segmentation with Factor Analysis System
SLAM 2013 Speech, Language and Audio in Multimedia. Marseille (France) August 2013
D. Martinez, E. Lleida, A. Ortega, A. Miguel
Prosodic features and formant modeling for an ivector-based language recognition system
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Vancouver (Canada) May 2013
D. Castán, A. Ortega, J. Villalba, A. Miguel, E. Lleida
Segmentation-by-classification system based on factor analysis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Vancouver (Canada) May 2013
D. Martínez, D. Ribas, E. Lleida, A. Ortega, A. Miguel
Suprasegmental Information Modelling for Autism Disorder Spectrum and Specific Language Impairment Classification
14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013. Lyon (France) August 2013
J. Villalba, E. Lleida, A. Ortega, A. Miguel
The I3A Speaker Recognition System for NIST SRE12: Post-evaluation Analysis
14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013. Lyon (France) August 2013

2012

Conferences

A. Ortega, C. Vaquero, A. Miguel, E. Lleida
Diarization for Speaker Characterization
VI Jornadas de Reconocimiento Biométrico de Personas JRBP 2012. Las Palmas de Gran Canaria (Spain) February 2012
D. Ribas, J. E. Garcia, A. Miguel, A. Ortega, E. Lleida, J. R. Calvo
Evaluation of a New Beam-Search Formant Tracking Algorithm in Noisy Environments
Iberspeech 2012. Madrid (Spain) November 2012
D. Castán, A. Ortega, E. Lleida
Factor Analysis Segmentation and Classification in Broadcast News Domain
Iberspeech 2012. Madrid (Spain) November 2012
J. Villalba, E. Lleida, A. Ortega, A. Miguel
Reliability Estimation of the Speaker Verification Decisions Using Bayesian Networks to Combine Information from Multiple Speech Quality Measures
Iberspeech 2012. Madrid (Spain) November 2012
D. Martínez, E. Lleida, A. Ortega, A. Miguel, J. Villalba
Score Level versus Audio Level Fusion for Voice Pathology Detection on the Saarbrücken Voice Database
Iberspeech 2012. Madrid (Spain) November 2012
L. J. Rodriguez, M. Penagarikano, A. Varona, M. Diez, G. Bordel, A. Abad, D. Martinez, J. Villalba, A. Ortega, E. Lleida
The BLZ Submission to the NIST 2011 LRE: Data Collection, System Development and Performance
13th Annual Conference of the International Speech Communication Association. Interspeech 2012. Portland (Oregon, USA) September 2012
D. Martínez, E. Lleida, A. Ortega, A. Miguel, J. Villalba
Voice Pathology Detection on the Saarbrücken Voice Database with Calibration and Fusion of Scores Using MultiFocal Toolkit
Iberspeech 2012. Madrid (Spain) November 2012

2011

Journal Papers

A. Miguel, A. Ortega, L. Buera, E. Lleida
Bayesian Networks for Discrete Observation Distributions in Speech Recognition
IEEE Transactions on Audio, Speech, and Language Processing. Vol. 19. No. 6. pp. 1476-1489. August 2011

Conferences

D. Castán, C. Vaquero, A. Ortega, D. Martínez, J. Villalba, E. Lleida
Hierarchical Audio Segmentation with HMM and Factor Analysis in Broadcast News Domain
Interspeech 2011. Florence (Italy) August 2011
D. Martínez, J. Villalba, A. Miguel, A. Ortega, E. Lleida
I3A Language Recognition System for Albayzin 2010 LRE
Interspeech 2011, Florence (Italy). August 2011
C. Vaquero, A. Ortega, E. Lleida
Intra-session variability compensation and hypothesis generation and selection strategy for speaker segmentation
International Conference on Acoustics, Speech and Signal Processing ICASSP 2011. Prague (Czech Republic). May 2011
C. Vaquero, A. Ortega, E. Lleida
Partitioning of Two-Speaker Conversation Datasets
Interspeech 2011. Florence (Italy) August 2011

2010

Journal Papers

L. Buera, A. Miguel, O. Saz, A. Ortega, E. Lleida
Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition
IEEE Transactions on Audio, Speech, and Language Processing. Vol. 18. No. 2. pp. 296-309. February 2010

Conferences

O. Saz, E. Lleida, J. E. Garcia, A. Ortega
A Prototype of Distributed Speech Technologies for the Development of Websites Accessible to the Blind Community
FALA 2010 "VI Jornadas en Tecnología del Habla" and II Iberian SLTech Workshop. November 2010
C. Vaquero, A. Ortega, J. A. Villalba, A. Miguel, E. Lleida
Confidence Measures for Speaker Segmentation and their Relation to Speaker Verification
Interspeech 2010, Makuhari (Japan). September 2010
C. Vaquero, A. Ortega, E. Lleida
Intra-session variability compensation for speaker segmentation
FALA 2010 "VI Jornadas en Tecnología del Habla" and II Iberian SLTech Workshop. November 2010
J. E. García, A. Ortega, A. Miguel, E. Lleida
Non-Linear Predictive Vector Quantization of Feature Vectors for Distributed Speech Recognition
Interspeech 2010, Makuhari (Japan). September 2010
J. E. García, A. Ortega, A. Miguel, E. Lleida
Predictive vector quantization using the M-algorithm for distributed speech recognition
FALA 2010 "VI Jornadas en Tecnología del Habla" and II Iberian SLTech Workshop. November 2010
D. Becerril, O. Saz, C. Vaquero, A. Ortega, E. Lleida
Speaker Tree Generation for Model Selection in Automatic Speech Recognition
FALA 2010 "VI Jornadas en Tecnología del Habla" and II Iberian SLTech Workshop. November 2010
D. Castán, A. Ortega, E. Lleida
Speech/Music classification by using the C4.5 decision tree algorithm
FALA 2010 "VI Jornadas en Tecnología del Habla" and II Iberian SLTech Workshop. November 2010

2009

Conferences

J.E. García, A. Ortega, E. Lleida, T. Lozano, E. Bernués, D. Sánchez
Audio and Text Synchronization for TV news Subtitling based on Automatic Speech Recognition
IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB). May 2009
J.E. García, A. Ortega, A. Miguel, E. Lleida
Differential Vector Quantization of Feature Vectors for Distributed Speech Recognition
10th Annual Conference of the International Speech Communication Association. INTERSPEECH 2009. September 2009
A. Miguel, A. Ortega, L. Buera, E. Lleida
Graphical Models for Discrete Hidden Markov Models in Speech Recognition
10th Annual Conference of the International Speech Communication Association. INTERSPEECH 2009. September 2009
A. Miguel, A. Ortega, L. Buera, E. Lleida
Local Projections and Support Vector Based Feature Selection in Speech Recognition
10th Annual Conference of the International Speech Communication Association. INTERSPEECH 2009. September 2009
A. Ortega, J.E. García, A. Miguel, E. Lleida
Real-Time Live Broadcast News Subtitling System for Spanish
10th Annual Conference of the International Speech Communication Association. INTERSPEECH 2009. September 2009
A. Ortega, J.E. García, E. Lleida, E. Bernués, D. Sánchez, M. Ferrer
Subtitulado en Tiempo Real de Informativos en Directo para la Televisión Mediante Reconocimiento Automático del Habla
IV Congreso de Accesibilidad a los Medios Audiovisuales para Personas con Discapacidad. AMADIS 09. June 2009
L. Buera, A. Miguel, A. Ortega, E. Lleida, R. M. Stern
Unsupervised Training Scheme with Non-Stereo Data for Empirical Feature Vector Compensation
10th Annual Conference of the International Speech Communication Association. INTERSPEECH 2009. September 2009

Book Chapters

L. Buera, A. Miguel, E. Lleida, A. Ortega, O. Saz
Cross-Probability Model Based on GMM for Feature Vector Normalization
Chapter 14 in "In-Vehicle Corpus and Signal Processing for Driver Behavior" H. Abut, J.H.L. Hansen, Hakan Erdogan and K. Takeda (Eds.), Springer Science, New York, NY. 2009

2008

Journal Papers

A. Miguel, E. Lleida, R. Rose, L. Buera, O. Saz, A. Ortega
Capturing local variability for speaker normalization in speech recognition
IEEE Trans. on Audio, Speech and Language Processing, Vol. 16, No. 3, pp. 578-593. March 2008

Conferences

J.E. García, A. Ortega, A. Miguel, E. Lleida
Arquitectura distribuida para el desarrollo de sistemas de diálogo hablado, edecán
V Jornadas en Tecnología del Habla. November 2008
J. E. García, A. Ortega, A. Miguel, E. Lleida
Cuantificación vectorial diferencial para la transmisión eficiente de parámetros acústicos en sistemas de reconocimiento automático del habla distribuido
V Jornadas en Tecnología del Habla. November 2008
J. A. Villalba, C. Vaquero, E. Lleida, A. Ortega, A. Miguel, J. E. García, L. Buera, O. Saz
Experiencia del I3A en la Evaluación de Reconocimiento de Locutor NIST 2008
Jornadas de Reconocimiento Biométrico de Personas, Valladolid, España. September 2008
L. Buera, A. Miguel, O. Saz, A. Ortega, E. Lleida
Feature Vector Normalization with Combined Standard and Throat Microphones for Robust ASR
Interspeech 2008. September 2008
A. Miguel, E. Lleida, A. Ortega
Generalized gaussians for continuous observation distributions in speech recognition
V Jornadas en Tecnología del Habla. November 2008
A. Miguel, E. Lleida, A. Ortega
Graphical models for discrete observation distributions in speech recognition
V Jornadas en Tecnología del Habla. November 2008
J.E. García, A. Ortega, A. Miguel, E. Lleida
Sistema de reconocimiento automático del habla distribuido aplicado a entornos logísticos
V Jornadas en Tecnología del Habla. November 2008

2007

Journal Papers

L. Buera, E. Lleida, A. Miguel, A. Ortega, O. Saz
Cepstral Vector Normalization Based on Stereo Data for Robust Speech Recognition
IEEE Trans. on Audio, Speech and Language Processing, vol. 15, pp. 1098-1113. March 2007

Conferences

L. Buera, A. Miguel, E. Lleida, A. Ortega, O. Saz
Cross-Probability Model based on GMM for Feature Vector Normalization in Car Environments
Biennial on DSP for in-Vehicle and Mobile Systems, Istanbul, Turkey. June 2007
P. García, A. Hernández, J. P. Martínez, I. Martinez, E. Mayordomo, A. Ortega, I. Salinas, J. R. Solera, L. Vicente
Distribución de la Carga Discente: Estudio sobre las titulaciones del Centro Politécnico Superior de la Universidad de Zaragoza.
II Jornadas de Innovación Educativa de la Escuela Politécnica Superior de Zamora. June 2007
L. Buera, A. Miguel, O. Saz, E. Lleida, A. Ortega
Evaluation of the Combined Use of MEMLIN and MLLR on the Non-native Adaptation Task of Hiwire Project Database
Interspeech. August 2007
L. Buera, A. Miguel, E. Lleida, O. Saz, A. Ortega
On the Jointly Unsupervised Feature Vector Normalization and Acoustic Model Compensation for Robust Speech Recognition
Interspeech. August 2007
A. Miguel, L. Buera, E. Lleida, A. Ortega, O. Saz
On-Line Feature and Acoustic Model Space Compensation for Robust Speech Recognition in Car Environment
IEEE Intelligent Vehicles Symposium. June 2007
L. Buera, A. Miguel, E. Lleida, O. Saz, A. Ortega
Robust Speech Recognition with on-line Unsupervised Acoustic Feature
IEEE Automatic Speech Recognition and Understanding Workshop, ASRU, December 2007

2006

Conferences

A. Uría, A. Ortega, M. I. Torres, A. Miguel, V. Guijarrubia, L. Buera, J. Garmendia, E. Lleida, O. Aizpuru, O. Saz
A virtual butler controlled by speech
IV Jornadas en Tecnología del Habla, Zaragoza, Spain. November 2006
J. P. Martínez, A. Ortega, A. Hernández, I. Salinas, P. García, L. Vicente, I. Martinez, J. Fernández
Estudio de los perfiles y competencias profesionales en la titulación de Ingeniería de Telecomunicación
IV Congreso Internacional de Docencia Universitaria e Innovación (CIDUI), pp.535 (ISBN: 84-8458-244-4), Barcelona (Spain). July 2006
P. García, J. P. Martínez, E. Mayordomo, A. Ortega, I. Salinas, J. R. Solera, L. Vicente
Estudio sobre la carga de trabajo del estudiante en las titulaciones del Centro Politécnico Superior
I Jornadas de Innovación Docente, Tecnologías de la Información y la Comunicación e Investigación Educativa en la Universidad de Zaragoza. November 2006
J. P. Martínez, A. Ortega, A. Hernández, I. Salinas, P. García, L. Vicente, I. Martinez, J. Fernández
Evaluación de la carga discente de la titulación de Ingeniería de Telecomunicación: asignación de créditos ECTS
IV Congreso Internacional de Docencia Universitaria e Innovación (CIDUI), pp. 288 (ISBN: 84-8458-244-4), Barcelona (Spain). July 2006
A. Miguel, E. Lleida, A. Juan, L. Buera, A. Ortega, O. Saz
Local Transformation Models for Speech Recognition
Interspeech - International Conference on Spoken Language Processing, ICSLP, Pittsburgh, USA, pp. 1598-1601. September 2006
A. Ortega, E. Lleida, E. J. Masgrau, L. Buera, A. Miguel
Stability Control in a Two-Channel Speech Reinforcement System for Vehicles
International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006). May 2006
O. Saz, A. Miguel, E. Lleida, A. Ortega, L. Buera
Study of Time and Frequency Variability in Pathological Speech and Error Reduction Methods for Automatic Speech Recognition
International Conference on Spoken Language Processing, (ICSLP 2006). September 2006
L. Buera, E. Lleida, A. Miguel, A. Ortega, O. Saz
Time-dependent Cross-Probability Model for Feature Vector Normalization
IV Jornadas en Tecnología del Habla, Zaragoza, Spain. November 2006
L. Buera, E. Lleida, J. A. Nolazco-Flores, A. Miguel, A. Ortega
Time-dependent cross-probability model for Multi-Environment Model based LInear Normalization
International Conference on Spoken Language Processing, (ICSLP 2006). September 2006
L. Buera, E. Lleida, J. D. Rosas, J. Villalba, A. Miguel, A. Ortega, O. Saz
Verificación e Identificación de Locutor con Normalización de Vectores de Características en Entornos Acústicos Adversos
Terceras Jornadas de Reconocimiento Biométrico de Personas, Sevilla, Spain. November 2006

Book Chapters

A. Ortega, E. Lleida, E. Masgrau, L. Buera, A. Miguel
Acoustic Echo Reduction in a Two-Channel Speech Reinforcement System for Vehicles
Chapter 15 in "Digital Signal Processing for In-Vehicle and Mobile Systems 2", H. Abut, J.H.L. Hansen and K. Takeda (Eds.), Springer Science, New York, NY. May 2006

2005

Journal Papers

P. García, J. de Mingo, A. Valdovinos, A. Ortega
An Adaptive digital method of imbalances cancellation in LINC transmitters
IEEE Transactions on Vehicular Technology, vol. 54, no. 3, pp. 879-888. May 2005
P. García, A. Ortega, J. de Mingo, A. Valdovinos
Nonlinear Distortion Cancellation using LINC Transmitters in OFDM Systems
IEEE Transactions on Broadcasting, vol. 51, no. 1, pp. 84-93. March 2005
A. Ortega, E. Lleida, E. Masgrau
Speech reinforcement system for car cabin communications
IEEE Transactions on Speech and Audio Processing, vol. 13, no. 5, pp. 917-929. September 2005

Conferences

A. Ortega, E. Lleida, E. Masgrau, L. Buera, A. Miguel
Acoustic Echo Reduction in a Two-Channel Speech Reinforcement System for Vehicles
Biennial on DSP for in-Vehicle and Mobile Systems, Sesimbra, Portugal. September 2-3, 2005
A. Ortega, E. Lleida, E. Masgrau, L. Buera, A. Miguel
Acoustic Feedback Cancellation in Speech Reinforcement System for Vehicles
Interspeech'2005 - Eurospeech — 9th European Conference on Speech Communication and Technology. September 2005
A. Miguel, E. Lleida, R. Rose, L. Buera, A. Ortega
Augmented State Space Acoustic Decoding for Modeling Local Variability in Speech
Interspeech'2005 - Eurospeech — 9th European Conference on Speech Communication and Technology. September 2005
L. Buera, E. Lleida, A. Miguel, A. Ortega
Multi-Environment Linear Normalization for robust speech analysis in cars
Biennial on DSP for in-Vehicle and Mobile Systems, Sesimbra, Portugal. September 2005
L. Buera, E. Lleida, A. Miguel, A. Ortega
Recent Advances in PD-MEMLIN for Speech Recognition in Car Conditions
IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2005. San Juan, Puerto Rico. November 2005
L. Buera, E. Lleida, A. Miguel, A. Ortega
Robust Speech Recognition in Cars using Phoneme Dependent Multi-Environment Linear Normalization
Interspeech'2005 - Eurospeech — 9th European Conference on Speech Communication and Technology. September 2005
L. Buera, E. Lleida, J. D. Rosas, J. Villalba, A. Miguel, A. Ortega, O. Saz
Speaker verification and identification using Phoneme Dependent Multi-Environment Models based LInear Normalization in adverse and dynamic acoustic environments
In Proc. "Summer School for Advanced studies on Biometrics for Secure Authentication: Multimodality and System Integration", Alghero, Italy. June 2005
E. Masgrau, A. Ortega, P. Ramos, L. Vicente, E. Lleida
Tratamiento Robusto del Sonido en el Interior de Vehículos
XX Simposium Nacional de la Unión Científica Internacional de Radio URSI 2005. Gandía (Valencia). September 2005

2004

Conferences

P. García, J. de Mingo, A. Valdovinos, A. Ortega
A Novel Digital Imbalances Cancellation Method in LINC Transmitters
The Seventh International Symposium on Wireless Personal Multimedia Communications. September 2004
P. García, J. de Mingo, A. Valdovinos, A. Ortega
Adaptive Digital Correction of Gain and Phase Imbalances in LINC Transmitters
2004 IEEE 59th Vehicular Technology Conference, VTC2004-Spring. Milan (Italy), CD Proceedings. May 2004
P. García, J. de Mingo, A. Valdovinos, A. Ortega
Adaptive Imbalances Correction in LINC Transmitters
The 5th European Wireless Conference: Mobile and Wireless Systems beyond 3G, Barcelona (Spain), Proceedings, pp. 235-239. February 2004
O. Saz, L. Buera, E. Lleida, A. Miguel, A. Ortega
Algoritmos de Compensación de Características Cepstrales para Reconocimiento Automático del Habla Robusto
III Jornadas en Tecnologías del Habla, Valencia, Spain. November 2004
A. Ortega, F. Sukno, E. Lleida, A. Frangi, A. Miguel, L. Buera
AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition
4th International Conference on Language Resources and Evaluation, Lisbon, Portugal. May 2004
L. Buera, E. Lleida, A. Miguel, A. Ortega, O. Saz
Avances en la Normalización Cepstral con Señal Estéreo para el Reconocimiento Robusto de Voz en el Entorno del Vehículo
III Jornadas de Tecnologías del Habla, Valencia, Spain. November 2004
L. Buera, E. Lleida, A. Ortega, A. Miguel, O. Saz
Avances en la Normalización Cepstral con Señal Estéreo para el Reconocimiento Robusto de Voz en el Entorno del Vehículo
III Jornadas en Tecnología del Habla. November 2004
A. Ortega, F. Sukno, E. Lleida, A. Frangi, A. Miguel, L. Buera, E. Zacur
Base de Datos Audiovisual y Multicanal en Castellano para Reconocimiento Automático del Habla Multimodal en el Automóvil
III Jornadas en Tecnología del Habla. November 2004
A. Miguel, Richard, E. Lleida, L. Buera, A. Ortega, O. Saz
Decodificador Eficiente para Normalización del Tracto Vocal en Reconocimiento Automático del Habla en Tiempo Real
III Jornadas en Tecnología del Habla. November 2004
L. Buera, E. Lleida, A. Miguel, A. Ortega
Multi-environment models based linear normalization for robust speech recognition
Proceedings of the International Conference "Speech and Computer", SPECOM-2004, St. Petersburg, Russia. September 2004
L. Buera, E. Lleida, A. Miguel, A. Ortega
Multi-Environments Model Based Linear Normalization for speech recognition in Car Conditions
Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP-2004, Montreal, Canada. May 2004
P. García, A. Ortega, J. de Mingo, A. Valdovinos
Nonlinear Distortion Cancellation in OFDM Systems using an Adaptive LINC Structure
2004 15th International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC 2004). September 2004
E. A. Viruete, C. Hernández, J. Ruiz, J. Fernández, A. Alesanco, E. Lleida, A. Ortega, A. Hernández, A. Valdovinos, J. García
Sistema de telemonitorización en vehículos de emergencias médicas sobre UMTS
Proceedings of the XXII Congreso Anual de la Sociedad Española de Ingeniería Biomédica CASEIB 2004, Santiago de Compostela, pp. 111-114. November 2004

2003

Conferences

P. García, J. de Mingo, A. Valdovinos, A. Ortega
Método adaptativo para el equilibrio de las ramas de un transmisor LINC
XVIII Simposium Nacional de U.R.S.I. (International Union of Radio Science). Proceedings: CD-ROM, ISBN 84-9749-081-9. September 2003
A. Ortega, E. Lleida, E. Masgrau
Residual Echo Power Estimation for Speech Reinforcement Systems in Vehicles
Proceedings of EUROSPEECH’03. Geneva (Switzerland). September 2003

2002

Conferences

A. Ortega, E. Lleida, E. Masgrau, F. Gallego
Cabin car communication system to improve communication inside a car
Proceedings of IEEE Int. Conf. on Acoustics, Speech and Signal Processing ICASSP'02. Orlando (USA), vol. 4, pp. 386-389. May 2002
A. Ortega, E. Lleida, E. Masgrau
DSP to improve oral communications inside vehicles
Proceedings of European Signal Processing Conference EUSIPCO'02. Toulouse (France). September 2002
E. Lleida, E. Masgrau, A. Ortega, A. Miguel, L. Buera
Reconocimiento Automático del Habla en vehículos, resultados con Speech-Dat Car
Speech-Dat Car database results. December 2002
E. Lleida, E. Masgrau, A. Ortega, A. Miguel
Reconocimiento Automático del Habla en Vehículos, Resultados con Speech-Dat Car
Proceedings of the Jornadas en Tecnologías del Habla. Granada, 2002
A. Ortega, E. Lleida, E. Masgrau
Speech reinforcement inside vehicles
Proceedings of the 21st International Conference of the Audio Engineering Society. AES 2002. St. Petersburg (Russia). pp. 91-99. June 2002

2001

Conferences

E. Lleida, E. J. Masgrau, A. Ortega
Acoustic Echo Control and Noise Reduction for Cabin Car Communication
Proceedings of European Conference on Speech Communication and Technology EUROSPEECH'01. Aalborg (Denmark), vol. 3, pp. 1585-1588. September 2001
A. Ortega, E. Lleida, E. Masgrau
Sistema de Comunicación oral para el interior de automóviles
Proceedings of the Simposium Nacional de la Unión Internacional de Radio URSI 01. Madrid, pp. 541-542. September 2001

2000

Conferences

A. Ortega, E. J. Masgrau, E. Lleida
Control activo de ruido con ecualización del camino secundario
Proceedings of the Simposium Nacional de la Unión Internacional de Radio URSI 00. Zaragoza, pp. 55-56. September 2000