Study and define a profile based on user pathologies and case scenarios.
UNIZAR, M1 to M18
WP1 will build the user model through the
definition of a user profile including information about the person's cognitive level, sensorial and physical
disabilities. The user profile will be filled initially during the user registration, through a set of simple tests
in order to gather information about the three axes (cognitive, sensorial and physical). As the user's
limitations have a great impact on the interaction, the user profile must be adapted continuously to the
user's impairments and attempt to define how the platform should react to the user inputs and how to
display the output messages in the most comprehensible way for the user.
Multimodal interaction based on speech, silent speech, gesture and tactile/haptic devices.
MSFT, M3 to M28
This WP will provide a set of natural interfaces to be used in a multimodal way.
MSFT, leader of this WP, will develop a multimodal silent speech interface in collaboration with
UAVR based on Visual Speech Recognition. MSFT will also explore the use of gesture as an
alternative modality for Portuguese. METU will provide mentoring support for denoising and source
localization for higher quality speech recordings to enhance the performance of speech recognition and speaker identification. METU will also explore the integration of tactile/haptic devices as alternative or parallel interfaces. UAVR, based on its wide experience on the subject will provide valuable information about the Human Speech production model using technologies such as Real-time Magnetic Resonance Imaging, encompassing data acquisition and data processing and analysis, paving the way for the development of robust speech and silent speech interfaces by providing data for articulatory speech characterization and for validation and improved understanding of different data acquisition modalities such as Electromyography (CITE). UNIZAR will provide acoustic confidence measures for acoustic model adaptation for users with speech impairments.
System output based on animated characters, pictograms and Personalized Synthetic Voices.
FIM, M3 to M28
This WP will provide a Visual Speech Synthesis platform. FIM will be responsible for an
automatic method of facial animation playback of human and animated characters based in speech
recognition and for the integration of this system into the IRIS platform. In collaboration with UAVR and
MSFT they will improve their system with a coarticulation model and personalized synthetic voices. MSFT
will develop a platform for creating personalized synthetic voices (a synthetic voice based on the user voice
font). This way the user will have the possibility of using a familiar (or even his own) voice to use the
system. MSFT will also explore the creation of personalized synthetic voices for users in risk of losing their
ability to produce audible speech in the future. MSFT participation will be complemented by the
development of a coarticulation model for the Visual Speech Synthesis system built by FIM. This will be
performed in collaboration with UAVR and will profit from existing experience concerning the articulatory
characterization of European Portuguese. METU will explore the application of Spatial Audio Object Coding
(SAOC) to the synchronization of audio-visual objects, in particular between the animated characters
(developed by FIM) and speech. METU will provide mentoring on GPU units. UNIZAR will be responsible for
introducing the pictograms as system output for those users requiring this communication channel.
Experiments with Pictograms:
Fusion of modalities and Interaction Management.
UAVR, M12 to M32
The work carried out
includes careful analysis of what modalities to fuse, when and how, in order to provide adequate response
to users' goals and context, striving for additional robustness in situations, such as noisy environments, or
where privacy issues and existing disabilities might hinder single modality interaction. The envisaged fusion
engine should not provide a generic/versatile approach to the fusion problem but a very much focused
approach based on the modalities used and defined application scenarios aiming for simplicity.
Authentication and Authorization based on biometrics.
UNIZAR, M6 to M28
In the recent years a great advance has been achieved in the field of personal identification based on voice
and multimodal biometrics. Nowadays, the use of natural ways of interacting with electronic devices must
also include useful and easy methods to verify the identity of each one of the allowed users of this kind of
platforms in order to provide the required access security to personal information and privacy.
The users profiles
established according to the tasks carried out in WP1 will be used to present to the user the most
appropriate authentication method given his/her special needs (speech impairments, age, etc.). In addition
to this, the enrolment phase of the biometric authentication method must be adapted for each user profile
to make it simple, avoiding stress and fatigue. The automatic detection of speech and physical impairments
systems developed WP1 will allow to develop tailored authentication methods for each
impairment.
Development of an ambient assisted living platform for indoor scenarios.
MSFT, M12 to M36
In this work package the goal is to provide a stable platform that integrates and represents the work
performed in the previous WPs. The platform will be deployed in at least two partners. This WP will not
only be about integration but will also aim at transferring the knowledge acquired so far among the
partners.
Framework for objective evaluation including biological and psychophysical measures.
METU, M24 to M40
The development of a framework for an objective evaluation is the backbone of the research-based,
systematic approach to the evaluation of interfaces within the framework of Human Computer Interaction
and cognitive science research. The platform that was developed in the previous work packages will
provide the input for this work package.
First, in a series of
subjective experiments in the lab environment, it will explore the development of a set of appropriate noninvasive
biological and psychophysical measures, including eye tracking measures that aim to measure the
attention of and the cognitive load on the user during the course of interaction with the interface, and the
fNIR (functional near-infrared) spectroscopy for the purpose of functional neuroimaging, aiming at
measuring the brain activity, in particular cognitive load on the user through hemodynamic responses
associated with neural behavior. Interaction analysis will be used as the complementary methodology for
the development of the measures. Second, based on the user profile definition developed in WP1, we
will explore the development of Quality of Experience (QoE) models using the objective metrics in order to
predict QoE in the future without further subjective experiments, and the development of the methods for
creating the mappings between user profiles, preferences and QoE and carrying out network analysis to
determine subgroups of users according to the similarities of their QoEs.
Usability evaluation in virtual and real world indoor scenarios.
UAVR, M6 to 46
This WP must, first of all, result in a set of
recommendations (guidelines) extracted from the relevant literature concerning usability in multimodal
interfaces design which the remaining partners can use as a reference when conducting their design and
development tasks. The evaluation scenarios and features to evaluate should then be identified and the
evaluation protocols defined, tested and applied. Considering the unpredictable nature of some of the
solutions proposed on the other WPs, evaluation scenarios and protocols might need readjustments along
the way. The framework that will be provided after WP7 will allow gathering objective data to characterize
user attention, reactions and cognitive load and will support its integration in the usability studies.
HCI experts participating in the project will prepare satisfaction questionnaires and conduct the reviews
based on performance analyses, in particular, efficiency and effectivity that will be measured through
reaction times, gesture analysis where applicable, and eye tracking. The eye tracking data will be collected
by a non-intrusive eye tracker. Where applicable, the brain imaging data will be collected by fNIRS optical
imaging facility, which is also a non-intrusive data collection equipment. The experimental investigations
will aim at developing a Quality of Experience (QoE) model based on objective measures.
All feedback will be collected through questionnaires and non-invasive technologies such
as eye-tracking developed and described in WP7.
The participants profile and their level of impairment is
going to be defined in WP1 and it is foreseen to have a psychologist reviewing the experimental protocols
and monitoring these studies. Regular pauses and short duration sessions, for example, will be considered
whenever deemed necessary, so that no mental/physical stress is caused on participants. This is also
valuable, in the methodological point of view, since a relaxed and stress free participant is a very important
condition to attain better evaluation results. The conducted studies will also be previously approved by
ethical committees of each country and comply with national and EU legislation, and FP7 rules.
Dissemination and exploitation.
MSFT, M1 to 48
The strategy for the dissemination will be based on five main areas:
-
Publications in prestigious journals and international conferences, in the fields of Human
Computer Interaction, Speech, Bio-signals, and Signal Processing. MSFT plans to submit at least one paper to the "Futures, Microsoft Innovation Magazine", widely distributed in Europe, providing an overview of the achieved innovation and breakthroughs, as well as reporting the results of exchange of scientists and engineers across the project partners.
- Public Media releases: semestral newsletter and project dissemination in the public media (TV, newspapers and magazines).
- Project website to keep track of the progress and with a public section for helping disseminating the results and to report general progress.
- Organization of workshops and courses, to share knowledge and experiences with all members of the consortium, as well as external researchers, in order to ensure the adequate dissemination of IRIS project. Some of them will be restricted to the consortium and other ones open to anyone interested.
- Seminars on complementary skills, in order to guarantee a transfer of culture which will improve the personal skills of IRIS staff.
Country |
Activity |
Description |
Audience |
Indented Date |
Portugal |
UAVR Journal and Newsletter |
Dissemination of relevant content from IRIS in the UAVR online journal and newsletter. Dissemination also performed to the mass media (e.g., Lusa, the Portuguese news agency) |
Academia and general public |
Whenever relevant |
University Open Day |
UAVR opens doors to students across the country and organizes activities to show its research outputs including the different outcomes from IRIS. |
General public (strong incidence in students of all ages) |
Every year after the first |
IRIS Demonstration Event |
Allow the general public to experience the different applications proposed by IRIS and collect informal feedback. |
General public (with some focus on elderly) |
Close to the end of IRIS |
E-newsletter |
A semestral e-newsletter will be published online with the news and advances achieved during that time. |
General public |
Every 6 months |
MSFT Open Day |
MSFT opens door to students across the country to their premises. In these tours a demonstration of the multiple prototypes developed in IRIS will be included. |
National students of all ages |
Every year after the second |
Turkey |
Workshop Day |
Workshop for general public about the outcomes and possible implications of the project. |
Broadcasted through METU Webinar system and made available for all other universities |
Close to the end of IRIS |
Press release |
Dissemination of IRIS to the mass media via the METU Press Coordinator Office |
General public |
Whenever relevant |
METU Open Day |
In the open day, METU introduces research projects to national students of all ages. A demonstration of an IRIS prototype will be included into this activity. |
National students of all ages |
Close to the end of IRIS |
CEBIT Bilisim Eurasia Exposition, Istanbul |
A leading Eurasian IT, Technology and Communication Platform that brings ICT companies, government and media in the Eurasian region together in an exposition. |
General public |
Close to the end of IRIS |
Spain |
Engineering and Architecture week |
Spread to the society, especially to primary and secondary school students, but also to the general public (university or not) research and teaching activities.
Includes workshops, contests, 3D video projections, lectures, demonstrations, exhibitions, etc. |
General public (Kids to elder people and all the members of the university community) |
Every year after the first |
University open day and welcome day |
General dissemination mainly focus on engineering students and future students. |
General public with focus on students |
Every year after the first |
Public talks, public demonstrations and articles in newspapers |
Allow a general dissemination of IRIS to general public through radio and TV talks, demonstrations in trade fairs and collaboration in newspapers. |
General public |
>Whenever relevant |
Project Management.
MSFT, M1 to 48
This WP will ensure that all planned objectives are successfully met during the lifetime of the project. It includes: inception, specification and planning; project quality control; financial administration; coordination of partners.