Interacting with Global Content

This research activity enables people to interact with digital content in ways that are more intuitive and that mimic the richness of human perception. It goes beyond text- and speech-based exchange of content to full multimodal interfaces that interpret information from a multitude of audio and visual cues. By furthering both the understanding and the automatic analysis of how humans interact with digital content and with each other, we are transforming the retrieval, understanding, and delivery of multimodal content for users.

To drive a more complete understanding of multimodal interaction, both between humans and between humans and digital content, we build automatic systems that track human engagement and affective response and judge how best to retrieve and render responsive content for the user.

Publications

A randomized controlled trial of an internet-delivered treatment: Its potential as a low-intensity community intervention for adults with symptoms of depression

  • Posted: 31 Mar 2016
  • Author: D. Richards, L. Timulak, N. Vigano, E. O'Brien, G. Doherty, J. Sharry, C. Hayes

Quantifying difference in vocalizations of bird populations

  • Posted: 9 Jun 2015
  • Author: Naomi Harte, Colm O'Reilly, Nicola M. Marples, David J. Kelly
  • Publication: INTERSPEECH 2015
Conference

Speech Rate Calculations with Short Utterances: A Study from a Speech-to-Speech, Machine Translation Mediated Map Task

  • Posted: 7 May 2018
  • Author: Carl Vogel, Saturnino Luz, Akira Hayakawa
  • Publication: LREC 2018 - 11th International Conference on Language Resources and Evaluation

Utilisation of Metadata Fields and Query Expansion in Cross-Lingual Search of User-Generated Internet Video

  • Posted: 27 Jan 2016
  • Author: G. J. F. Jones
  • Publication: Journal of Artificial Intelligence Research

Research Goals

New methods are being developed to process both speech-only and audio-visual data and to train statistical engines to infer attentional state. The ability to track user engagement and interest in conversational interaction is key to reproducing natural interactions in the future, whether with a robot, personal assistant or avatar. The research explores what makes an avatar, or computer-generated speaker, engaging to a user. It uniquely combines ADAPT expertise in expressive synthesis, the role of paralinguistic cues in speech, and avatar animation.

The research also addresses multimodal content relevant to interaction from two perspectives. The first addresses the challenges of locating and isolating objects of interest in a visual stream and of exploiting visual cues in speech to augment speech recognition capabilities; techniques for learning from unstructured multimodal data streams are also examined. The second establishes methods for exploiting dialogue in user interaction in information retrieval, and for exploiting context to enable proactive information retrieval.
