ABSTRACT
The number of speech interfaces and services made available through them continue to grow. This has opened up interactions to people who rely on speech as a critical modality for interacting with systems. However, people with diverse speech patterns such as those who stammer are at risk of being negatively affected or excluded from speech interface interaction. In this paper, we consider what an inclusive speech interface future may look like for people who stammer. In doing so, we identify three key challenges: (1) developing effective speech recognition, (2) understanding the user experiences of people who stammer and (3) supporting speech interfaces designers through appropriate heuristics. We believe the interdisciplinary and cross-community strengths of venues like CUI are well positioned to address these challenges going forward.
- Ali Abdolrahmani, Ravi Kuber, and Stacy M Branham. 2018. "Siri Talks at You" An Empirical Investigation of Voice-Activated Personal Assistant (VAPA) Usage by Individuals Who Are Blind. In Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility. 249--258.Google ScholarDigital Library
- Amazon. 2020. What Do the Lights on Your Echo Device Mean? https://www.amazon.com/gp/help/customer/display.html?nodeId=GKLDRFT7FP4FZE56 Accessed 23rd Feb 2020.Google Scholar
- Mohamed Benzeghiba, Renato De Mori, Olivier Deroo, Stephane Dupont, Teodora Erbes, Denis Jouvet, Luciano Fissore, Pietro Laface, Alfred Mertins, Christophe Ris, et al. 2007. Automatic speech recognition and speech variability: A review. Speech communication 49, 10-11 (2007), 763--786.Google Scholar
- Robin N. Brewer, Leah Findlater, Joseph "Jofish" Kaye, Walter Lasecki, Cosmin Munteanu, and Astrid Weber. 2018. Accessible Voice Interfaces. In Companion of the 2018 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '18). Association for Computing Machinery, New York, NY, USA, 441--446. https://doi.org/10.1145/3272973.3273006Google ScholarDigital Library
- Leigh Clark, Philip Doyle, Diego Garaialde, Emer Gilmartin, Stephan Schlögl, Jens Edlund, Matthew Aylett, João Cabral, Cosmin Munteanu, Justin Edwards, et al. 2019. The State of Speech in HCI: Trends, Themes and Challenges. Interacting with Computers 31, 4 (2019), 349--371.Google ScholarCross Ref
- Leigh Clark, Nadia Pantidi, Orla Cooney, Philip Doyle, Diego Garaialde, Justin Edwards, Brendan Spillane, Emer Gilmartin, Christine Murad, Cosmin Munteanu, et al. 2019. What makes a good conversation? challenges in designing truly conversational agents. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. 1--12.Google ScholarDigital Library
- Eric Corbett and Astrid Weber. 2016. What can I say? addressing user experience challenges of a mobile voice user interface for accessibility. In Proceedings of the 18th International Conference on Human-Computer Interaction with Mobile Devices and Services. 72--82.Google ScholarDigital Library
- Benjamin R Cowan, Nadia Pantidi, David Coyle, Kellie Morrissey, Peter Clarke, Sara Al-Shehri, David Earley, and Natasha Bandeira. 2017. "What Can I Help You With?": Infrequent Users' Experiences of Intelligent Personal Assistants. Proceedings of the 19th International Conference on Human-Computer Interaction with Mobile Devices and Services - MobileHCI '17 (2017), 1--12. https://doi.org/10.1145/3098279.3098539Google ScholarDigital Library
- Isobel Crichton-Smith. 2002. Communicating in the real world: Accounts from people who stammer. Journal of fluency disorders 27, 4 (2002), 333--352.Google ScholarCross Ref
- Joel E Fischer, Stuart Reeves, Martin Porcheron, and Rein Ove Sikveland. 2019. Progressivity for voice interface design. In Proceedings of the 1st International Conference on Conversational User Interfaces. 1--8.Google ScholarDigital Library
- Rosemarie Hayhow, Anne Marie Cray, and Pam Enderby. 2002. Stammering and therapy views of people who stammer. Journal of fluency disorders 27, 1 (2002), 1--17.Google ScholarCross Ref
- Hyunhoon Jung, Hee Jae Kim, Seongeun So, Jinjoong Kim, and Changhoon Oh. 2019. TurtleTalk: an educational programming game for children with voice user interface. In Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems. 1--6.Google ScholarDigital Library
- Andrew L Kun et al. 2018. Human-machine interaction for vehicles: Review and outlook. Foundations and Trends® in Human-Computer Interaction 11, 4 (2018), 201--293.Google Scholar
- Liliana Laranjo, Adam G Dunn, Huong Ly Tong, Ahmet Baki Kocaballi, Jessica Chen, Rabia Bashir, Didi Surian, Blanca Gallego, Farah Magrabi, Annie YS Lau, et al. 2018. Conversational agents in healthcare: a systematic review. Journal of the American Medical Informatics Association 25, 9 (2018), 1248--1258.Google ScholarCross Ref
- Qi Li, Jinsong Zheng, Augustine Tsai, and Qiru Zhou. 2002. Robust endpoint detection and energy normalization for real-time speech and speaker recognition. IEEE Transactions on Speech and Audio Processing 10, 3 (2002), 146--157.Google ScholarCross Ref
- Ewa Luger and Abigail Sellen. 2016. "Like Having a Really Bad PA": The Gulf between User Expectation and Experience of Conversational Agents. Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems - CHI '16 (2016), 5286--5297. https://doi.org/10.1145/2858036.2858288Google ScholarDigital Library
- Roisin McNaney, Christopher Bull, Lynne Mackie, Floriane Dahman, Helen Stringer, Dan Richardson, and Daniel Welsh. 2018. StammerApp: Designing a Mobile Application to Support Self-Reflection and Goal Setting for People Who Stammer. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. 1--12.Google ScholarDigital Library
- Meredith Moore, Hemanth Venkateswara, and Sethuraman Panchanathan. 2018. Whistle-blowing ASRs: Evaluating the need for more inclusive automatic speech recognition systems. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2018. 466--470.Google ScholarCross Ref
- Roger K Moore. 2017. Is spoken language all-or-nothing? Implications for future speech-based human-machine interaction. In Dialogues with Social Robots. Springer, 281--291.Google Scholar
- Christine Murad, Cosmin Munteanu, Benjamin R Cowan, and Leigh Clark. 2019. Revolution or Evolution? Speech Interaction and HCI Design Guidelines. IEEE Pervasive Computing 18, 2 (2019), 33--45.Google ScholarDigital Library
- Christi Olson and Kelli Kemery. 2019. Voice report: From answers to action: customer adoption of voice technology and digital assistants. Microsoft Search and Market Intelligence, Tech. Rep (2019).Google Scholar
- World Health Organization. 2010. ICD-10 Version:2010. http://apps.who.int/classifications/icd10/browse/2010/en#/F98.5 Accessed 23rd Feb 2020.Google Scholar
- Rebecca Palmer, Pam Enderby, and Mark Hawley. 2007. Addressing the needs of speakers with longstanding dysarthria: computerized and traditional therapy compared. International journal of language & communication disorders 42, S1 (2007), 61--79.Google ScholarCross Ref
- Martin Porcheron, Joel E Fischer, Stuart Reeves, and Sarah Sharples. 2018. Voice interfaces in everyday life. In proceedings of the 2018 CHI conference on human factors in computing systems. 1--12.Google ScholarDigital Library
- Alisha Pradhan, Kanika Mehta, and Leah Findlater. 2018. "Accessibility Came by Accident" Use of Voice-Controlled Intelligent Personal Assistants by People with Disabilities. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. 1--13.Google ScholarDigital Library
- Juniper Research. 2018. Digital Voice Assistants in Use to Triple to 8 Billion by 2023, Driven by Smart Home Devices. shorturl.at/dgoGL. Accessed 22nd Feb 2020.Google Scholar
- Sergio Sayago, Barbara Barbosa Neves, and Benjamin R Cowan. 2019. Voice assistants and older people: some open issues. In Proceedings of the 1st International Conference on Conversational User Interfaces. 1--3.Google ScholarDigital Library
- Stamma. 2020. Stammer in the Population. https://stamma.org/news-features/stammering-population Accessed 23rd Feb 2020.Google Scholar
- Stamma. 2020. Talking With Someone Who Stammers. https://stamma.org/about-stammering/talking-someone-who-stammers Accessed 23rd Feb 2020.Google Scholar
Index Terms
- Speech diversity and speech interfaces: considering an inclusive future through stammering
Recommendations
Exploring Smart Speaker User Experience for People Who Stammer
ASSETS '22: Proceedings of the 24th International ACM SIGACCESS Conference on Computers and AccessibilitySpeech-enabled smart speakers are common devices used for numerous tasks in everyday life. While speech-enabled technologies are widespread, using one’s voice as a computing modality introduces new accessibility challenges for people with speech ...
The effect of single syllable silent reading and pantomime speech in varied syllable positions on stuttering frequency throughout utterance productions
Strategies used on single syllables increases fluency throughout an utterance.Speech rehearsal on initial, middle, and final syllables decreases stuttering.Strategies increasing neuromotor planning demonstrate increased effectiveness.Strategies used ...
Silent speech interfaces
The possibility of speech processing in the absence of an intelligible acoustic signal has given rise to the idea of a 'silent speech' interface, to be used as an aid for the speech-handicapped, or as part of a communications system operating in silence-...
Comments