skip to main content
10.1145/3405755.3406139acmotherconferencesArticle/Chapter ViewAbstractPublication PagescuiConference Proceedingsconference-collections
short-paper

Speech diversity and speech interfaces: considering an inclusive future through stammering

Published:22 July 2020Publication History

ABSTRACT

The number of speech interfaces and services made available through them continue to grow. This has opened up interactions to people who rely on speech as a critical modality for interacting with systems. However, people with diverse speech patterns such as those who stammer are at risk of being negatively affected or excluded from speech interface interaction. In this paper, we consider what an inclusive speech interface future may look like for people who stammer. In doing so, we identify three key challenges: (1) developing effective speech recognition, (2) understanding the user experiences of people who stammer and (3) supporting speech interfaces designers through appropriate heuristics. We believe the interdisciplinary and cross-community strengths of venues like CUI are well positioned to address these challenges going forward.

References

  1. Ali Abdolrahmani, Ravi Kuber, and Stacy M Branham. 2018. "Siri Talks at You" An Empirical Investigation of Voice-Activated Personal Assistant (VAPA) Usage by Individuals Who Are Blind. In Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility. 249--258.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Amazon. 2020. What Do the Lights on Your Echo Device Mean? https://www.amazon.com/gp/help/customer/display.html?nodeId=GKLDRFT7FP4FZE56 Accessed 23rd Feb 2020.Google ScholarGoogle Scholar
  3. Mohamed Benzeghiba, Renato De Mori, Olivier Deroo, Stephane Dupont, Teodora Erbes, Denis Jouvet, Luciano Fissore, Pietro Laface, Alfred Mertins, Christophe Ris, et al. 2007. Automatic speech recognition and speech variability: A review. Speech communication 49, 10-11 (2007), 763--786.Google ScholarGoogle Scholar
  4. Robin N. Brewer, Leah Findlater, Joseph "Jofish" Kaye, Walter Lasecki, Cosmin Munteanu, and Astrid Weber. 2018. Accessible Voice Interfaces. In Companion of the 2018 ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW '18). Association for Computing Machinery, New York, NY, USA, 441--446. https://doi.org/10.1145/3272973.3273006Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Leigh Clark, Philip Doyle, Diego Garaialde, Emer Gilmartin, Stephan Schlögl, Jens Edlund, Matthew Aylett, João Cabral, Cosmin Munteanu, Justin Edwards, et al. 2019. The State of Speech in HCI: Trends, Themes and Challenges. Interacting with Computers 31, 4 (2019), 349--371.Google ScholarGoogle ScholarCross RefCross Ref
  6. Leigh Clark, Nadia Pantidi, Orla Cooney, Philip Doyle, Diego Garaialde, Justin Edwards, Brendan Spillane, Emer Gilmartin, Christine Murad, Cosmin Munteanu, et al. 2019. What makes a good conversation? challenges in designing truly conversational agents. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. 1--12.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Eric Corbett and Astrid Weber. 2016. What can I say? addressing user experience challenges of a mobile voice user interface for accessibility. In Proceedings of the 18th International Conference on Human-Computer Interaction with Mobile Devices and Services. 72--82.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Benjamin R Cowan, Nadia Pantidi, David Coyle, Kellie Morrissey, Peter Clarke, Sara Al-Shehri, David Earley, and Natasha Bandeira. 2017. "What Can I Help You With?": Infrequent Users' Experiences of Intelligent Personal Assistants. Proceedings of the 19th International Conference on Human-Computer Interaction with Mobile Devices and Services - MobileHCI '17 (2017), 1--12. https://doi.org/10.1145/3098279.3098539Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Isobel Crichton-Smith. 2002. Communicating in the real world: Accounts from people who stammer. Journal of fluency disorders 27, 4 (2002), 333--352.Google ScholarGoogle ScholarCross RefCross Ref
  10. Joel E Fischer, Stuart Reeves, Martin Porcheron, and Rein Ove Sikveland. 2019. Progressivity for voice interface design. In Proceedings of the 1st International Conference on Conversational User Interfaces. 1--8.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Rosemarie Hayhow, Anne Marie Cray, and Pam Enderby. 2002. Stammering and therapy views of people who stammer. Journal of fluency disorders 27, 1 (2002), 1--17.Google ScholarGoogle ScholarCross RefCross Ref
  12. Hyunhoon Jung, Hee Jae Kim, Seongeun So, Jinjoong Kim, and Changhoon Oh. 2019. TurtleTalk: an educational programming game for children with voice user interface. In Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems. 1--6.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Andrew L Kun et al. 2018. Human-machine interaction for vehicles: Review and outlook. Foundations and Trends® in Human-Computer Interaction 11, 4 (2018), 201--293.Google ScholarGoogle Scholar
  14. Liliana Laranjo, Adam G Dunn, Huong Ly Tong, Ahmet Baki Kocaballi, Jessica Chen, Rabia Bashir, Didi Surian, Blanca Gallego, Farah Magrabi, Annie YS Lau, et al. 2018. Conversational agents in healthcare: a systematic review. Journal of the American Medical Informatics Association 25, 9 (2018), 1248--1258.Google ScholarGoogle ScholarCross RefCross Ref
  15. Qi Li, Jinsong Zheng, Augustine Tsai, and Qiru Zhou. 2002. Robust endpoint detection and energy normalization for real-time speech and speaker recognition. IEEE Transactions on Speech and Audio Processing 10, 3 (2002), 146--157.Google ScholarGoogle ScholarCross RefCross Ref
  16. Ewa Luger and Abigail Sellen. 2016. "Like Having a Really Bad PA": The Gulf between User Expectation and Experience of Conversational Agents. Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems - CHI '16 (2016), 5286--5297. https://doi.org/10.1145/2858036.2858288Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Roisin McNaney, Christopher Bull, Lynne Mackie, Floriane Dahman, Helen Stringer, Dan Richardson, and Daniel Welsh. 2018. StammerApp: Designing a Mobile Application to Support Self-Reflection and Goal Setting for People Who Stammer. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. 1--12.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Meredith Moore, Hemanth Venkateswara, and Sethuraman Panchanathan. 2018. Whistle-blowing ASRs: Evaluating the need for more inclusive automatic speech recognition systems. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol. 2018. 466--470.Google ScholarGoogle ScholarCross RefCross Ref
  19. Roger K Moore. 2017. Is spoken language all-or-nothing? Implications for future speech-based human-machine interaction. In Dialogues with Social Robots. Springer, 281--291.Google ScholarGoogle Scholar
  20. Christine Murad, Cosmin Munteanu, Benjamin R Cowan, and Leigh Clark. 2019. Revolution or Evolution? Speech Interaction and HCI Design Guidelines. IEEE Pervasive Computing 18, 2 (2019), 33--45.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Christi Olson and Kelli Kemery. 2019. Voice report: From answers to action: customer adoption of voice technology and digital assistants. Microsoft Search and Market Intelligence, Tech. Rep (2019).Google ScholarGoogle Scholar
  22. World Health Organization. 2010. ICD-10 Version:2010. http://apps.who.int/classifications/icd10/browse/2010/en#/F98.5 Accessed 23rd Feb 2020.Google ScholarGoogle Scholar
  23. Rebecca Palmer, Pam Enderby, and Mark Hawley. 2007. Addressing the needs of speakers with longstanding dysarthria: computerized and traditional therapy compared. International journal of language & communication disorders 42, S1 (2007), 61--79.Google ScholarGoogle ScholarCross RefCross Ref
  24. Martin Porcheron, Joel E Fischer, Stuart Reeves, and Sarah Sharples. 2018. Voice interfaces in everyday life. In proceedings of the 2018 CHI conference on human factors in computing systems. 1--12.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Alisha Pradhan, Kanika Mehta, and Leah Findlater. 2018. "Accessibility Came by Accident" Use of Voice-Controlled Intelligent Personal Assistants by People with Disabilities. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. 1--13.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Juniper Research. 2018. Digital Voice Assistants in Use to Triple to 8 Billion by 2023, Driven by Smart Home Devices. shorturl.at/dgoGL. Accessed 22nd Feb 2020.Google ScholarGoogle Scholar
  27. Sergio Sayago, Barbara Barbosa Neves, and Benjamin R Cowan. 2019. Voice assistants and older people: some open issues. In Proceedings of the 1st International Conference on Conversational User Interfaces. 1--3.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Stamma. 2020. Stammer in the Population. https://stamma.org/news-features/stammering-population Accessed 23rd Feb 2020.Google ScholarGoogle Scholar
  29. Stamma. 2020. Talking With Someone Who Stammers. https://stamma.org/about-stammering/talking-someone-who-stammers Accessed 23rd Feb 2020.Google ScholarGoogle Scholar

Index Terms

  1. Speech diversity and speech interfaces: considering an inclusive future through stammering

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        CUI '20: Proceedings of the 2nd Conference on Conversational User Interfaces
        July 2020
        271 pages
        ISBN:9781450375443
        DOI:10.1145/3405755

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 22 July 2020

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • short-paper
        • Research
        • Refereed limited

        Acceptance Rates

        CUI '20 Paper Acceptance Rate13of39submissions,33%Overall Acceptance Rate34of100submissions,34%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader