skip to main content
10.1145/3512729.3533009acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article
Open Access

Voxento 3.0: A Prototype Voice-Controlled Interactive Search Engine for Lifelog

Published:27 June 2022Publication History

ABSTRACT

Voxento is an interactive voice-based retrieval system for lifelogs which has been redeveloped and optimised to participate in the fifth Lifelog Search Challenge LSC'22, at ACM ICMR'22. Based on the previous experience in the LSC competition and ranked in the top 4 in the last LSC'21 competition among 17 participants, we present a revised version of Voxento to address the critical points to improve the efficiency of retrieval tasks in lifelog datasets. Basically, Voxento provides a spoken interface to the lifelog data, which facilitates an expert and novice user to interact with a personal lifelog using a range of vocal commands and interactions. Briefly, we made some important improvements to support both the retrieval of content and system interaction. This latest version has been enhanced with the addition of a text-based search feature, new filters based on new metadata provided in lifelog data, rich visual information and features and enhanced speech query. Also, the data preparation tasks comprised a new function to reduce the number of non-relevant images and the latest CLIP model version used to derive features from images. The long term development of Voxento includes a lifelog retrieval that supports speech and conversation interaction with less physical actions required by users such as using a mouse. The system presented here uses a desktop computer in order to participate in the LSC'22 competition with the option to use voice interaction or standard text-based retrieval.

References

  1. Naushad Alam, Yvette Graham, and Cathal Gurrin. 2021. Memento: A Prototype Lifelog Search Engine for LSC'21. In LSC 2021 - Proceedings of the 4th Annual Lifelog Search Challenge (Taipei, Taiwan). Association for Computing Machinery, New York, NY, USA, 53--58. https://doi.org/10.1145/3463948.3469069Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Ahmed Alateeq, Mark Roantree, and Cathal Gurrin. 2020. Voxento: A Prototype Voice-controlled Interactive Search Engine for Lifelogs. In Proceedings of the Third Annual Workshop on the Lifelog Search Challenge (LSC'20) (Dublin, Ireland). Association for Computing Machinery, New York, NY, USA, 77--81. https://doi.org/10.1145/3379172.3391728Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ahmed Alateeq, Mark Roantree, and Cathal Gurrin. 2021. Voxento 2.0: A Prototype Voice-controlled Interactive Search Engine for Lifelogs. In LSC 2021 - Proceedings of the 4th Annual Lifelog Search Challenge (Taipei, Taiwan). Association for Computing Machinery, New York, NY, USA, 65--70. https://doi.org/10.1145/3463948.3469071Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Aaron Duane and Bjorn THORNór Jónsson. 2021. ViRMA: Virtual Reality Multimedia Analytics at LSC 2021. In LSC 2021 - Proceedings of the 4th Annual Lifelog Search Challenge (Taipei, Taiwan). Association for Computing Machinery, New York, NY, USA, 29--34. https://doi.org/10.1145/3463948.3469067Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Jim Gemmell, Gordon Bell, and Roger Lueder. 2006. MyLifeBits: A personal database for everything. Commun. ACM 49, 1 (2006), 88--95. https://doi.org/10.1145/1107458.1107460Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Cathal Gurrin, Alan F. Smeaton, and Aiden R. Doherty. 2014. LifeLogging: Personal big data. Vol. 8. Now Publishers. 1--125 pages. https://doi.org/10.1561/1500000033Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Cathal Gurrin, Liting Zhou, Graham Healy, Björn Por Þór Jónsson, Duc-Tien Dang-Nguyen, Jakub Lokoc, Minh-Triet Tran, Wolfgang Hürst, Luca Rossetto, and Klaus Schöffmann. 2022. Introduction to the Fifth Annual Lifelog Search Challenge, LSC'22. In Proc. International Conference on Multimedia Retrieval (ICMR'22). Association for Computing Machinery, New York, NY, USA.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Omar Shahbaz Khan, Aaron Duane, Björn THORNór Jónsson, Jan Zahálka, Stevan Rudinac, and Marcel Worring. 2021. Exquisitor at the Lifelog Search Challenge 2021: Relationships between Semantic Classifiers. In LSC 2021 - Proceedings of the 4th Annual Lifelog Search Challenge (Taipei, Taiwan). Association for Computing Machinery, New York, NY, USA, 3--6. https://doi.org/10.1145/3463948.3469255Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Emil Knudsen, Thomas Holstein Qvortrup, Omar Shahbaz Khan, and Björn THORNór Jónsson. 2021. XQC at the Lifelog Search Challenge 2021: Interactive Learning on a Mobile Device. In LSC 2021 - Proceedings of the 4th Annual Lifelog Search Challenge (Taipei, Taiwan). Association for Computing Machinery, New York, NY, USA, 89--93. https://doi.org/10.1145/3463948.3469063Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Andreas Leibetseder and Klaus Schoeffmann. 2021. LifeXplore at the Lifelog Search Challenge 2021. In LSC 2021 - Proceedings of the 4th Annual Lifelog Search Challenge (Taipei, Taiwan). Association for Computing Machinery, New York, NY, USA, 23--28. https://doi.org/10.1145/3463948.3469060Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Jakub Lokoc, Frantiek Mejzlik, Patrik Veselý, and Tomá Soucek. 2021. Enhanced SOMHunter for Known-item Search in Lifelog Data. In LSC 2021 - Proceedings of the 4th Annual Lifelog Search Challenge (Taipei, Taiwan). Association for Computing Machinery, New York, NY, USA, 71--73. https://doi.org/10.1145/3463948.3469074Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Thao Nhu Nguyen, Tu Khiem Le, Van Tu Ninh, Minh Triet Tran, Nguyen Thanh Binh, Graham Healy, Annalina Caputo, and Cathal Gurrin. 2021. LifeSeeker 3.0: An Interactive Lifelog Search Engine for LSC'21. In LSC 2021 - Proceedings of the 4th Annual Lifelog Search Challenge (Taipei, Taiwan). Association for Computing Machinery, New York, NY, USA, 41--46. https://doi.org/10.1145/3463948.3469065Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. 2021. Learning Transferable Visual Models From Natural Language Supervision. arXiv:2103.00020 http://arxiv.org/abs/2103.00020Google ScholarGoogle Scholar
  14. Jihye Shin, Alexandra Waldau, Aaron Duane, and Björn THORNór Jónsson. 2021. PhotoCube at the Lifelog Search Challenge 2021. In LSC 2021 - Proceedings of the 4th Annual Lifelog Search Challenge (Taipei, Taiwan). Association for Computing Machinery, New York, NY, USA, 59--63. https://doi.org/10.1145/3463948.3469073Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Florian Spiess, Ralph Gasser, Silvan Heller, Luca Rossetto, Loris Sauter, Milan Van Zanten, and Heiko Schuldt. 2021. Exploring Intuitive Lifelog Retrieval and Interaction Modes in Virtual Reality with vitrivr-VR. In LSC 2021 - Proceedings of the 4th Annual Lifelog Search Challenge (Taipei, Taiwan). Association for Computing Machinery, New York, NY, USA, 17--22. https://doi.org/10.1145/3463948.3469061Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Ly Duyen Tran, Manh Duy Nguyen, Nguyen Thanh Binh, Hyowon Lee, and Cathal Gurrin. 2021. Myscéal 2.0: A Revised Experimental Interactive Lifelog Retrieval System for LSC'21. In LSC 2021 - Proceedings of the 4th Annual Lifelog Search Challenge (Taipei, Taiwan). Association for Computing Machinery, New York, NY, USA, 11--16. https://doi.org/10.1145/3463948.3469064Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Voxento 3.0: A Prototype Voice-Controlled Interactive Search Engine for Lifelog

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader