Designing with Gaze: Tama – A gaze-aware smart speaker platform

Research output: Contribution to journal › Journal article › Research › peer-review

  • Donald McMillan
  • Barry Alan Brown
  • Ikkaku Kawaguchi
  • Razan Jaber
  • Jordi Solsona Belenguer
  • Hideaki Kuzuoka

Recent developments in gaze tracking present new opportunities for social computing. This paper presents a study of Tama, a gaze-actuated smart speaker. Tama was designed to take advantage of research on gaze in conversation. Rather than being activated with a wake word (such as “Ok Google”), Tama detects the gaze of a user, moving an articulated ‘head’ to achieve mutual gaze. We tested Tama’s use in a multi-party conversation task, with users successfully activating and receiving a response to over 371 queries (over 10 trials). When Tama worked well, there was no significant difference in length of interaction. However, interactions with Tama had a higher rate of repeated queries, causing longer interactions overall. Video analysis lets us explain the problems users had interacting with gaze. In the discussion, we describe implications for designing new gaze systems, using gaze both as input and output. We also discuss the relationship to anthropomorphic design and how such systems can take advantage of learned skills of interaction. Finally, two paths for future work are proposed, one in the field of speech agents, and the second in using human gaze as an interaction modality more widely.

Original language: English
Article number: 176
Journal: Proceedings of the ACM on Human-Computer Interaction
Volume: 3
Issue number: CSCW
ISSN: 2573-0142
Publication status: Published - Nov 2019
Externally published: Yes

Bibliographical note

Funding Information:
This work was supported by JSPS KAKENHI grant number 18H06473, Oki Electric Industry Co., Ltd., Vetenskapsrådet grant 2016-03843, and the Swedish Foundation for Strategic Research project RIT15-0046.

Publisher Copyright:
© 2019 Copyright held by the owner/author(s). Publication rights licensed to ACM.

    Research areas

  • Gaze Detection, Gaze Interaction, Smart Speaker, Voice Assistant

ID: 318207560