Skip to main navigation Skip to search Skip to main content

Rambler: Supporting Writing With Speech via LLM-Assisted Gist Manipulation

Susan Lin, Jeremy Warner, J.D. Zamfirescu-Pereira, Matthew G. Lee, Sauhard Jain, Shanqing Cai, Piyawat Lertvittayakumjorn, Michael Xuelin Huang, Shumin Zhai, Björn Hartmann, Can Liu*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Dictation enables efficient text input on mobile devices. However, writing with speech can produce disfluent, wordy, and incoherent text and thus requires heavy post-processing. This paper presents Rambler, an LLM-powered graphical user interface that supports gist-level manipulation of dictated text with two main sets of functions: gist extraction and macro revision. Gist extraction generates keywords and summaries as anchors to support the review and interaction with spoken text. LLM-assisted macro revisions allow users to respeak, split, merge, and transform dictated text without specifying precise editing locations. Together they pave the way for interactive dictation and revision that help close gaps between spontaneously spoken words and well-structured writing. In a comparative study with 12 participants performing verbal composition tasks, Rambler outperformed the baseline of a speech-to-text editor + ChatGPT, as it better facilitates iterative revisions with enhanced user control over the content while supporting surprisingly diverse user strategies. © 2024 Copyright held by the owner/author(s).
Original languageEnglish
Title of host publicationCHI’24
Subtitle of host publicationProceedings of the 2024 CHI Conference on Human Factors in Computing Systems
Place of PublicationNew York, NY
PublisherAssociation for Computing Machinery
ISBN (Print)9798400703300
DOIs
Publication statusPublished - 2024
Event2024 ACM CHI Conference on Human Factors in Computing Systems (CHI 2024): Surfing the World - Hybrid, Honolulu, United States
Duration: 11 May 202416 May 2024
https://chi2024.acm.org/

Conference

Conference2024 ACM CHI Conference on Human Factors in Computing Systems (CHI 2024)
Abbreviated titleCHI '24
PlaceUnited States
CityHonolulu
Period11/05/2416/05/24
Internet address

Funding

This project was funded in part by the Berkeley Artificial Intelligence Research Lab - Open Research Commons, and by the National Natural Science Foundation of China - Young Scientists Fund (CityU 62202397). We thank the Android Input Research team at Google for discussions and feedback.

Research Keywords

  • AI
  • LLM
  • STT
  • dictation
  • speech
  • speech-to-text
  • text composition
  • writing

Fingerprint

Dive into the research topics of 'Rambler: Supporting Writing With Speech via LLM-Assisted Gist Manipulation'. Together they form a unique fingerprint.

Cite this