Abstract
In this work, we propose a model-based framework named Vitas to handle VUI testing of VPA apps. Vitas interacts with the app VUI, and during the testing process, it retrieves semantic information from voice feedbacks by natural language processing. It incrementally constructs the finite state machine (FSM) model of the app with a weighted exploration strategy guided by key factors such as the coverage of app functionality. We conduct a large-scale testing on 41,581 VPA apps (i.e., skills) of Amazon Alexa, the most popular VPA service, and find that 51.29% of them have weaknesses. They largely suffer from problems such as unexpected exit/start, privacy violation and so on. Our work reveals the immaturity of the VUI designs and implementations in VPA apps, and sheds light on the improvement of several crucial aspects of VPA apps. © 2022 Association for Computing Machinery.
| Original language | English |
|---|---|
| Title of host publication | ASE '22 |
| Subtitle of host publication | Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering |
| Publisher | Association for Computing Machinery |
| Number of pages | 12 |
| ISBN (Print) | 978-1-4503-9475-8 |
| DOIs | |
| Publication status | Published - Oct 2022 |
| Externally published | Yes |
| Event | 37th IEEE/ACM International Conference on Automated Software Engineering (ASE 2022) - Oakland Center, Rochester, United States Duration: 10 Oct 2022 → 14 Oct 2022 https://conf.researchr.org/home/ase-2022 |
Conference
| Conference | 37th IEEE/ACM International Conference on Automated Software Engineering (ASE 2022) |
|---|---|
| Abbreviated title | ASE '22 |
| Place | United States |
| City | Rochester |
| Period | 10/10/22 → 14/10/22 |
| Internet address |
Funding
We are grateful for the constructive feedback of all the anonymous reviewers to improve this manuscript. The authors from Nanjing University are supported in part by the Leading-edge Technology Program of Jiangsu Natural Science Foundation (No. BK20202001), the National Natural Science Foundation of China (No. 62172200 and No.62032010) and the Fundamental Research Funds for the Central Universities (No. 020214380094). The authors from IIE, Chinese Academy of Science, are supported in part by NSFC U1836211, Beijing Natural Science Foundation (No.M22004), the Anhui Department of Science and Technology under Grant 202103a05020009, Youth Innovation Promotion Association CAS, Beijing Academy of Artificial Intelligence (BAAI). The authors from University of Queensland are supported in part by UQ’s NSRSG grant and Oracle Labs Australia under the CR grant.