Computer Science Department Faculty Publication Series

AdaStreamLite: Environment-adaptive Streaming Speech Recognition on Mobile Devices

Yuheng Wei, Xidian University
Jie Xiong, University of Massachusetts Amherst
Hui Liu, Xidian University
Yingtao Yu, Xidian University
Jiangtao Pan, Xidian University
Junzhao Du, Xidian University

Publication Date

2024

Journal or Book Title

Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies

Abstract

Streaming speech recognition aims to transcribe speech to text in a streaming manner, providing real-time speech interaction for smartphone users. However, it is not trivial to develop a high-performance streaming speech recognition system purely running on mobile platforms, due to the complex real-world acoustic environments and the limited computational resources of smartphones. Most existing solutions lack the generalization to unseen environments and have difficulty to work with streaming speech. In this paper, we design AdaStreamLite, an environment-adaptive streaming speech recognition tool for smartphones. AdaStreamLite interacts with its surroundings to capture the characteristics of the current acoustic environment to improve the robustness against ambient noise in a lightweight manner. We design an environment representation extractor to model acoustic environments with compact feature vectors, and construct a representation lookup table to improve the generalization of AdaStreamLite to unseen environments. We train our system using large speech datasets publicly available covering different languages. We conduct experiments in a large range of real acoustic environments with different smartphones. The results show that AdaStreamLite outperforms the state-of-the-art methods in terms of recognition accuracy, computational resource consumption and robustness against unseen environments.

DOI

https://doi.org/10.1145/3631460

Pages

1-29

Volume

Issue

License

UMass Amherst Open Access Policy

Recommended Citation

Wei, Yuheng; Xiong, Jie; Liu, Hui; Yu, Yingtao; Pan, Jiangtao; and Du, Junzhao, "AdaStreamLite: Environment-adaptive Streaming Speech Recognition on Mobile Devices" (2024). Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies. 1370.
https://doi.org/10.1145/3631460

Download

COinS

ScholarWorks@UMass Amherst

Computer Science Department Faculty Publication Series

AdaStreamLite: Environment-adaptive Streaming Speech Recognition on Mobile Devices

Publication Date

Journal or Book Title

Abstract

DOI

Pages

Volume

Issue

License

Recommended Citation

Browse

Author Corner

Links

ScholarWorks@UMass Amherst

Computer Science Department Faculty Publication Series

AdaStreamLite: Environment-adaptive Streaming Speech Recognition on Mobile Devices

Authors

Publication Date

Journal or Book Title

Abstract

DOI

Pages

Volume

Issue

License

Recommended Citation

Share

Browse

Author Corner

Links