HMM-based phoneme speech recognition system for the control and command of industrial robots

Typ: Artykuł

Liczba pobrań: 1038

Pobierz zasób

PDF

Cytuj

BibTeX EndNote

Tytuł: HMM-based phoneme speech recognition system for the control and command of industrial robots
Autor: Naik, Adwait
Opublikowane w: Technical Transactions
Numeracja: Vol. 118, iss. 1
Data wydania: 2021
Miejsce wydania: Warsaw
Wydawca: Sciendo
Sekcja czasopisma: Mechanics
Język: angielski
eISSN: 2353-737X
DOI: https://doi.org/10.37705/TechTrans/e2021002
Słowa kluczowe: speech recognition, phoneme, robotics, human-robot interaction (HRI), linear predictive coding (LPC), hidden Markov model (HMM)
Abstrakt: In recent years, the integration of human-robot interaction with speech recognition has gained a lot of pace in the manufacturing industries. Conventional methods to control the robots include semi-autonomous, fully-autonomous, and wired methods. Operating through a teaching pendant or a joystick is easy to implement but is not effective when the robot is deployed to perform complex repetitive tasks. Speech and touch are natural ways of communicating for humans and speech recognition, being the best option, is a heavily researched technology. In this study, we aim at developing a stable and robust speech recognition system to allow humans to communicate with machines (robotic-arm) in a seamless manner. This paper investigates the potential of the linear predictive coding technique to develop a stable and robust HMM-based phoneme speech recognition system for applications in robotics. Our system is divided into three segments: a microphone array, a voice module, and a robotic arm with three degrees of freedom (DOF). To validate our approach, we performed experiments with simple and complex sentences for various robotic activities such as manipulating a cube and pick and place tasks. Moreover, we also analyzed the test results to rectify problems including accuracy and recognition score.
Wydział: Zbiory cyfrowe BPK
Licencja: Creative Commons; cc by sa 4.0
Prawa dostępu: Zasób dostępny dla wszystkich