Speech recognition model for solution of website element management tasks

Keywords: recognition, website, SpeechGrammarList API Web Speech, script, interface

Abstract

The article deals with the method of speech recognition, namely, the possibility of using this technology for the language control of website elements. Due to the widespread introduction of such technologies into human lives, the task is to create a voice application that would improve the usability. The feature of the proposed model is the implementation of speech recognition not in the service, as it happens in most cases, but in the device itself, using only a microphone. In the existing design users can easily add any commands. Language recognition is implemented on the website page using the JavaScript programming language. The script work is based on the use of the SpeechRecognition Web Speech APIs and the SpeechGrammarList API for Web Speech. The current direction of the use of speech in the process of interaction between the technical system and the user is the application of such technology for creating comfortable living conditions for people who have a violation of the musculoskeletal system and who have lost the opportunity to use traditional means and methods of dialogue with the system. It analyzed the basic principles of the website and the ability to control it using voice control. To operate the proposed speech recognition model, two interfaces are used to solve the problem of managing elements of a website: SpeechRecognition Web Speech API and SpeechGrammarList API Web Speech. In order to manage the elements of the website, a model is proposed, the implementation of which is possible through the use of a microphone on the user's desktop only. The feature of the proposed model is that it is easy to add any commands to an already existing structure. Such application provides a great perspective for building new web interfaces in combination with artificial intelligence.

References

Запрягаев, С. А., & Коновалов, А. Ю. (2009). Распознавание речевых сигналов. Вестник Воронежского государственного университета. Серия: Системный анализ и информационные технологии, 2, 39-48.

Макаров, В. (2017). Как устроен искусственный интеллект: распознавание речи. Взято с https://www.popmech.ru/technologies/392382-kak-ustroen-iskusstvennyy-intellekt-raspoznavanie-rechi/.

Сорокин, В. Н. (2008). Синтез речи. Москва: Наука.

Черный, Д. В. (2014). Сверхбыстрое распознавание речи без серверов на реальном примере. Взято с https://habrahabr.ru/post/237589.

Як працює команда «Ok Google» на пристрої Android – Пристрій Android – Пошук Google Довідка. (2018). Support.google.com. Взято з https://support.google.com/websearch/answer/6031948?hl=uk&co=GENIE.Platform%3DAndroid.

Davies, K. H., Biddulph, R., & Balashek, S. (1952). Automatic Speech Recognition of Spoken Digits. Journal of the Acoustical Society of America, 24 (6), 637-642.

Jurafsky, D. & Martin, J. H. (2009). Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Prentice-Hall.

Klass, P. J. (1962). Fiber Optic Device Recognizes Signals. Aviation Week & Space Technology, 77 (20), 94-101. N.Y.: McGraw-Hill. Retrieved from https://archive.org/stream/Aviation_Week_1962-11-12#page/n46/mode/1up.

REFERENCES (TRANSLATED AND TRANSLITERATED)

Zapryagaev, S. A., & Konovalov, A. Yu. (2009). Recognition of speech signals. Vestnik Voronezhskogo gosudarstvennogo universiteta. Seriya: Sistemnyj analiz i informacionnye tekhnologii, 2, 39-48. (in Russian)

Makarov, V. (2017). How is the artificial intelligence: speech recognition. Retrieved from https://www.popmech.ru/technologies/392382-kak-ustroen-iskusstvennyy-intellekt-raspoznavanie-rechi/. (in Russian)

Sorokin, V. N. (2008). Synthesis of Speech. Moscow: Nauka. (in Russian)

Cherny, D. V. (2014). Super-fast speech recognition without servers on a real example. Retrieved from https://habrahabr.ru/post/237589. (in Russian)

How the "Ok Google" command works on the Android device – Android Device – Google Help Search. (2018). Support.google.com. Retrieved from https://support.google.com/websearch/answer/6031948?hl=uk&co=GENIE.Platform%3DAndroid. (in Ukrainian)

Davies, K. H., Biddulph, R., & Balashek, S. (1952). Automatic Speech Recognition of Spoken Digits. Journal of the Acoustical Society of America, 24 (6), 637-642. (in English)

Jurafsky, D. & Martin, J. H. (2009). Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Prentice-Hall. (in English)

Klass, P. J. (1962). Fiber Optic Device Recognizes Signals. Aviation Week & Space Technology, 77 (20), 94-101. N.Y.: McGraw-Hill. Retrieved from https://archive.org/stream/Aviation_Week_1962-11-12#page/n46/mode/1up. (in English)

Published
2018-06-30
How to Cite
Lytvyn, Y., & Strokan, O. (2018). Speech recognition model for solution of website element management tasks. Ukrainian Journal of Educational Studies and Information Technology, 6(2), 1-7. https://doi.org/10.32919/uesit.2018.02.06