The problem of voice interaction in distribution management tasks
Abstract
The problem of automation of voice interaction in distribution management tasks is analyzed in the article. The main goal is to analyze domestic and foreign experience and single out ways for the use of voice control capabilities to optimize distribution processes in three areas: distribution management systems, voice recognition systems and voice interaction systems. The limits of GPS monitoring systems, which are used to control delivery processes, but do not reflect the causes of real situation deviation from the planned route, are discovered. To overcome these limits, the direction of voice interaction system for management tasks in distribution is offered. Available traditional voice recognition systems based on neural networks and hidden Markov models do not provide the required level for distribution because of resistance to noise, they require powerful hardware or stable access to the Internet. Two new principles of resolving the problem – the transition to another speech recognition unit and building a tree of possible scenarios of interaction to reduce the amount necessary to recognize commands, depending on the context of the situation, are offered. Thus, integrative model of reflex voice interaction in distribution management, which combines two articulated principles, is offered. The prospect of further research is to develop a software, which will be realized in proposed model, and to integrate it with existing authoring system of automation routing
Keywords
voice control, voice interaction, distribution management, speech recognition
References
- Baumann,H. (2012). Order picking supported by mobile computing. Bremen: University of Bremen.
- Bondaros,Yu.G., Makovkin,K.A., & Chuchupal,V.Ya. (2007). A speech-command recognition system for pilot voice interface in integrated modular avionics. Bulletin of Computer and Information Technologies, 4, 2–13.
- Comendador,J.A., López‑Lambas,M.E., & Monzón de Cáceres,A. (2012, June). A GPS analysis for urban freight distribution. In The Seventh International Conference on City Logistics (pp. …). E.T.S.I. Caminos, Canales y Puertos (UPM). https://doi.org/10.1016/j.sbspro.2012.03.127.
- Egorkenkov,A.V. (2016). Applied application of a reflex voice control system. Management of Development of Complex Systems, 25, 103–107.
- Gefke,D.A., & Zatsepine,P.M. (2012). Application of hidden Markov models for recognition of acoustic sequences. Journal of Altai State University, Barnaul.
- Gonzalez‑Feliu,J., Pluvinet,P., Serouge,M., & Gardrat,M. (2013, June). GPS‑based data production in urban freight distribution. Nova Science.
- Heisterkamp,P. (2001). Linguatronic product‑level speech system for Mercedes‑Benz cars. In Proceedings of the First International Conference on Human Language Technology Research (pp. 1–2). San Diego: Association for Computational Linguistics. https://doi.org/10.3115/1072133.1072199.
- Hinton,G., Deng,L., Yu,D., et al. (2012). Deep neural networks for acoustic modeling in speech recognition. IEEE Signal Processing Magazine.
- Ishiguro,H. (2016). Adaptation to teleoperated robots. International Journal of Psychology, 51, 10. https://doi.org/10.1002/ijop.12361
- Jonsson,I.‑M. (2009). Social and emotional characteristics of speech‑based in‑vehicle information systems: Impact on attitude and driving behaviour (Doctoral dissertation). Linköping University.
- Korsun,O.N., Yatsko,A.A., Finaev,I.M., & Chuchupal,V.Ya. (2013). Experimental study of the influence of various acoustic interferences on automatic speech-command recognition results. Science and Education: Scientific Journal of Bauman Moscow State Technical University, 1, 12.
- Kravchenko,A.P., Kramar,N.M., & Morozov,I.V. (2009). Automated computer voice-control system for an automobile. Automobile Transport, (25), 44–47.
- Kravchenko,A.P., Tereshchenko,T.M., & Morozov,I.V. (2012). Automated computer system for interactive communication with the driver. Automobile Transport, (30), 54–58.
- Kumazaki,H., Yoshikawa,Y., Matsumoto,Y., et al. (2016, May). Robot-mediated interventions for social anxiety in individuals with autism spectrum disorder. In Proceedings of the 2016 International Meeting for Autism Research (pp. 270–271).
- Makovkin,K.A. (2006). Hybrid models: Hidden Markov models and neural networks, their application in speech recognition systems. In Models, Methods, Algorithms and Architectures of Speech Recognition Systems. Moscow.
- Naidyonov,I.M. (2015). Mathematical server for automatic building of logistic routes: Ukraine Patent No. 60788 / I. M. Naidyonov; application 25.05.2015; published 30.10.2015, Bulletin No. 38, 2 p.
- Naidyonov,I.M. (2015). Topological heuristic in solving the vehicle routing problem. ScienceRise, 11(6), 52–58. https://doi.org/10.15587/2313-8416.2015.44381.
- Naidyonov,M.I. (2008). Formation of reflexive control systems in organizations. Kyiv: Milenium.
- Nishio,S., Minato,T., & Ishiguro,H. (2015, October). Using androids to provide communication support for the elderly. New Breeze, 27(4), 14–17.
- Park,Y., Itakura,S., Henderson,A.M., et al. (2015, October). Do infants consider a robot as a social partner in collaborative activity? In Proceedings of the 3rd International Conference on Human-Agent Interaction (pp. 91–95). Daegu, South Korea: ACM. https://doi.org/10.1145/2814940.2814953.
- Pylypenko,V.V. (n.d.). Recognition of key words in speech stream using a phonetic stenograph. Speech Recognition Systems: Models, Methods, Algorithms and Architectures.
- Quak,H., & de Koster,R. (2006, September). Urban distribution: The impacts of different governmental time‑window schemes (ERIM Report). Erasmus Research Institute of Management.
- Saini,P., & Kaur,P. (2013). Automatic speech recognition: A review. International Journal of Engineering Trends and Technology, 4(2).
- Teslia,I., Popovych,N., Pylypenko,V., & Chornyi,O. (2014). The non-force interaction theory for reflex system creation with application to TV voice control. In Proceedings of the 6th International Conference on Agents and Artificial Intelligence (pp. 288–296). https://doi.org/10.5220/0004754702880296.
- Teslia,Yu.M. (2010). Introduction to the Informatics of Nature. Kyiv: Maklaut.
- Teslia,Yu.M., & Chornyi,O. (2013). Reflex voice control system of technical devices (RSVU). Management of Development of Complex Systems, 15, 105–110.
- Volkov,A.V. (2014). Analysis of existing recognition methods for invariance to background noise and speaker diction. Izvestiya of Tula State University. Technical Sciences, 9(2), 6. https://doi.org/10.14489/vkit.2016.05.pp.021-028.