Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Indrajit Roy, Apurba Ghosh
DOI Link: https://doi.org/10.22214/ijraset.2023.57228
Our proposed system AVATAR (AI Virtual Assistant Technology for Automatic Response) is an innovative voice assistant system, combining Artificial Intelligence (AI) and Python for human-like interactions. It seamlessly executes a range of functions, from dispatching emails to conducting searches on Wikipedia. The system's design incorporates essential Python packages and ultrasonic sensors for object detection and face recognition. Python's extensive libraries and clean syntax make it the optimal language for this project. AVATAR's security measures include biometric authentication and password protection. While internet connectivity is crucial for optimal performance, the system reliably operates within its designated input range. This research represents a significant leap in AI-driven applications, enhancing efficiency and user experience. Additionally, individuals who are blind or have amputations can utilize AVATAR, as the system is entirely voice-controlled.
I. INTRODUCTION
Artificial intelligence, when integrated with machines, showcases the capacity to emulate human thought processes [1][2]. This entails designing a computer system that performs tasks which typically require human involvement. Python, being a rapidly evolving language [3][4][5][6], offers a conducive environment for crafting scripts for voice assistants [3][7][8][11]. The assistant's instructions can be customized to suit the user's needs. The speech recognition system is intended to operate on par with well-known counterparts like Alexa and Siri [10]. Python offers a package known as SpeechRecognition, which enables the conversion of speech into text. This allows users to control their assistant without manual keyboard input. In essence, users can automate their computer and incorporate features for home automation [9]. Through the execution of this project, we have come to appreciate how the integration of AI in various domains enhances daily life, leading to increased efficiency and time savings.
A. Functionalities of AVATAR
The capabilities of this system represent a convergence of cutting-edge technologies, redefining interactive experiences. It seamlessly executes a spectrum of functions, including but not limited to:
In the development process, Visual Studio Code stood as the cornerstone of our environment, coupled with Python (.py) files, driving the engine of innovation. The integration of a diverse array of modules and libraries, including Speech Recognition, Datetime, Wikipedia, pyttsx3, Smtplib, pywhatkit, pyjokes, pyPDF2, pyautogui, pyQt, and others, empowered our system with unparalleled capabilities. The user interface was meticulously crafted for seamless and intuitive operation, creating an immersive and engaging experience with the Assistant. Our voice assistant, aptly named AVATAR, is a testament to the convergence of language processing and voice recognition principles. It offers a suite of features comparable to its contemporaries. Powered by Artificial Intelligence, it ensures precision and efficiency in every interaction, streamlining tasks and eliminating the need for manual input. The algorithms underpinning this assistant are meticulously optimized, exemplifying efficiency in every operation.
II. LITERATURE REVIEW
III. PROPOSED WORK
In this project, the chosen Integrated Development Environment (IDE) is Visual Studio Code, serving as the primary platform for crafting and managing all Python files. Its seamless package management system facilitates easy installations. The project leverages an array of essential modules and libraries, including but not limited to pyttsx3, Speech Recognition, Datetime, Wikipedia, Smtplib, pywhatkit, pyjokes, pyPDF2, pyautogui, pyQt. A vibrant Graphical User Interface (GUI) has been meticulously designed to imbue conversations with AVATAR with an aesthetically pleasing and functionally intuitive interface.
A. System Design
The system is designed using principles of Artificial Intelligence and relies on essential Python packages. Python provides a wide array of libraries and packages that facilitate various tasks; for example, pyPDF2 can be used to read PDF files. The system consistently employs an ultrasonic sensor to monitor and measure the distance of any object (user) in front of it. Upon detecting an object (user) within a specific range, it initiates the face recognition process. This involves capturing an image or a series of images of the user's face. The captured face image is then compared with a database of pre-registered faces to verify a match. This database can be either locally stored or accessed remotely, depending on the system's configuration. In the event of a matching face, the system prompts the user for a password, adding an extra layer of security to ensure authorized access. The entered password is subsequently compared with the stored password associated with the recognized face. If both the face and password match, access is granted, allowing the user to utilize the system. Conversely, if there is no match or the password is incorrect, the system reverts to object detection and measurement mode, continuing to monitor the area and waiting for another user to approach.
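The access flow described above can be sketched as a single function that is called repeatedly by a monitoring loop. This is only an illustrative sketch, not the authors' implementation: the sensor reading, face matcher, password prompt, and password check are passed in as callables, and the names `attempt_access` and `DETECTION_RANGE_CM`, as well as the 100 cm threshold, are assumptions that do not appear in the paper.

```python
from typing import Callable, Optional

# Assumed detection threshold; the paper does not state the actual range.
DETECTION_RANGE_CM = 100.0


def attempt_access(
    read_distance_cm: Callable[[], float],
    recognize_face: Callable[[], Optional[str]],
    ask_password: Callable[[], str],
    check_password: Callable[[str, str], bool],
) -> Optional[str]:
    """Run one pass of the detect -> recognize -> password flow.

    Returns the name of the authenticated user, or None so that the caller
    can go back to monitoring the ultrasonic sensor.
    """
    # Step 1: do nothing until an object (user) is within range of the sensor.
    if read_distance_cm() > DETECTION_RANGE_CM:
        return None

    # Step 2: compare the captured face with the pre-registered faces.
    user = recognize_face()
    if user is None:
        return None

    # Step 3: the password must match the one stored for the recognized face.
    if check_password(user, ask_password()):
        return user
    return None
```

Injecting the four callables keeps the control flow separate from the hardware, so the same function can be exercised with stub values before a real sensor, camera, or face database is attached.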
Once the system grants a user access, it takes voice commands through the microphone, recognizes the speech, processes the input, and executes the requested task. At the core of this project lies a workflow in which the assistant responds dynamically to user input, executing tasks according to commands spoken in English. The workflow of this AI system is shown in Fig. 1.
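A minimal sketch of this listen, recognize, and execute workflow is given below. It assumes the third-party SpeechRecognition package (with PyAudio for microphone access) for the listening step; the keyword table `HANDLERS` and the functions `listen_once`, `tell_time`, `greet`, and `execute` are hypothetical names chosen for illustration and are not taken from the paper.

```python
import datetime
from typing import Callable, Dict, Optional


def listen_once() -> Optional[str]:
    """Capture one utterance from the microphone and return it as lowercase text.

    Needs the SpeechRecognition package, PyAudio, and a working microphone.
    The import is deferred so the rest of the sketch runs without them.
    """
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        recognizer.adjust_for_ambient_noise(source)
        audio = recognizer.listen(source)
    try:
        # Google's web recognizer is used here, so internet access is required.
        return recognizer.recognize_google(audio).lower()
    except (sr.UnknownValueError, sr.RequestError):
        return None


def tell_time(_command: str) -> str:
    return datetime.datetime.now().strftime("The time is %H:%M")


def greet(_command: str) -> str:
    return "Hello, how can I help?"


# Hypothetical keyword table: each keyword maps to the function that handles it.
HANDLERS: Dict[str, Callable[[str], str]] = {
    "time": tell_time,
    "hello": greet,
}


def execute(command: Optional[str]) -> str:
    """Pick the first handler whose keyword occurs in the recognized text."""
    if not command:
        return "Sorry, I did not catch that."
    text = command.lower()
    for keyword, handler in HANDLERS.items():
        if keyword in text:
            return handler(text)
    return "Sorry, I cannot do that yet."
```

In a running assistant the two halves would be combined as `execute(listen_once())` inside a loop, with the returned string passed to a text-to-speech engine. Keyword matching is the simplest possible dispatch strategy and is used here only to keep the sketch short.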
B. Imported Python Libraries
The following Python libraries are used in our AI assistant:
C. Functions
The imported Python libraries and functions are shown in Fig. 2 and Fig. 3.
D. Why Choose Python Language for AVATAR
Although Python was created before machine learning and AI became so popular, we chose it because it has certain qualities that set it apart from other languages. These qualities are discussed below.
The popularity of the Python language is depicted in Figure 4.
IV. RESULTS AND DISCUSSION
AVATAR, our advanced voice assistant, represents a pinnacle of technological innovation. Its multifaceted capabilities span from efficient email dispatch to precise facial recognition, showcasing a seamless fusion of language processing and Artificial Intelligence. This remarkable system excels in tasks as diverse as curating personalized playlists based on the user's emotional state to delivering real-time weather forecasts. Its integration with Bluetooth devices and control over electronic appliances further extend its utility beyond the digital realm. The meticulously optimized algorithms driving AVATAR ensure not only efficiency but also a human-like touch, evident in its engaging dialogues and insightful questioning. By executing system commands through voice prompts, it offers a hands-free and efficient approach to computer operations, exemplifying the future of interactive technology.
A. Specialities of AVATAR
Some screenshots of task executions are shown in Fig. 5, Fig. 6, Fig. 7, Fig. 8 and Fig. 9.
B. Limitation
Because our assistant relies on internet connectivity to receive input commands, a robust internet connection is imperative. With a faster internet connection, tasks are executed swiftly and with high accuracy.
C. Security
The system is protected by a password and also includes a face lock feature, so that no unauthorized person can access it without the user's permission. The security testing primarily focuses on identifying vulnerabilities and potential risks. Since AVATAR is a local desktop application, there is no risk of a data breach through remote access. The software is dedicated to a specific system, and it will only be activated when the user logs in.
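The paper does not describe how the password is stored. As one possible approach, and purely as an assumption, the stored value could be a salted PBKDF2 hash rather than the password itself, using only Python's standard library. The names `hash_password`, `verify_password`, and `ITERATIONS` below are illustrative.

```python
import hashlib
import hmac
import os
from typing import Optional, Tuple

# Assumed work factor; higher values slow down brute-force guessing.
ITERATIONS = 200_000


def hash_password(password: str, salt: Optional[bytes] = None) -> Tuple[bytes, bytes]:
    """Return (salt, digest) for a password, creating a random salt if none is given."""
    if salt is None:
        salt = os.urandom(16)
    digest = hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, ITERATIONS)
    return salt, digest


def verify_password(password: str, salt: bytes, expected: bytes) -> bool:
    """Recompute the digest with the stored salt and compare in constant time."""
    _, digest = hash_password(password, salt)
    return hmac.compare_digest(digest, expected)
```

With this scheme only the salt and digest would be kept alongside each registered face, so a copy of the local database would not reveal the passwords directly.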
D. Stability
Our system is stable, providing bounded output for a specific range of input. It functions effectively across all its intended capabilities. Given that our assistant relies on internet connectivity to receive input commands, a robust internet connection is imperative. With a faster internet connection, tasks are executed swiftly and with high accuracy.
AVATAR stands at the forefront of voice assistance technology, a groundbreaking system that seamlessly combines natural conversation with unmatched effectiveness and efficiency. Its cutting-edge features not only differentiate it from its predecessors but also set new standards in the field. It's worth noting that throughout the development process, we diligently recognized the importance of addressing accent-related challenges to ensure the highest level of accuracy and inclusivity.
Looking ahead, there are promising avenues to explore. The development of a dedicated AVATAR Android app is on our agenda, extending its accessibility to a broader user base. Furthermore, the foundational principles of AVATAR provide a robust platform for the creation of a new generation of voice assistants, poised to redefine the landscape of human-machine interaction. While our system currently boasts robust password and biometric protections, we acknowledge the imperative need for voice command encryption to further bolster security measures. In our pursuit of excellence, we are actively researching and investing in state-of-the-art microphone technology to address the challenge of background noise interference, a critical step towards ensuring seamless functionality in diverse environments.
Looking further into the future, the integration of machine learning, deep learning, and AI technologies holds the promise of propelling the system to unprecedented heights, endowing it with the capacity to learn and execute tasks with unparalleled precision. Through dedicated deep learning training regimens, we are committed to maximizing AVATAR's capabilities. Envisioning the horizon, the potential emergence of a humanoid robot inspired by our system is a tantalizing prospect. With boundless possibilities at our fingertips, we dare to imagine an entire edifice seamlessly controlled by this visionary system, ushering in a new era of intelligent infrastructure management.
[1] Ashutosh Sakharkar, Shidesh Tondawalkar, Pratik Thombara, Prof. Rajashri Sonawane, “Python Based Ai Assistant For Computer,” International Research Journal of Engineering and Technology (IRJET), Volume: 08, Issue: 04, Apr 2021.
[2] R. Sathya, M. Pavithra, G. Girubaa, “Artificial Intelligence For Speech Recognition,” International Journal of Computer Science & Engineering Technology (IJCSET), Volume: 08, No. 01, Jan 2017.
[3] Pooja C. Goutam, Monika S. Jalpure, Akshata S. Gavade, Pranjali Chaudhary, Prof. A. V. Gundavade, “Voice Assistant Using Python,” International Journal of Creative Research Thoughts (IJCRT), Volume: 10, Issue: 06, June 2022.
[4] Maghna Chandel, Sanjay Silakari, Rajeev Pandey, Smita Sharma, “A Study on Machine Learning and Python’s Frameworks,” International Journal of Computer Science and Engineering (JCSE), Volume: 10, Issue: 05, May 2022.
[5] Anjali Fapal, Trupti Kanade, Bharati Janrao, Mrunalini Kamble, Megha Raule, “Personal Virtual Assistant For Windows Using Python,” International Research Journal of Modernization in Engineering Technology and Science (IRJMETS), Volume: 03, Issue: 07, July 2021.
[6] Deepak Shende, Ria Umabiya, Monika Rathore, Aishwarya Bhisikar, Anup Bhange, “AI Based Voice Assistant Using Python,” International Journal of Emerging Technologies and Innovative Research (IJETIR), Volume: 06, Issue: 02, February 2019.
[7] Divisha Pandey, Afra Ali, Shweta Dubey, Muskan Srivastava, Shyam Dwivedi, Md. Saif Raza, “Voice Assistant Using Python And AI,” International Research Journal of Engineering and Technology (IRJET), Volume: 09, Issue: 05, May 2022.
[8] Dr. Jaydeep Patil, Atharva Shewale, Ekta Bhushan, Alister Fernandes, Rucha Khartadkar, “A Voice-Based Assistant Using Google Dialogflow and Machine Learning,” International Journal of Scientific Research and Technology (IJSRT), Volume: 08, Issue: 03, May 2021.
[9] Qzer Celik, Serthan Salih Altunaydin, “A Research on Machine Learning Methods and Its Applications,” Journal of Education Technology and Online Learning (JETOL), Volume: 1, Issue: 03, August 2018.
[10] Sitaram Pal, Ashish Chauhan, Sanjeev Kumar Gupta, “Voice Controlled Smart Home Automation System,” International Journal of Recent Technology and Engineering (IJRTE), ISSN: 2277-3878, Volume: 8, Issue, September 2019.
[11] V. Geetha, C. K. Gomathy, Kottamasu Manasa Sri Vardhan, Nukala Pavan Kumar, “The Voice Enabled Personal Assistant for Pc using Python,” International Journal of Engineering and Advanced Technology (IJEAT), ISSN: 2249-8958, Volume-10, Issue-4, April 2021.
Copyright © 2023 Indrajit Roy, Apurba Ghosh. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET57228
Publish Date : 2023-12-01
ISSN : 2321-9653
Publisher Name : IJRASET