Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Atish Patil, Madhuri Kardule, Praveen Gupta
DOI Link: https://doi.org/10.22214/ijraset.2023.54064
Certificate: View Certificate
Python is a relatively new programming language, and writing a script for a voice assistant in Python is simple. How the assistant responds to our requests is entirely under our control. We can convert anything we speak into text using voice recognition. This method is used by all voice-activated assistants like Alexa, Siri, Cortana, and others. Python\'s Speech Recognition library makes it easy to convert speech into text. The challenge of making my own personal helper was exciting. You can now launch your preferred IDE, send emails, perform web searches, play music, and more with just one voice command without ever opening a browser. Given the status of technology today, it is capable of performing any task at least as well as we do, if not better. This project\'s capabilities include being able to send emails to anyone, read PDF files for you, send WhatsApp messages, etc. Now, the fundamental question is: How is it an AI? The virtual assistant I developed isn\'t quite artificial intelligence, but it is the result of a number of statements. Artificial intelligence (AI) computers\' main objective is to carry out tasks as effectively and efficiently as people. My virtual assistant is not a particularly good example of artificial intelligence, but it is one, and that is a fact.
I. INTRODUCTION
Artificial intelligence, when used in conjunction with machines, demonstrates the ability to think like a human. In this sense, computer systems are usually designed to require human interaction. As you know, Python is an emerging language, so it is easy to write voice assistants in Python. Assistant instructions can be processed according to the user's needs. Python has an API called Speech Recognition that can convert speech to text. Making my own assistant was fun. Send an email without typing a word, search Google without opening a browser, play music or open your favourite IDE with a voice command, and many other everyday tasks are easier to do. In the current scenario, technological advancements have made it possible for humans to perform any task with human efficiency, or perhaps even more. By doing this project, we realised that the concept of artificial intelligence reduces human effort and saves time in all fields. Voice assistants use artificial intelligence, so the results they provide are very accurate and efficient. Assistants help reduce human effort and save time while doing anything. They completely remove the concept of typing and act like another person we are talking to or ask them to do something. Assistants are not worse than human assistants, but they can do everything more effectively and efficiently. The libraries and packages used to create this assistant focus on time complexity and time saving. A virtual assistant is usually a cloud-based application that requires an internet-connected device or application to function. The technology that powers virtual assistants requires extensive knowledge to power not only the platform, but also the areas of machine learning, verbal communication processes, and speech recognition. A virtual assistant is a software program that helps make everyday tasks easier, such as checking the weather forecast, creating reminders, or creating a shopping list.
They can take commands through text input or by voice. Voice-based intelligent assistants need an invoking word or wakeup word to activate the assistant, followed by the command. Today we have so many voice assistants, like Apple’s Siri, Amazon’s Alexa, and Microsoft’s Cortana. For this project, the wake word was chosen as Hello Mark.
A. Objectives
B. Purpose
The purpose of virtual assistants is to be able to interact with voice, play music, make to-do lists, play audiobooks, and tell us news such as weather, work, sports, etc. while providing us with real time information. Virtual assistants allow users to use voice commands to control devices and applications. Millennial consumers in particular are showing increased awareness and increased comfort towards this technology. In this ever-evolving digital world, which is constantly optimised for speed, efficiency, and convenience, it is a fact that we are moving towards less screen interaction.
II. LITERATURE REVIEW
In today's world, machines are trained to think like humans and perform tasks on their own, replacing what humans can do. Based on this situation, the concept of voice assistants, which can perform various human tasks based on the human voice, was born. A virtual assistant can filter the voice commands given by the user and return relevant information. People around the world are transforming their digital experiences with future technologies such as virtual reality, augmented reality, and voice interaction. Voice assistants are emerging as a new evolution in human-machine interaction, where analog signals are converted into digital waves by audio signals. Over the past few years, smartphone usage has grown significantly, leading to the widespread use of voice assistants such as Apple's Siri, Google Assistant, Microsoft's Cortana, and Amazon's Alexa. Voice assistants are built using technologies such as speech recognition, text-to-speech, and natural language processing (NLP) that offer unlimited applications to make users' lives easier and more convenient.
Voice assistants provide many services to satisfy their users, such as:
The capabilities of voice assistant are expanding according to the needs of the user.
According to Deepak Shende, Ria Umabiya, AIVA (Microsoft, Google's Google Assistant and the smartest assistant named "AIVA" 2018) aims to create a voice assistant that can do many things, such as searching the Internet. It has new features such as commenting on social media such as Facebook and Twitter. With just few simple commands you can learn about the weather around you and get information about the weather in your area.
Tulshan explains that user's fingers can be injured due to constant typing. In order to avoid such problems, we need to create a system that allows us to do everything with voice commands. The speech will be recognized by the system. The recognition words will be compiled, if necessary are clarified, then printed on the screen, and after this again the recognized word will be matched with a specific keyword and if the match is found, then program will be compiled and executed.
Dr. Kshama V. Kulhalli presents research on voice assistants such as Google Assistant, Apple's Siri and Microsoft's Cortana. From this research, it was concluded that Google Assistant's answers are more accurate than others because it can easily understand the sound change.
III. METHODOLOGY
A. Existing System
We are familiar with many existing voice assistants like Alexa, Siri, Google Assistant, and Cortana, which use concepts of language processing and voice recognition. They listen to the commands given by the user and, as per their requirements, perform that specific function in a very efficient and effective manner. As these voice assistants use artificial intelligence, the results that they are providing are highly accurate and efficient. These assistants can help reduce human effort and consume less time while performing any task; they have removed the concept of typing completely and behave as another individual to whom we are talking and asking to perform tasks. These assistants are no less than human assistants, but we can say that they are more effective and efficient at performing any task. The algorithm used to make these assistants focus on the time complexities and reduce time, but for using these assistants, one should have an account (like a Google account for Google Assistant or a Microsoft account for Cortana) and can use it with an internet connection only because these assistants are going to work with internet connectivity. They are integrated with many devices like phones, laptops, speakers, etc.
B. Proporsed System
Developing this assistant was an interesting task. This virtual assistant makes it easier to send an email without typing a word, search Google without opening a browser, play music, or open your favourite application with a voice command. Many other tasks can be done with ease using this virtual assistant with just a voice command. This virtual assistant differs from other virtual assistants because it is specific to desktops and does not require creating an account to use it, nor does it require an Internet connection to receive instructions for doing certain tasks. The IDE used for this project is Visual Studio Code. All Python files are written in Visual Studio code, and all required packages can be easily installed in this IDE. We created a live GUI to interact with this virtual assistant and gave it an interesting look and design during conversations. Advancements allow virtual assistants to do anything as efficiently or even more efficiently than we do. By making this project, we realized that the concept of artificial intelligence reduces human effort and saves time in all fields. The features of this project are:
IV. SYSTEM DESIGN
A. Data Flow
The data flow of Virtual Assistant is as follows:
Originally, the system was in idle mode. As soon as it receives any command, it begins to execute. The received command is recognised, whether it's a questionnaire or a task to be performed. Specific action is taken accordingly. When a question is being answered or a task is being performed, the system waits for another command. This circles unless it receives a quit command.
D. Sequence Diagram
The end user sends a command to the voice assistant in audio form. The command is passed to the interpreter of the system. It identifies what the end user has asked for and directs it to the task execution function. If the command is incorrect or missing some information, the voice assistant asks the end user about it. The received information is then transferred back to the task, and it's fulfilled. After execution, feedback is transferred back to the end user.
This sequence diagram describes the sequence of interactions that happens in virtual assistant.
V. SOFTWARE DETAILS
The IDE used for this project is Visual Studio Code. All Python files are written with Visual Studio code, and all required packages can be easily installed in this IDE. Modules and libraries such as pyttsx3, SpeechRecognition, Datetime, Wikipedia, keyboard, pywhatkit, pyjokes, PyPDF2, pyautogui, and PyQt are used in this project. A live GUI is created for interacting with the virtual assistant, as it gives the conversation a unique and interesting look.
A. Visual Studio Code
It is an IDE, i.e., Integrated Development Environment, which has many features like supporting scientific tools (like Matplotlib, NumPy, and SciPy), web frameworks (example: Django, web2py, and Flask), refactoring in Python, an integrated Python debugger, code completion, code and project navigation, etc.
VI. IMPLEMENTATION WORK DETAILS
Virtual Assistant is a desktop voice assistant that can perform many daily tasks on the desktop, like playing music or opening your favourite IDE, with the help of a single voice command. Virtual Assistant is different from other traditional voice assistants in terms of the fact that it is specific to desktops and the user does not need to make an account to use it; it does not require any internet connection while getting instructions to perform any specific task.
A. Real Life Application
B. Data Implementation
As the first step, we will install all the necessary libraries and packages. The command used to install the libraries is "pip install," and then import them. The necessary packages included are as follows:
C. Functions
VII. RESULT
This part of the research report is a brief description of the results of our project. We chose Python as the preferred programming language for our project. We are focusing on activities performed by voice assistants. The main reason to use Python in this project is its robust standard libraries.
Following are some screenshots of the output that our virtual assistant gives on executing the following commands:
VIII. FUTURE SCOPE
The following are some of the places that may be relevant for the implementation of virtual assistants in the future:
a. Organisational Inquiry Desk: The system may be utilised in different organisations for simple access to information about the organisation using voice commands.
b. Embedded Systems: In embedded systems, voice commands may be used to handle multiple activities using speech recognition technology. This promotes the automation of labour and can thus be very advantageous in industrial process automation.
c. Application for People with Disabilities: People with disabilities may also benefit from voice recognition software. It is particularly beneficial for those who are unable to use their hands.
We\'ve covered Python-based personal virtual assistants for Windows in this research report. Human’s lives are made simpler by virtual assistants. Using a virtual assistant gives us the ability to use the services with just a single voice command. Python is used to create this virtual assistant for all Windows desktops, which is similar to Alexa, Cortana, Siri, and Google Assistant, which are available on smartphones. Artificial intelligence is used in this project, and virtual personal assistants are an excellent method to keep track of your calendar because of their portability, accuracy, and availability at any moment. Virtual personal assistants are more dependable than human personal assistants. Our virtual assistant will get to know you better and be able to provide ideas and follow orders. This device will most likely be with us for the rest of our lives. It is possible to enhance education by using immersive technology. Voice assistants may help students study in new and innovative ways. This article contains studies on the use of AI voice assistants in day-to-day life. Not much research has been done on voice assistants, but that\'s about to change. Based on this research, new discoveries can be made in the future. The next few years will be all about audio devices such as smart speakers and virtual assistants. Exactly how they will achieve success in the classroom remains a mystery. So not all voice assistants are bilingual, and this can be a problem. In addition, the lack of sufficient security and protection filters for voice assistants can be a bit of a problem. The use of these devices in the classroom can only be successful if instructors are given the proper training and incentives to do so. This system has adequate scope for modification in the future if it is necessary.
Applied Science & Engineering Technology (IJRASET), ISSN: 2321-9653, Vol.10, Issue VI, page no.3574-3575, June-2022 [2] Pooja C. Goutam, Monika S. Jalpure, Akshata S. Gavade, Pranjali Chaudhary, Prof. A.V. Gundavade, “VOICE ASSISTANT USING PYTHON”, International Journal of Creative Research Thoughts (IJCRT), ISSN: 2320-2882, Vol.10, Issue 6, page no.c802-c803, June-2022 [3] Abeed Sayyed , Ashpak Shaikh , Ashish Sancheti , Swikar Sangamnere , Prof. Jayant H Bhangale, “Desktop Assistant AI Using Python”, International Journal of Advanced Research in Science, Communication and Technology (IJARSCT), ISSN (Online) 2581-9429, Vol.6, Issue 2, page no.1330-1333, June-2021 [4] www.stackoverflow.com [5] www.pythonprogramming.net [6] www.codecademy.com [7] www.tutorialspoint.com [8] www.google.co.in
Copyright © 2023 Atish Patil, Madhuri Kardule, Praveen Gupta. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET54064
Publish Date : 2023-06-14
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here