Personal Voice Assistant

Authors: Siddhi Gupta, Devanshi Joshi, Krutika Kolpate, Dhruv Sharma, Mrs. Nilam Parmar

DOI Link: https://doi.org/10.22214/ijraset.2022.42730

Abstract

In today’s modern world, our daily life has been more linked with technology. Over time the advancement in technology is unmatched. The project that we have built is a personal voice assistant. It works on voice input and displays the output through voice or text. We have used python language for the voice assistant as it provides various libraries and packages to make our project. The assistant can perform some basic functions like giving weather updates, providing time and news, opening Google, YouTube, Gmail etc., setting reminder and performing arithmetic calculations. The main aim behind the voice assistant is to reduce the usage of input devices like keyboard, mouse, etc. This will not only lessen the hardware cost of the devices but also the space taken by them. Voice assistants are efficient and consume less time and therefore they are now inbuilt in many devices that people use nowadays.

Introduction

I. INTRODUCTION

Call Digitization offers new possibilities to facilitate the activities of our daily lives through assistive technology. This is the new way to connect with technology. Currently, the voice assistant is very useful for a person. The voice assistant takes less time. With the help of the voice assistant, we can participate in other works and save our time. Voice assistants are the great innovation that can change people's lives in other ways. Voice assistant was first introduced on smartphones and gained popularity on the back of its popularity. it was widely accepted by all. It can be conveniently used in all age groups.

Speech recognition is the process of converting speech into text. It is typically used by voice assistants such as Alexa and Siri. Python provides an API called Speech Recognition. You can use it to convert audio to text for further processing. Python's Speech Recognition API allows you to convert large or long audio files to text. User can give the commands in verbal and written form as well. The user can open an application (if installed on your system), search for queries on Google, Wikipedia, YouTube, or calculate mathematical questions. Just give a voice command. Used the Google Speech Recognition API and Google Text-to-Speech for voice input and output, respectively. In addition, you can use the Wolfram Alpha API to calculate formulas.

II. PROPOSED METHODOLOGY

Voice assistants use a combination of technologies such Natural language processing (NLP), artificial intelligence (AI) and speech recognition. By combining these technologies, technology can understand, answer and improve questions. It also works online in offline mode. System app runs offline mode, while web-based operations run online mode. The data is stored within the app itself, rather than the cloud which reduces time and space complexity. It even reduces the economic cost due to reduce high data usage plans.

Package required in this system to create a personal voice assistant using the pip command, the following packages must be installed on the system:

Speech Recognition: Speech recognition, or speech-to-text, is an ability which enables a program to process human speech into a written format.
Pyttsx3: pyttsx3 is a text-to-speech conversion library in Python.
Wikipedia: Wikipedia is a Python library that makes it easy to access and parse data from Wikipedia.
Capture: This module is used to capture pictures with the camera.
OS: The OS module in Python provides functions for interacting with the operating system. This module provides a portable way of using operating system dependent functionality.
Time: This module provides various time-related functions.
Web Browser: The web browser module provides a high-level interface to allow displaying web-based documents to users.
Subprocess: This module is used for getting system subprocess details which are used in various commands i.e., Shutdown, Sleep, etc.
JSON: JavaScript Object Notation (JSON) is a standardized format commonly used to transfer data as text that can be sent over a network. It’s used by lots of APIs and Databases, and it’s easy for both humans and machines to read.
Requests: The requests module allows you to send HTTP requests using Python. The HTTP request returns a Response Object with all the response data (content, encoding, status, etc).
Wolfram Alpha: Wolfram Alpha is a computational knowledge engine and answer engine. It answers factual queries directly by computing the answer from externally sourced data.

III. WORKING MODEL

The assistant first waits for the input provided by the user. After receiving the voice command, the assistant picks it up and finds for the keyword in the input command. If the assistant searches the keyword successfully it will perform the given task and return the result to the user via voice.

Assistant Taking Voice Command

2. Assistant Providing Weather Updates

3. Assistant opening YouTube

4. Assistant Providing News

5. Assistant Opening Map

6. Assistant Performing Arithmetic Operations

7. Assistant Opening Gmail

8. Assistant Opening Wikipedia

9. Assistant Opening Web Browser

Conclusion

Voice assistant completes the task given by the user making human life easy. This paper provides the abstract of design and development of personal voice assistant using python. The voice assistant includes the procedure: collection of data in the voice input -> analysing the voice and converting it into text -> processing the data -> providing the output. This system makes life easier for people who are physically disabled. The Voice assistant is less time consuming, easy to use and performs the task given by the user.

References

[1] https://en.wikipedia.org/wiki/Virtual_assistant [2] https://www.linkedin.com/pulse/what-advantages-voice-assistants-sylwia-ilona-borycka?trk=public_profile_article_view [3] https://alan.app/blog/voiceassistant-2/ [4] https://www.miquido.com/blog/what-are-voice-assistants/ [5] https://masterofcode.com/blog/9-key-predictions-for-the-future-of-voice-assistants [6] https://www.ibm.com/in-en/cloud/learn/speech-recognition [7] https://www.cs.mcgill.ca/~hv/classes/MS/TkinterPres/ [8] https://www.geeksforgeeks.org/python-pil-image-open-method/ [9] https://pythonprogramminglanguage.com/how-to-install-modules/#:~:text=You%20can%20install%20modules%20or,it%20will%20install%20the%20module.&text=That%20will%20install%20a%20Python%20module%20automatically.

Copyright

Copyright © 2022 Siddhi Gupta, Devanshi Joshi, Krutika Kolpate, Dhruv Sharma, Mrs. Nilam Parmar. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download Paper

Paper Id : IJRASET42730

Publish Date : 2022-05-15

ISSN : 2321-9653

Publisher Name : IJRASET

DOI Link : Click Here