Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Abhiram J, Amrutha S, Aneetta Susan John, Jinshu Maria John, Midhun V Nair, Alpha Mathew
DOI Link: https://doi.org/10.22214/ijraset.2022.43538
The inception of the Internet has caused a dramatic revolution in many fields. As a global computer network, the Internet has made people's lives easier by letting them access the information they want more efficiently. Communication is one of the main fields it has revolutionized: integrating communication technologies with the Internet has made communicating far easier. Email is considered one of the most reliable forms of Internet communication for sending and receiving important information. Visually challenged people find these technologies difficult to use because they require visual perception. Around 250 million people in the world are unaware of how to use the Internet or email. At present, a visually impaired person can use an email application only with the help of a third person who sends mail on their behalf, which guarantees neither privacy nor security. This gave rise to the idea of a voice-based email system that requires little training. It makes use of mouse operations and speech recognition, and it can be used by visually impaired and sighted users alike.
Index Terms: Feature extraction, MFCC, GMM, Speech recognition, Google API
I. INTRODUCTION
Voice-based email for the visually challenged is a technology of considerable significance for an increasingly digital world. We are developing a voice-based email system that helps visually impaired people who are naive to computer systems use email facilities more securely and efficiently. The system can be used easily by users of any age group. It provides speech-to-text conversion and text-to-speech output through a speech reader, which makes the system easier for a visually impaired person to handle. It is a web-based application that makes use of IVR (Interactive Voice Response), enabling users to manage their mail accounts by voice alone and to read, send, and perform other useful mail activities. The system prompts the user by voice to perform a certain action, and the user responds to the prompt. The main advantage of this system is that the use of the keyboard is eliminated: the user responds through voice and mouse clicks only. The user also need not worry about which mouse click to perform to reach a given service, because the system itself announces which click provides which operation.
II. OBJECTIVES AND SCOPES
This system is intended to help visually challenged people access mail services without the help of a third person. One of its main objectives is to provide more privacy. The system also does not require a keyboard; instead, it works only on mouse operations and speech-to-text conversion. This project is proposed for the betterment of society.
One of the major issues that visually impaired people face with current mail systems is a lack of privacy, because they need the support of a third person to use them. An ideal solution to this problem is a voice-based email system that visually impaired people can use without a third person's help. The proposed system makes use of the Google API and a Gaussian Mixture Model (GMM) for feature extraction and speech recognition.
III. PROPOSED METHOD
The proposed system completely eliminates the use of the keyboard and is based on mouse clicks and speech recognition. The user is first asked to log in by providing login credentials. The details are encrypted and checked for validity; if they are valid, the user is redirected to the dashboard. The dashboard is the main page, where the system provides services such as Compose, Inbox, and Trash. The system prompts the user by voice to perform a certain action, and the user responds to the prompt. To compose a mail, the user's spoken input is converted to text and sent to the recipient. Similarly, for all the other services, the user is prompted by voice.
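The prompt-and-respond loop described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the names `SERVICES`, `match_service`, and `choose_service` are hypothetical, and `speak` and `listen` stand in for whatever text-to-speech and speech-recognition calls the real system uses.

```python
from typing import Callable, Optional

# Hypothetical service keywords, based on the dashboard options named in the text.
SERVICES = ("compose", "inbox", "trash", "logout")


def match_service(transcript: str) -> Optional[str]:
    """Return the first service keyword found in the recognized text, if any."""
    for word in transcript.lower().split():
        cleaned = word.strip(".,!?")
        if cleaned in SERVICES:
            return cleaned
    return None


def choose_service(
    speak: Callable[[str], None],
    listen: Callable[[], str],
    max_turns: int = 3,
) -> Optional[str]:
    """Prompt by voice until a known service is heard or the turns run out."""
    for _ in range(max_turns):
        speak("Say compose, inbox, trash or logout.")
        service = match_service(listen())
        if service is not None:
            return service
        speak("Sorry, I did not catch that.")
    return None
```

Passing `speak` and `listen` as parameters keeps the dialogue logic independent of any particular speech engine, so it can be exercised with plain lists of strings.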
IV. SYSTEM DESCRIPTION
A. Architecture
The system begins with the registration of a new user, in which details such as name, mail id, and training model are entered. Registration is done by an admin. The entered information is stored in the database and fetched from it whenever needed. Already registered users can log in directly by speaking their email id. We use the Google API for speech recognition. If the system finds the user to be valid, he/she is directed to the dashboard, where the email services can be accessed.
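As an illustration of the store-then-fetch step, the sketch below keeps registration records in SQLite using Python's standard `sqlite3` module. The table layout, the column names, and the idea of storing a path to the trained voice model are assumptions made for the example; the paper does not specify its database schema.

```python
import sqlite3


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open the user store and create the (assumed) users table if missing."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS users ("
        " mail_id TEXT PRIMARY KEY,"
        " name TEXT NOT NULL,"
        " model_path TEXT)"
    )
    return conn


def register_user(conn, mail_id, name, model_path):
    """Admin-side registration: store one user record."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO users (mail_id, name, model_path) VALUES (?, ?, ?)",
            (mail_id.lower(), name, model_path),
        )


def find_user(conn, mail_id):
    """Login-side lookup: return (name, model_path) or None if unknown."""
    return conn.execute(
        "SELECT name, model_path FROM users WHERE mail_id = ?",
        (mail_id.lower(),),
    ).fetchone()
```

Mail ids are lower-cased on both write and read so that a spoken id matches regardless of how the recognizer capitalizes it.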
B. System modules
The system consists mainly of a Registration module, a Login module, and a Dashboard module.
1. Implementation of Registration Module: New user registration is done in this module by an admin, who enters details such as name, mail id, and gender. The entered information is stored in the database, which the admin can monitor. Along with registration, feature extraction and training are also performed on the data set.
2. Implementation of Login Module: After registration, the user can log in to the system through the Login module. Here, the user is prompted to speak the mail id. If the mail id is valid, the user is asked for confirmation. After confirmation, the system asks the user to say "password". If the voice matches the trained data set, the user is directed to the dashboard.
3. Implementation of Dashboard Module: After a successful login, the user enters the Dashboard module. It offers five main services, which can be accessed either by mouse clicks or by voice commands.
a. Compose: The user is asked to speak the recipient mail id, the mail subject, and the content to be composed. After each entry, the system asks for confirmation. After confirmation, the mail is sent to the desired recipient.
b. Inbox: The user can check all unread mails and recently received mails.
c. Read Messages from an Email Id: The user can search the inbox for specific mails. The user is asked to speak a mail id, and the mails from that mail id can then be accessed by voice.
d. Delete Mails: The user can delete unnecessary mails from the inbox. Mails from a specific sender can be found by saying the mail id and can then be deleted.
e. Logout: The user can log out of the system by selecting the logout option.
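The Compose flow, with a confirmation after each entry, might look like the sketch below. It is an assumption-laden illustration rather than the system's code: `spoken_to_mail_id`, `ask_until_confirmed`, and `compose_by_voice` are hypothetical helpers, the convention of saying "at" and "dot" for `@` and `.` is assumed, and `speak`/`listen` stand in for the real speech calls. Only the standard `email.message.EmailMessage` class is used to build the message.

```python
import re
from email.message import EmailMessage

# Assumed spoken forms of the symbols in a mail id.
SPOKEN_SYMBOLS = {"at": "@", "dot": ".", "underscore": "_", "dash": "-"}
MAIL_ID = re.compile(r"^[a-z0-9._-]+@[a-z0-9-]+(\.[a-z0-9-]+)+$")


def spoken_to_mail_id(transcript):
    """Turn e.g. 'bob at example dot com' into 'bob@example.com', or None."""
    tokens = transcript.lower().split()
    joined = "".join(SPOKEN_SYMBOLS.get(token, token) for token in tokens)
    return joined if MAIL_ID.match(joined) else None


def ask_until_confirmed(field, speak, listen, parse=str.strip, max_tries=3):
    """Ask for one field and read it back until the user says 'yes'."""
    for _ in range(max_tries):
        speak(f"Please say the {field}.")
        value = parse(listen())
        if not value:
            speak("Sorry, I did not understand that.")
            continue
        speak(f"You said {value}. Say yes to confirm.")
        if listen().strip().lower() == "yes":
            return value
    return None


def compose_by_voice(sender, speak, listen):
    """Collect recipient, subject and content by voice; return a message or None."""
    recipient = ask_until_confirmed("recipient mail id", speak, listen, spoken_to_mail_id)
    subject = recipient and ask_until_confirmed("subject", speak, listen)
    content = subject and ask_until_confirmed("content", speak, listen)
    if not content:
        return None
    message = EmailMessage()
    message["From"] = sender
    message["To"] = recipient
    message["Subject"] = subject
    message.set_content(content)
    return message
```

The returned message could then be handed to a mail transport such as `smtplib.SMTP.send_message`; that step is left out here because the paper does not say how mail is delivered.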
V. MODELS USED
A. Mel Frequency Cepstral Coefficient
Mel Frequency Cepstral Coefficients (MFCCs) are the coefficients that collectively make up a mel-frequency cepstrum (MFC). They are derived from a type of cepstral representation of the audio clip (a nonlinear "spectrum-of-a-spectrum"). The difference between the cepstrum and the mel-frequency cepstrum is that, in the MFC, the frequency bands are equally spaced on the mel scale, which approximates the human auditory system's response more closely than the linearly spaced frequency bands used in the normal spectrum. This frequency warping allows a better representation of sound, for example in audio compression.
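For reference, one widely used formula for converting a frequency f in hertz to mels, together with its inverse, is given below. The paper does not state which variant of the mel scale it uses, so the constants 2595 and 700 are an assumption taken from common practice.

```latex
m(f) = 2595 \,\log_{10}\!\left(1 + \frac{f}{700}\right),
\qquad
f(m) = 700 \left(10^{\,m/2595} - 1\right)
```

Under this formula 1000 Hz maps to approximately 1000 mels, and equal steps in m correspond to progressively wider steps in f at higher frequencies.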
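MFCCs are commonly derived as follows:
1. Take the Fourier transform of a windowed excerpt of the signal.
2. Map the powers of the resulting spectrum onto the mel scale, using triangular overlapping windows.
3. Take the logarithm of the power at each mel frequency.
4. Take the discrete cosine transform of the list of mel log powers, as if it were a signal.
5. The MFCCs are the amplitudes of the resulting spectrum.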
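A minimal sketch of one common derivation for a single frame (window, Fourier transform, mel filterbank, logarithm, discrete cosine transform) is shown below in pure Python. The function names, the Hamming window, the 20 filters, the 13 coefficients, and the 2595/700 mel formula are all assumptions chosen for illustration; a production system would normally use an optimized signal-processing library and process many overlapping frames.

```python
import cmath
import math


def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def power_spectrum(frame):
    """Hamming-window the frame and return its one-sided power spectrum."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):  # naive DFT, fine for one short frame
        total = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(windowed))
        spectrum.append(abs(total) ** 2 / n)
    return spectrum


def mel_filterbank(num_filters, n_fft, sample_rate):
    """Triangular filters whose centers are equally spaced on the mel scale."""
    low, high = hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0)
    mels = [low + (high - low) * i / (num_filters + 1)
            for i in range(num_filters + 2)]
    bins = [int((n_fft + 1) * mel_to_hz(m) / sample_rate) for m in mels]
    bank = []
    for j in range(1, num_filters + 1):
        left, center, right = bins[j - 1], bins[j], bins[j + 1]
        row = [0.0] * (n_fft // 2 + 1)
        for k in range(left, center):   # rising edge
            row[k] = (k - left) / (center - left)
        for k in range(center, right):  # falling edge
            row[k] = (right - k) / (right - center)
        bank.append(row)
    return bank


def dct_ii(values, num_coeffs):
    """Unnormalized DCT-II, keeping the first num_coeffs coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n)
                for i, v in enumerate(values))
            for k in range(num_coeffs)]


def mfcc(frame, sample_rate, num_filters=20, num_coeffs=13):
    spectrum = power_spectrum(frame)
    bank = mel_filterbank(num_filters, len(frame), sample_rate)
    energies = [sum(w * p for w, p in zip(row, spectrum)) for row in bank]
    log_energies = [math.log(max(e, 1e-10)) for e in energies]  # avoid log(0)
    return dct_ii(log_energies, num_coeffs)
```

For example, `mfcc(frame, 8000)` on a 256-sample frame returns 13 coefficients; frames with different spectral content yield different coefficient vectors, which is what makes them usable as features for the recognizer.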
B. Gaussian Mixture Model
A Gaussian Mixture Model (GMM) is a machine learning model used for data clustering. In this system it groups voice data into categories based on the frequency or pitch of the user's voice. GMMs are trained mainly by unsupervised learning and are comparatively robust. A GMM is a probabilistic model that assumes all the data points are generated from a mixture of a finite number of Gaussian distributions with unknown parameters. One can think of mixture models as generalizing k-means clustering to incorporate information about the covariance structure of the data as well as the centers of the latent Gaussians. The parameters are fitted by maximum-likelihood estimation, typically with the expectation-maximization (EM) algorithm.
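To make the maximum-likelihood fitting concrete, the sketch below runs expectation-maximization on a one-dimensional mixture and then scores new samples against the fitted model. It is a simplification under stated assumptions: real voice models work on multi-dimensional MFCC vectors with covariance matrices, and the function names, the quantile initialization, and the fixed iteration count are illustrative choices, not details from the paper.

```python
import math


def gaussian_pdf(x, mean, var):
    return math.exp(-(x - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)


def fit_gmm_1d(data, k=2, iterations=50, min_var=1e-6):
    """Fit a k-component 1-D GMM with EM; returns (weights, means, variances)."""
    ordered = sorted(data)
    n = len(data)
    # Initialize means at evenly spaced quantiles, variances at the global variance.
    means = [ordered[int((i + 0.5) * n / k)] for i in range(k)]
    grand_mean = sum(data) / n
    grand_var = sum((x - grand_mean) ** 2 for x in data) / n
    variances = [max(grand_var, min_var)] * k
    weights = [1.0 / k] * k
    for _ in range(iterations):
        # E-step: responsibility of each component for each point.
        resp = []
        for x in data:
            joint = [w * gaussian_pdf(x, m, v)
                     for w, m, v in zip(weights, means, variances)]
            total = sum(joint) or 1e-300
            resp.append([j / total for j in joint])
        # M-step: re-estimate parameters from the weighted points.
        for c in range(k):
            mass = sum(r[c] for r in resp)
            if mass < 1e-12:
                continue  # leave an empty component unchanged
            weights[c] = mass / n
            means[c] = sum(r[c] * x for r, x in zip(resp, data)) / mass
            variances[c] = max(
                sum(r[c] * (x - means[c]) ** 2 for r, x in zip(resp, data)) / mass,
                min_var,
            )
    return weights, means, variances


def avg_log_likelihood(samples, model):
    """Average log-likelihood of samples under the model; higher means a better match."""
    weights, means, variances = model
    total = 0.0
    for x in samples:
        p = sum(w * gaussian_pdf(x, m, v)
                for w, m, v in zip(weights, means, variances))
        total += math.log(max(p, 1e-300))
    return total / len(samples)
```

In a voice login along these lines, a model would be fitted to each user's training features at registration, and a login attempt would be accepted only if its average log-likelihood under that user's model exceeded a chosen threshold.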
VI. SYSTEM REQUIREMENTS
A. Software Requirements
VII. RELEVANCE
A. It makes the lives of differently abled people easier.
B. It lets people with disabilities use email as independently as any other user.
C. The use of the keyboard is eliminated because, in this application, the user needs to respond only through voice and mouse clicks.
D. It provides more privacy.
VIII. FUTURE SCOPE
The voice-based email system for the visually challenged will make email easily accessible to visually challenged people. Privacy is the most important feature considered while developing this system. Both fully and partially blind people can use it. With the help of our system, visually challenged people will become independent, because they can use email services without the support of a third person. The system makes use of efficient voice input and mouse-click technology, which reduces the burden of accessing email. As blind people become capable of handling mail services on their own, they will be able to contribute to the growing digital world.
Copyright © 2022 Abhiram J, Amrutha S, Aneetta Susan John, Jinshu Maria John, Midhun V Nair, Alpha Mathew. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET43538
Publish Date : 2022-05-29
ISSN : 2321-9653
Publisher Name : IJRASET