Speak To Classify Your Notes

Authors: Shreya Jagtap, Neha Kanekar, Zainab Mister, Poonam Narkhede

DOI Link: https://doi.org/10.22214/ijraset.2022.41098

Abstract

Over the past few decades, note-taking, once used mostly in school and at university, has become part of everyday life after university as well. Writing by hand can be time-consuming, especially in a fast lecture. Typing takes the least amount of time, so a page can hold more information to be reviewed later. The proposed system consists of two phases, speech-to-text conversion and classification, and both play an important role. Speech-to-text conversion facilitates the integration of people with hearing impairments into oral communication settings. However, transforming speech into written language in real time requires specific techniques, as it must be very fast and almost 80% correct to be understood. In machine learning, text classification is the process of assigning predefined categories to open-ended text. Text classifiers can organize, structure, and categorize almost any kind of text.

Introduction

I. INTRODUCTION

Machine learning is an application of artificial intelligence. It teaches a computer to do what comes naturally to humans: learn from experience. A person can use speech recognition to convert speech to text in real time by repeating what was originally spoken. The reason for re-speaking is mainly to include punctuation and speaker identification, but also to adapt the language to the proficiency of the audience. Other than intensive and continuous training of the speech recognition engine, no special training is needed. A sound-shielded environment is beneficial. Operating a speech recognition system does not require any special training. Still, linguistic knowledge is necessary for chunking the words and adapting the wording. The text classification process involves dividing input documents into two or more classes, where each document can be assigned to one or more categories. In classification systems, groups of words or terms are collected together and organized. Cosine similarity measures the similarity between two vectors of an inner product space. A document can be represented by thousands of attributes, each recording the frequency of a particular word (such as a keyword) or phrase in the document. Therefore, each document is an object represented by what is called a term-frequency vector.
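To make the idea of a term-frequency vector concrete, the following is a minimal Python sketch. The function name `term_frequency_vector` and the sample sentence are illustrative assumptions, not part of the described system, and splitting on whitespace is a simplification.

```python
from collections import Counter


def term_frequency_vector(document: str) -> Counter:
    """Map each lowercase word in the document to its number of occurrences."""
    # Whitespace splitting is a simplification; punctuation is not handled here.
    return Counter(document.lower().split())


# Hypothetical sample note.
vector = term_frequency_vector("the cell is the basic unit of life")
print(vector["the"])   # 2
print(vector["cell"])  # 1
print(vector["atom"])  # 0, because a Counter returns 0 for words it never saw
```

Each distinct word acts as one attribute of the document, so two documents can be compared by comparing their counts word by word.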

II. LITERATURE SURVEY

From "A Review on Speech to Text Conversion Methods" by Miss. Prachi Khilari and Prof. Bhope V. P., we conclude that after the user speaks through the microphone, the application converts the speech to text and saves that text in a file [1].

From "Intralingual speech to text conversion in real time: Challenges and Opportunities" by Susanne Wagner, we conclude that speech is recognized by the application using a speech recognition technique [2].

In "Journal of Speech to Text Conversion" by Dhanush Kumar S, Lavanya S, Madhumita G, and Mercy Rajaselvi V, the system first recognizes the person in front of it, which ensures integrity and prevents identity theft, and then verifies the date and the subject of the examination by voicing them out [3].

In "Towards speech to text translation without speech recognition" by Sameer Bansal, Herman Kamper, Adam Lopez, and Sharon Goldwater, UTD, the MT model, and ZRToolKit are explained in detail [4].

From "Text Classification through Statistical and Machine Learning Methods: A Survey" by Krina Vasa [5], "Text Classification Techniques: A Literature Review" by M. Thangaraj and M. Sivakami [6], and "Text Classification Using Machine Learning Techniques" by M. Ikonomakis, S. Kotsiantis, and V. Tampakas [7], we conclude that the text files can be classified using the classification techniques explained in these papers.

III. PROBLEM DEFINITION

The system is designed based on the fact that people prefer digital note-taking. Nowadays, the use of computers is replacing the traditional pencil-and-paper methodology. The system is an early attempt to carry the relations between physical documents over into the digital world. Our proposed system will identify words spoken aloud, convert them into readable text using automatic speech recognition, and save them. Automatic speech recognition (ASR) technologies today can correctly recognize and write down more than 90 percent of a long series of spoken words.
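The capture-and-save step described above might look roughly like the following Python sketch. It assumes the third-party SpeechRecognition package (imported as `speech_recognition`) with PyAudio installed for microphone access, plus a working internet connection for the Google Web Speech API. The function names `transcribe_from_microphone` and `save_transcript` and the file name used below are hypothetical, not taken from the authors' implementation.

```python
from pathlib import Path


def transcribe_from_microphone() -> str:
    """Listen for one utterance on the default microphone and return its text."""
    # Imported here so that save_transcript remains usable on machines
    # where SpeechRecognition and PyAudio are not installed.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        # Sample the background noise briefly so that quiet speech is not missed.
        recognizer.adjust_for_ambient_noise(source)
        audio = recognizer.listen(source)
    try:
        # Sends the audio to the Google Web Speech API using the library's default key.
        return recognizer.recognize_google(audio)
    except sr.UnknownValueError:
        # The speech could not be understood.
        return ""
    except sr.RequestError as error:
        # The service was unreachable, for example because there is no internet connection.
        raise RuntimeError(f"Speech service unavailable: {error}") from error


def save_transcript(text: str, path: str) -> Path:
    """Write the recognized text to a UTF-8 text file and return its path."""
    target = Path(path)
    target.write_text(text, encoding="utf-8")
    return target
```

Under these assumptions, calling `save_transcript(transcribe_from_microphone(), "note.txt")` would record one utterance and store it as a text file for later classification.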

Our system will also classify unstructured text into files. Automated text classification is considered a vital method for managing and processing the vast amount of documents in digital form, which are widespread and continuously increasing. The role of automated text classification is to assign documents to predefined categories, generally by applying machine learning algorithms. With the rapid growth of information, text classification has become a vital technique for handling and organizing text data.

IV. PROPOSED SYSTEM

With reference to Figure 1, the proposed system works as follows:

1. First, speech recognition allows the machine to catch the words, phrases, and sentences we speak. Second, natural language processing allows the machine to understand what we are saying. Third, speech synthesis allows the machine to speak. This section focuses on speech recognition, the process of understanding the words spoken by human beings. PyAudio is used to capture the speech signals. It is needed for live microphone input, which then has to be understood by the system.
2. The SpeechRecognition library acts as a wrapper for several popular speech APIs and is therefore extremely flexible. One of these, the Google Web Speech API, supports a default API key that is hard-coded into the SpeechRecognition library. That means you can get started without having to sign up for a service.
3. Speech-to-text translation is done with the help of Google Speech Recognition. A working internet connection is required. There are certain offline recognition systems, such as PocketSphinx, but they have a very rigorous installation process that requires several dependencies. Few speech recognition programs are as easy to use as Google Speech Recognition. The text is then saved in the form of a text file.
4. Remove stopwords: Removing stopwords is one of the important steps in text classification. It removes unnecessary words from the documents in order to give more focus to the important information. It is also used to extract the unique words from the documents.
5. Tokenize: Tokenization is the process of splitting a string of text into a list of tokens.
6. Convert documents to vectors: In order to compute cosine similarity on documents, we need to transform the documents into vector representations so that we can apply numeric machine learning operations. This step converts each sentence into a dictionary holding the count of each word. For word counting we used Counter from the collections module to build the dictionary of word counts.
7. Cosine similarity measure: Cosine similarity measures the similarity between two vectors of an inner product space. It is frequently used in natural language processing to measure document similarity irrespective of document size. If the cosine similarity score is 1, the two vectors have the same orientation. A value near 0 indicates that the two documents have low similarity.

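Formula:

$$\cos\theta = \frac{A \cdot B}{\lVert A \rVert \, \lVert B \rVert} = \frac{\sum_{i=1}^{n} A_i B_i}{\sqrt{\sum_{i=1}^{n} A_i^2} \, \sqrt{\sum_{i=1}^{n} B_i^2}}$$

Here A and B are the term-frequency vectors of the two documents being compared, and n is the number of distinct terms.

Using this formula, we calculated the similarity value for each subject.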

8. Classification: Documents are classified based on the similarity value calculated using cosine similarity.
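Steps 4 to 8 above can be sketched in Python as follows. This is a minimal illustration under stated assumptions rather than the authors' code: the stopword list is a tiny hand-written sample, the `SUBJECTS` reference texts are hypothetical, and all function names are invented for the example. Only the use of `Counter` from the `collections` module comes from the description above. For brevity the sketch tokenizes first and filters stopwords afterwards.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (assumption; a real system would use a fuller list).
STOPWORDS = {"a", "an", "and", "are", "in", "is", "of", "the", "to"}

# Hypothetical reference text for each subject.
SUBJECTS = {
    "biology": "cell organism gene protein cell membrane",
    "physics": "force energy mass velocity motion energy",
}


def tokenize(text: str) -> list[str]:
    """Split text into lowercase alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def to_vector(text: str) -> Counter:
    """Build a word-count vector, skipping stopwords."""
    return Counter(token for token in tokenize(text) if token not in STOPWORDS)


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two word-count vectors."""
    # Only words present in both vectors contribute to the dot product.
    dot = sum(a[word] * b[word] for word in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is similar to nothing
    return dot / (norm_a * norm_b)


def classify(note: str, subjects: dict[str, str] = SUBJECTS) -> tuple[str, dict[str, float]]:
    """Return the subject with the highest similarity, plus all scores."""
    note_vector = to_vector(note)
    scores = {
        name: cosine_similarity(note_vector, to_vector(reference))
        for name, reference in subjects.items()
    }
    return max(scores, key=scores.get), scores


best, scores = classify("The cell membrane protects the cell")
print(best)  # biology
```

The note shares the words "cell" and "membrane" with the biology reference and no words with the physics reference, so biology receives the only non-zero score. If every score were zero, `max` would simply return the first subject, so a practical system would probably want a minimum similarity threshold.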

V. RESULTS

With reference to Figure 1, the home screen has four buttons: Microphone, View, List of Subjects, and Classify the Subject.

VI. ACKNOWLEDGMENT

We sincerely wish to thank our project guide, Prof. Poonam Narkhede, whose encouraging and inspiring guidance helped us make our project a success. Her expert guidance, kind advice, and timely motivation kept us going and helped us complete our project. We would like to thank our project coordinator, Prof. Reena Deshmukh, for all the support she gave us. We also express our deepest thanks to our HOD, Dr. Uttara Gogate, whose benevolence made the computer facilities in our laboratory available to us and helped make the project a true success. Without this kind and keen cooperation, our project would have come to a standstill. Lastly, we would like to thank our college principal, Dr. P. R. Rodge, for providing lab facilities and permitting us to go on with our project. We would also like to thank our colleagues who helped us directly or indirectly during our project.

Conclusion

The proposed system will make use of various deep learning and machine learning algorithms. In the existing system, the speaker has to train the speech recognition system in advance with her voice and speaking characteristics. It also requires an SD card to store the text file. The aim of our project is to implement the speech recognition system without an SD card, using the Google speech recognition API. The system takes input from the microphone in the form of voice. Once the voice is recognized properly, the system pre-processes the data and converts it into text, which is displayed on screen. We will then try to classify the unstructured data into two or more classes.

References

[1] Miss. Prachi Khilari and Prof. Bhope V. P., http://www.ijarcet.org/wp/content/uploads/IJARTCET-vol 4/issue/7/3067/3072.pdf
[2] Susanne Wagner, https://www.researchgate.net/publication/283123585_Intralingual_speech-to-text-conversion_in_real-time_Challenges_and_Opportunities
[3] Dhanush Kumar S, Lavanya S, Madhumita G, Mercy Rajaselvi V, https://www.ijariit.com/manuscript/journal-on-speech-to-text-conversion/
[4] Sameer Bansal, Herman Kamper, Adam Lopez, Sharon Goldwater, https://www.researchgate.net/publication/313671310_towards_speech-to-text_translation_without_speech_recognition
[5] Krina Vasa, http://www.ijedr.org/papers/IJEDR1602114.pdf
[6] M. Thangaraj and M. Sivakami, https://www.ijikm.org/Volume13/IJIKMv13p117-135Thangaraj3803.pdf
[7] M. Ikonomakis, S. Kotsiantis, V. Tampakas, https://www.researchgate.net/publication/228084521_Text_Classification_Using_Machine_Learning_Techniques
[8] Challenges, https://www.researchgate.net/publication/283016320_Challenges_of_Digital_Note_Taking
[9] Machine learning: Cosine similarity, https://medium.com/

Copyright

Copyright © 2022 Shreya Jagtap, Neha Kanekar, Zainab Mister, Poonam Narkhede. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Paper Id : IJRASET41098

Publish Date : 2022-03-30

ISSN : 2321-9653

Publisher Name : IJRASET
