Ijraset Journal For Research in Applied Science and Engineering Technology
Authors: Ritika Dhyani, Priyansh Vatsal, Priyanshu Goel, Prafull chauhan, Prince Chauhan, Pratham Chauhan
DOI Link: https://doi.org/10.22214/ijraset.2023.50890
Certificate: View Certificate
The classification of music by genre is crucial in the modern world since the number of music tracks, both online and offline, is growing quickly. We must appropriately index them in order to have greater access to them. To retrieve music from a vast collection, automatic music genre classification is crucial. The majority of the current methods for categorising music genres rely on machine learning. We give a music dataset with ten distinct genres in this article. The system is trained and classified using a Deep Learning technique. Convolution neural networks are employed in this instance for training and classification. For audio analysis, feature extraction is the most important step. For sound samples, the Mel Frequency Cepstral Coefficient (MFCC) is employed as a feature vector. The suggested technique uses feature vector extraction to categorise music into different genres. Our findings indicate that our system\'s accuracy level is approximately 76%, which will significantly increase and facilitate the automatic classification of musical genres.
I. INTRODUCTION
With the abundance of music at consumers' fingertips throughout the globe, there is a growing need for automatic classification of music for indexing of music and easier retrieval, which is frequently done manually by specialists in the field. In a nutshell, the issue statement for our project may be stated as follows: Given a number of audio recordings, the job is to classify each audio file into a specific category, such as audio that belongs to happy, sad, etc. Audio processing is one of the more difficult data science projects compared to image processing and other classification techniques.One such use is the classification of music genres, which seeks to place audio files in the appropriate sound groups to which they belong. Because classifying music manually requires listening to each song for the entirety, the application is crucial and needs automation to reduce manual error and time. Therefore, we will employ machine learning and deep learning techniques to automate the procedure.
In a nutshell, the issue statement for our project may be stated as follows: Given a number of audio files, the job is to classify each audio file into a specific genre, such as disco, hip-hop, etc.
A classification algorithm uses a dataset of labelled examples as inputs to create a model that can automatically categorise unlabeled examples when presented with new, unlabeled data. A binary classification problem is one where there are just two labels (such as "calm" or "rock"). The challenge of multi-class classification arises when there are three or more labels in the set. We are looking at a multi-class problem because the set contains a variety of genres.
II. LITERATURE REVIEW
III. METHODOLOGY
Convolutional neural networks (CNNs) are fed data from these visual representations.[14] The authors of Lidy and Rauber (2005)[13] talk on the use of psychoacoustic properties for classifying musical genres, particularly the significance of STFT measured using the Bark Scale (Zwicker and Fastl, 1999). Among the features used by (Tzanetakis and Cook, 2002)[4] were spectral contrast, spectral roll-off, and mel-frequency cepstral coefficients (MFCCs). In Nannietal. (2016), SVM and AdaBoost classifiers are trained using a combination of audio and visual information.
A. Common ML Algorithms
A few of the algorithms are described below.
Before using the K Nearest Neighbours Algorithm, keep the following points in mind:
4. Convolutional Neural Network (CNN): Convolutional neural network is a Deep Learning method built specifically for working with photos and videos. It uses photographs as inputs, extracts and learns the image's attributes, then categorises the images using the learned features. This programme takes its cues from how the Visual Cortex functions in the human brain. Processing of visual data from the outside world is carried out by the visual cortex, a region of the human brain. It has many levels, and each layer functions independently, extracting different information from images or other visuals. Once all the information from the various layers has been merged, the picture or visual is then evaluated or classed.
A neural network type called a convolutional neural network, or CNN or ConvNet, is particularly adept at processing input with a grid-like architecture, like an image. A binary representation of visual data is a digital image. It is made up of a grid-like arrangement of pixels, each of which has a pixel value to indicate how bright and what colour it should beIn CNN, rather than all the neurons in the fully linked layer, a layer's neurons will only be connected to a tiny portion of the layer.
IV. RANDOM FOREST CLASSIFICATION
The supervised classification approach known as the random forest can be applied to both classification and regression issues. As the name implies, this algorithm builds a forest out of several trees.
A. Input Data Set
Three Types of Music Metadata.
The picture classification model will be created, trained, and tested using the Python programming language. The model could be categorised roughly into:
a. Importing libraries and getting data ready
b. Model definition
c. Report on classification
d. Confusion Matrix
e. Last classified photos
V. FUTURE SCOPE
With more research in this area, we will be able to use different machine learning algorithms, compare accuracies, and make even more accurate predictions while also learning how other models function and their benefits.
The classification of music into genres is a fundamental component of a powerful recommendation system. The major objective is to develop a machine learning model that categorises music samples into various genres in a more methodical manner.
Automating music classification can make it easier to locate important information like trends, popular genres, and performers.
Our application successfully categorises playlists according to mood with the aid of machine learning, giving users a categorised playlist. When a playlist is being listened to, the listener feels more at ease and filled with emotions, which boosts their mood and improves their mental condition. Marilyn Manson once said that \"Music is the strongest form of magic\" because music has the power to heal people and transform their emotions, which is equivalent to any form of magic. Different music from your mood can make you feel stressed and unhappy, which can lead to low energy or inappropriate actions. However, this application\'s playlist perfectly matches the user\'s mood. The right music energises and inspires people to combat or handle their current predicament.
[1] Neural Network Music Genre Classification des genres de par reseau-neuronal (Nikki Pelchat). [2] Music Genre Classification using Machine Learning Techniques (by Hareesh Bahuleyan) [3] Music Genre Classification Using Deep Learning (by Navneet Parab, Shikta Das, Gunj Goda, Ameya Naik) [4] George Tzanetakis and Perry Cook. 2002. Musical genre classification of audio signals. IEEE Transactions on speech and audio processing 10(5):293– 302. [5] Y. M. Costa, L. S. Oliveira, and C. N. Silla, “An evaluation of convolutional neural networks for music classification using spectrograms,” Appl. Soft Comput., vol. 52, pp. 28–38, Mar. 2017. Accessed: Dec. 16, 2018. [Online] [6] “On Combining Diverse Models for Lyrics-Based Music Genre Classification ,Caio Luiggy Riyoichi SawadaUeno;Diego Furtado Silva, 2019 8th Brazilian Conference on Intelligent Systems (BRACIS). [7] J. Despois. Finding the Genre of a Song With Deep Learning— A.I. Odyssey Part. 1. Accessed: Dec. 27, 2018. [Online]. Available: https://hackernoon.com/finding-the-genre-of-a-song-with-deep-learningda8f59a61194. [8] F. Pachet and D. Cazaly, “A classification of musical genre,” in Proc. RIAO Content-Based Multimedia Information Access Conf., Paris, France, Mar. 2000. [9] S. Gollapudi, Practial Machine Learning. Birmingham, U.K.: Packt, 2016. [10] T. O’Brien. (2017). Learning to Understand Music From Shazam. Accessed: Dec. 19, 2018. [Online]. Available: https://blog.shazam. com/learning-to-understand-music-from-shazam-56a60788b62 [11] T. Feng. Deep learning for music genre classification. 2014. [12] R. Panda and R. P. Paiva, “Mirex 2012: Mood classification tasks submission,” Machine Learning, vol. 53, no. 1-2, pp. 23–69, 2003
Copyright © 2023 Ritika Dhyani, Priyansh Vatsal, Priyanshu Goel, Prafull chauhan, Prince Chauhan, Pratham Chauhan. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Paper Id : IJRASET50890
Publish Date : 2023-04-24
ISSN : 2321-9653
Publisher Name : IJRASET
DOI Link : Click Here