Environmental Audio Tagging Using Deep Convolution Neural Network and Digital Signal Processing

Anirudh Rana; Rajinder Singh Rana

doi:10.31033/ijemr.11.6.17

Authors

Anirudh Rana, System Security Engineer, Silex Software’s Ltd, INDIA
Rajinder Singh Rana, Principal, Sanatan Dharama College, Ambala Cantt., INDIA

DOI:

https://doi.org/10.31033/ijemr.11.6.17

Keywords:

Environmental Sound Classification, Deep Convolutional Neural Networks, Digital Signal Processing, Urban Sound Dataset, Data Augmentation

Abstract

Machine learning has grown strongly in recent years, driven by larger datasets, greater computational power, and advances in deep learning methods that can learn to make predictions in highly non-linear problem settings. The challenging problem of automatic environmental sound classification has received increasing attention from the research community in recent years. In this paper, the audio dataset is converted into spectrograms using Digital Signal Processing (DSP). The spectrograms thus obtained are fed to a Convolutional Neural Network (CNN) for classification of the audio signal. We present a deep convolutional neural network architecture with localized kernels for environmental sound classification. By training the network on additional deformed data, the hope is that the network becomes invariant to these deformations and generalizes better to unseen data. We show that the proposed DSP, in combination with the CNN architecture, yields state-of-the-art performance for environmental sound classification.
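The DSP step described in the abstract, turning a waveform into a spectrogram that a CNN can consume, can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names `log_spectrogram` and `time_shift`, the frame length, hop size, Hann window, and the choice of a circular time shift as the "deformation" are all assumptions made for illustration, and a synthetic tone stands in for a real audio clip.

```python
import numpy as np


def log_spectrogram(signal, frame_len=256, hop=128, eps=1e-10):
    """Return a (frames, frame_len // 2 + 1) log-magnitude spectrogram in dB.

    frame_len and hop are illustrative values, not taken from the paper.
    """
    signal = np.asarray(signal, dtype=np.float64)
    if signal.size < frame_len:
        # Zero-pad very short clips so at least one frame exists.
        signal = np.pad(signal, (0, frame_len - signal.size))
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)
    # Slice the waveform into overlapping, windowed frames.
    frames = np.stack(
        [signal[i * hop:i * hop + frame_len] * window for i in range(n_frames)]
    )
    # Magnitude of the real FFT of each frame, converted to decibels.
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    return 20.0 * np.log10(magnitude + eps)


def time_shift(signal, shift):
    """Circularly shift a waveform: one simple, assumed form of deformation."""
    return np.roll(np.asarray(signal), shift)


# Synthetic 1 kHz tone sampled at 8 kHz, standing in for a real recording.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)

spec = log_spectrogram(tone)
augmented_spec = log_spectrogram(time_shift(tone, 100))
```

Each row of `spec` is one time frame and each column one frequency bin, so the array can be treated as a single-channel image and passed to a convolutional network. Computing spectrograms of deformed copies, as with `augmented_spec`, is one way to produce the additional training data the abstract refers to.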
