Questions tagged [audio-processing]

Audio processing involves the study of mathematical and signal processing techniques to understand or alter the nature of audio signals. The different kind of audio signals under study include speech, music, environmental audio and computer audio. Audio is analyzed in the temporal or spectral domain by applying various filters.

Key concept is to transform the audio into PCM format so you have access to the raw audio curve. Each channel will have its own curve.

Digital audio is represented by a series of points on this curve. Each point is called an audio sample. Numerical value of each sample can be represented in either integer or floating point.

Be aware to map each audio sample numerical value to memory typically requires several bytes of storage. One byte can store only 2^8 distinct values (256) which will result in noticeable distortion. High quality audio is typically stored using at least two bytes of storage per audio sample. When we use two bytes this gives us 2^16 possible values of the raw audio curve height as the audio wobbles up and down. The more bytes we use for storage the higher fidelity we gain as this reduces the gap between each distinct curve height measurement. This called bit depth. CD quality audio uses two bytes per audio sample per channel. The other fundamental aspect of digital audio is Sample Rate with determines the number of samples per second of time.

556 questions

-1

votes

1 answer

Getting different background colour of spectrograph from audio reading

import numpy as np import pandas as pd import matplotlib.pyplot as plt import librosa as lr import glob path = r'/content/drive/MyDrive/ESC-50/305 - Coughing/*.ogg' a = glob.glob(path) print(len(a)) for file in range(0,len(a),1): #scale,…

asked Nov 03 '21 at 07:56

Adil Asif

-1

votes

1 answer

Matlab : audio extract feature and neural nerwork

I have a 50 wav file of glass break sound and 50 wav file sound of normal sound. All the sound file duration is 1 second. Then I need to classify the sound using neural network. How can I extract the sound file and what neural network should i…

matlab neural-network audio-processing

asked Dec 07 '16 at 12:17

Bassam Abdul Basir

-1

votes

1 answer

How to get the lengh of a tone in scipy?

I have a signal with several tones and want to know where a given tone starts and where it ends. At first, I am applying a band pass filter and transform the signal (fft) in order to know if the recording has the tone. Now, I get the filtered signal…

python numpy scipy signal-processing audio-processing

asked Feb 20 '16 at 00:48

Dayvid Oliveira

1,157
2
14
34

-1

votes

1 answer

Audio/speech processing (with python's pymedia library) support in Robot Framework for automation

After a bit of research on the available automation frameworks for python, I find that Robot Framework fits almost all my requirements, except that it has not listed pymedia, tkSnack (on which my scripts are based) in any of the (built-in or…

python wav robotframework audio-processing

asked Sep 02 '15 at 12:18

skrowten_hermit

-1

votes

1 answer

FFT Plot of an Audio Signal - MATLAB

I'm using MATLAB to plot a recorded sound using the FFT. I want to take the log of the y-axis but I don't know what I did if correct. Currently, my FFT plotting code looks like this: nf=1024; %number of point in DTFT Y = fft(y,nf); f =…

matlab plot signal-processing fft audio-processing

asked May 17 '15 at 10:00

Hasan Haj

-1

votes

1 answer

Suggestion on Classification Method and Machine Learning Model for Environment Sound Classification

I currently have a problem regarding of Environment Sound Classification. I want to use Audio Classification to detect a specific type of collied sound (which is indeed quite different and very easy to be distinguished by human ears). But there are…

audio machine-learning classification audio-processing

asked May 14 '14 at 14:58

Tamaki Sakura

-2

votes

1 answer

What is Mel spectrogram as an audio sequence and how do I apply it?

I was under the impression that Mel-spectrograms were simply spectrograms with mel scale as the y axis. However, recently, I read in a research paper this line "Data representations such as Mel-Spectrograms can be seen from two different…

audio theory audio-processing spectrogram acoustics

asked Sep 14 '22 at 04:52

cchoi1022

-2

votes

1 answer

Removal of Noise from a wav file when I have the original audio

So for university I'm supposed to remove noise from a audio file. I have the original audio file and the teacher added some noise on top of it and I'm supposed to get rid of it using python. We have learnt the concepts on basic signals but I'm…

scipy signal-processing audio-processing

asked Nov 21 '20 at 07:57

nuub

-2

votes

2 answers

Measuring audio quality and appeal/aesthetic

Is there any way to measure the quality and appeal/aesthetic of an audio clip? The quality quantifies how good the sound is, ie., the lower the noise the better the quality is. Whereas the appeal/aesthetic measures how appealing the sound is to the…

deep-learning signal-processing audio-processing

asked Nov 16 '19 at 04:33

Saikat

1,209
3
16
30

-2

votes

1 answer

Why is my output different when the code is same?

I am currently putting together a band-pass filter using the following code: https://ipython-books.github.io/116-applying-digital-filters-to-speech-sounds/ I made few edits to the above code, namely the file is no longer pulled from an url but…

python audio-processing bytesio

asked May 20 '19 at 20:10

PaperRockBazooka

-2

votes

1 answer

How can i analysis the output of the three machine learning model so that i only get the one result?

I am trying to do genre based classification of the song. I have extracted the audio features ( MFCC, Pitch, and Rhythm) for a song. I trained the three model (ANN, SVM, SVM ) by each features. Three model give the three output (genre of the song).…

machine-learning signal-processing data-mining feature-extraction audio-processing

asked Jul 29 '16 at 17:33

Someone

-2

votes

1 answer

What does "frame blocking" mean?

I'm curious about "frame blocking" which is the first step of feauture extraction as i saw in web site. It says that these steps should be follow for feature extraction of an audio: frame blocking windowing fft mel-frequency…

c# speech-recognition naudio audio-processing feature-extraction

asked Mar 05 '13 at 15:07

Cengaver

-2

votes

1 answer

How to get audio levels of an audio file extracted from a video file?

The frame rate of a given video is 24fps. For audio sampling rate equal to 44.1Khz and 22050 (Fs/2) samples of audio are present in every second of audio. i.e. for every video frame, 22050/24≈919 audio samples are present. This is the matlab code I…

matlab audio-processing

asked Dec 26 '12 at 13:07

Animesh Pandey

5,900
13
64
130

-3

votes

1 answer

how to read an MP3 file for audio processing in Delphi?

When I want to process an audio in a byte level, I always convert it in .wav format and then do my processing. For example in my last project, I was trying to generate a kind of special waveform image of my audio file to use it in a video clip. Then…

delphi buffer mp3 waveform audio-processing

asked Aug 20 '18 at 20:58

M Ahmadzadeh

-3

votes

1 answer

Which developer roles or titles are needed to build a software DAW?

So far, I've used many different Audio Production software on Mac and Windows platforms. Often times, I ponder on the idea of creating my own DAW, but I realize that would be an extremely difficult challenge for a single person to undertake…

audio roles audio-processing

asked Nov 15 '12 at 20:01

chamberlainpi

4,854
8
32
63

Prev 1 2 3

…

38 Next