Questions tagged [audio-processing]

Audio processing is the study of mathematical and signal processing techniques used to understand or alter audio signals. The kinds of audio signals studied include speech, music, environmental audio and computer audio. Audio is analyzed in the temporal or spectral domain by applying various filters.

A key concept is to convert the audio to PCM format so that you have access to the raw audio curve. Each channel has its own curve.

Digital audio is represented by a series of points on this curve. Each point is called an audio sample. The numerical value of each sample can be represented as either an integer or a floating-point number.
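As a rough illustration of the points above, the following sketch uses only the Python standard library to build a tiny 16-bit stereo WAV in memory, read back its PCM samples, split them into one list per channel, and convert the integers to floats. The helper names (`make_stereo_wav`, `read_pcm_channels`, `to_float`) and the test data are hypothetical, and the sketch assumes 16-bit little-endian PCM only.

```python
import io
import struct
import wave


def make_stereo_wav(sample_rate=8000, n_frames=4):
    """Build a tiny 16-bit stereo WAV in memory (made-up test data)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(2)
        w.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        w.setframerate(sample_rate)
        frames = b""
        for i in range(n_frames):
            # One frame holds one sample per channel: left, then right.
            frames += struct.pack("<hh", i * 1000, -i * 1000)
        w.writeframes(frames)
    buf.seek(0)
    return buf


def read_pcm_channels(wav_file):
    """Return one list of integer samples per channel (16-bit PCM only)."""
    with wave.open(wav_file, "rb") as w:
        if w.getsampwidth() != 2:
            raise ValueError("this sketch handles 16-bit PCM only")
        n_channels = w.getnchannels()
        raw = w.readframes(w.getnframes())
    samples = struct.unpack("<%dh" % (len(raw) // 2), raw)
    # Samples are interleaved (L R L R ...), so slice with a stride.
    return [list(samples[c::n_channels]) for c in range(n_channels)]


def to_float(samples):
    """Map 16-bit integer samples to floats in [-1.0, 1.0)."""
    return [s / 32768.0 for s in samples]


channels = read_pcm_channels(make_stereo_wav())
left, right = channels
print(left)             # [0, 1000, 2000, 3000]
print(to_float(right))  # the same curve, mirrored and scaled to floats
```

Real files in compressed formats such as MP3 would first need to be decoded to PCM by a separate tool or library before this kind of access is possible.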

Be aware that storing each audio sample's numerical value in memory typically requires several bytes. One byte can store only 2^8 (256) distinct values, which results in noticeable distortion. High-quality audio is typically stored using at least two bytes per audio sample. Two bytes give us 2^16 (65,536) possible values for the height of the raw audio curve as the audio wobbles up and down. The more bytes we use for storage, the higher the fidelity, as this reduces the gap between adjacent curve height measurements. This is called bit depth. CD-quality audio uses two bytes per audio sample per channel. The other fundamental aspect of digital audio is the sample rate, which determines the number of samples per second.
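To make the effect of bit depth and sample rate concrete, here is a minimal sketch that rounds a sine wave to 8-bit and 16-bit levels and compares the worst rounding error, then computes the raw data rate of CD audio. The function names are hypothetical, the signed symmetric quantizer is a simplification chosen for illustration, and the figures of 44,100 samples per second and two channels for CD audio are supplied here rather than taken from the text above.

```python
import math


def quantize(x, bits):
    """Round x in [-1.0, 1.0] to the nearest level of the given bit depth."""
    max_level = 2 ** (bits - 1) - 1  # 127 for 8-bit, 32767 for 16-bit
    return round(x * max_level) / max_level


def max_quantization_error(bits, sample_rate=44100, freq=440.0, n=1000):
    """Worst rounding error over n samples of a sine wave."""
    worst = 0.0
    for i in range(n):
        x = math.sin(2 * math.pi * freq * i / sample_rate)
        worst = max(worst, abs(x - quantize(x, bits)))
    return worst


def bytes_per_second(sample_rate, bytes_per_sample, channels):
    """Raw PCM data rate, before any container or compression overhead."""
    return sample_rate * bytes_per_sample * channels


levels_8 = 2 ** 8    # distinct values in one byte
levels_16 = 2 ** 16  # distinct values in two bytes
err_8 = max_quantization_error(8)
err_16 = max_quantization_error(16)
cd_rate = bytes_per_second(44100, 2, 2)  # stereo, two bytes per sample

print(levels_8, levels_16)
print(err_8, err_16)  # the 16-bit error is far smaller
print(cd_rate)        # 176400 bytes per second
```

The gap between adjacent levels shrinks by a factor of roughly 256 when going from one byte to two, which is why the 16-bit error is so much smaller.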

556 questions

1 answer

Find most dominant audio frequency in sample

I'm trying to create a project that pulls in a live stream audio file from the internet and continuously samples the audio looking for the most dominant frequency for a given time period. The idea is that if it detects a frequency of let's say 440Hz…

asked Dec 08 '18 at 15:58

Bill

5 answers

Convert audio to text

I just want to know if there are any built-in or external libraries in Java or C# that allow me to take an audio file, parse it, and extract the text from it. I need to make an application to do so, but I don't know from where I can…

c# java speech-recognition audio-processing

asked Oct 18 '10 at 10:41

Amira Elsayed Ismail

3 answers

How to write C++ audio processing applications?

I'm an Electronics and Telecommunications student, close to graduation. I'm going to work on a project that involves my knowledge of DSP, music and audio in general. I already know all the basic mathematical tools and all the stuff I need to…

c++ linux eclipse waveform audio-processing

asked Mar 08 '10 at 06:15

Cesko82

1 answer

How to add an external audio track to a video file using VLC or FFMPEG command line

I want to add an audio.mp3 soundtrack to a soundless video.mp4 file using a bash script. What is the correct syntax of the "cvlc" or "ffmpeg" command line? I recorded the video with VLC and the --no-audio option, so there are no settings such as bit rate…

ffmpeg video-processing vlc audio-processing

asked Nov 27 '13 at 23:06

Jonathan

2 answers

Sound recognition API, SDK (Android)

I need to make an Android app that can recognize certain sound files created by me, and do an action on recognition. So something similar to Shazam/Soundhound, but with my own sound files. Is there any API or SDK or something for this? I've read…

android audio audio-processing

asked Jun 20 '13 at 07:10

KKO

6 answers

Library for reading audio files

I want to process audio online/live where I constantly read audio samples from an audio file, process these (e.g. apply some effect), and forward the processed samples to an audio output device like a soundcard. The input files have common formats…

c++ c mp3 wav audio-processing

asked Oct 16 '12 at 13:34

Hyndrix

4 answers

Python: Change Pitch of Audio File

This is my first post on Stack. So far this site has been very helpful, but I am a novice and need a clear explanation of my problem, which is related to pitch-shifting audio in Python. I currently have these modules installed: numpy, scipy, pygame, and…

python scipy pygame audio-processing

asked Dec 14 '11 at 08:03

hilmers

1 answer

Adding silent frame to wav file using python

First time posting here, let's see how this goes. I am trying to write a script in Python that would add a second of silence at the beginning of a wav file, but so far I have been unsuccessful in doing so. What I was trying to do is read in the wav header…

python audio wave audio-processing

asked Oct 15 '17 at 17:22

Madmax

1 answer

Get Video and Audio buffer separately while recording video using front camera

I dug a lot on SO and some nice blog posts, but it seems I have a unique requirement: reading the video and audio buffers separately for further processing while recording is going on. My use case is as follows: when the user starts the video recording, I…

android video-processing android-camera2 audio-processing firebase-mlkit

asked Jan 05 '19 at 05:35

Bipin Vayalu

4 answers

Find sound effect inside an audio file

I have a load of 3 hour MP3 files, and every ~15 minutes a distinct 1 second sound effect is played, which signals the beginning of a new chapter. Is it possible to identify each time this sound effect is played, so I can note the time offsets? The…

python audio audio-processing librosa

asked Sep 29 '18 at 21:26

Craig Francis

0 answers

How to get & parse the values of iTunes EQ presets

We are trying to implement a music player app with Equalizer presets. We are successful in getting presets from iPod and applying it through audio unit. But, now we need to display sliders and set frequency with respect to the selected preset. But…

ios iphone swift audio audio-processing

asked Aug 09 '16 at 05:32

Raghavandra Santosh

1 answer

Open source FSK decoder library?

I'm looking for a library or tool to decode FSK in wav files, e.g. caller id. Currently using the tools bundled with vpb-driver for Voicetronix hardware that is available via debian/ubuntu. But this appears to have an error that I'm trying to debug…

audio telephony audio-processing

asked Sep 15 '10 at 02:43

bstpierre

1 answer

audio comparison with R

I am working on a project where my task deals with speech/audio/voice comparison. This project is used for judging the winner in competitions (mimicry). Practically, I need to capture the user's speech/voice and compare it with the original audio…

r audio-processing

asked Dec 14 '15 at 10:20

Dinesh

0 answers

Can I measure distances with sound in an android app?

I've got a number of questions this time, although they all relate to the same problem: I wanted to build a rudimentary sonar in Android, and have no clue as to how possible it is to do such a thing. Is it something that can work reasonably…

android waveform audio-processing

asked Apr 23 '15 at 05:14

Carlos Vergara

5 answers

processing an audio wav file with C

I'm working on processing the amplitude of a wav file and scaling it by some decimal factor. I'm trying to wrap my head around how to read and re-write the file in a memory-efficient way while also trying to tackle the nuances of the language (I'm…

c wav audio-processing

asked Mar 16 '10 at 19:35

sa125
