Questions tagged [audio-analysis]

To be used for questions related to the process of analyzing audio streams or files.

93 questions
1
vote
3 answers

Audio Analysis for Sheet Music

I'm currently working on a program that analyses a wav file of a solo musician playing an instrument and detects the notes within it. To do this it performs an FFT and then looks at the data produced. The goal is to (at some point) produce the…
Nyx
1
vote
0 answers

Audio context breaks when switching to a new audio source

I am using Chrome and I have an HTML audio element hooked up to an analyzer like this: var audio = document.getElementById('audio'); audio.src = "http://127.0.0.1:5000/api/getfile/new.wav"; var ctx = new…
1
vote
0 answers

Zoom and Moving based on audio information in FFMPEG

I recently wondered whether it is possible to zoom or move things in FFMPEG based on an audio source. I have already played around with complex filters, as they allow some audio visualization, but didn't really manage to move or zoom things based on sound. See…
Mattstir
1
vote
1 answer

How to automatically extract pitch range from .wav files in PRAAT and put them all in a .csv file?

I am using PRAAT, and I need to extract pitch ranges. My audio files are all less than 2 seconds long and I have dozens of them. I only know to obtain the information when I click on For now, I manually extract the pitch range (Query > Pitch Info) from…
silvermaze
1
vote
0 answers

How to calculate instantaneous frequency of the wav file

I have a wav file containing a recording of a human voice. I want to calculate the average frequency of the words spoken. Normally male voices are up to 200 Hz and female voices are slightly higher. For example, I want my answer like 170 Hz like…
1
vote
0 answers

Is there a way to get Mel-frequency cepstrum coefficients of a track from the Spotify API?

I am looking to get the MFCC (Mel-frequency cepstrum coefficients) of a Spotify track. My main aim is to identify the genre of a track, and the algorithm I'm studying right now uses MFCC to extract features of a track. I think there might be 2 ways…
Karan
1
vote
1 answer

Capture default audio stream with ALSA in C++

I am doing a fun project to change the color of Philips Hue bulbs based on the sound coming from the default ALSA device. I want to write a small C++ program that captures and analyzes the default audio stream and splits it into 3 changes low, mid,…
Lum Zhaveli
1
vote
1 answer

Increasing training data doesn't reduce overfitting in CNN

I am working on a CNN classification problem: using a CNN to classify audio emotions into 6 classes (anger, disgust, fear, happy, sad, neutral). I am using the EMODB dataset, and the input features are Fourier transforms [256*36]. And my CNN network…
1
vote
2 answers

Praat Integration in Python 2/3

I've been working with Praat for some audio analysis. However, I found some libraries that use Praat in Python and would like to do the same. This site offers a lot of features that can be extracted when Praat is used. I have followed he…
1
vote
1 answer

How to detect a known sound (short audio about 10sec) in a large audio file (5min)

I am new to working with audio files. Now I want to do the task below in Java. Are there any good tools or frameworks (like OpenCV for video and images) that can perform such a task? I have 5 audio files, each of which is about 10 sec long, and…
Bahramdun Adil
1
vote
1 answer

Web Audio API's AnalyserNode alternative for Node.js

Audio analysis in a web browser can easily be performed with the AnalyserNode of the Web Audio API. Is there something similar for Node.js?
TN.
1
vote
1 answer

run aubiopitch continuously on a file descriptor

I'd like to use aubiopitch to continuously output the frequency of a signal coming from an input source. Since aubiopitch likes to have its input be a file, not a stream, I tried using process substitution: $ aubiopitch -i <(sox -q -d -t wav -) I'd…
whereswalden
1
vote
0 answers

Detect specific sound in audio file

I have a PCM file with a recorded metronome. I want to programmatically detect the metronome beats in the file. How can I do that? Could you suggest some algorithms or a sound-analysis library?
Taras
1
vote
1 answer

Recognition of Audio

I'm working on a program that should record audio in a recognizable form, so that it can later be easily compared with other audio files. The audio files will contain something like speech, so I was wondering what would be easier to do: Implementing an algorithm…
1
vote
1 answer

Audio Analysis in WinJS

I've been poking around the API to find what I'm looking for, as well as searching online (but examples of Windows Store apps are pretty scarce). What I'm essentially looking for is a starting point for analyzing audio in a Windows Store JavaScript…
David