Questions tagged [signal-processing]

AKA digital signal processing (DSP). A signal is an information-carrying changing attribute of an entity, but in the digital sense, 'signal' refers to either received or transmitted streams/blocks of data, commonly representing real-world quantities such as audio levels, luminosity, pressure etc over time or distance. 'Processing' is the act of altering, analyzing or characterizing the data to retrieve/modify information inherent in the signal in question.

A signal is an information-carrying changing attribute of an entity, but in the digital sense, 'signal' refers to either received or transmitted streams/blocks of data, commonly representing real-world quantities such as audio levels, luminosity, pressure etc over time or distance.

These real-world quantities usually comes as analogue signals that are being sampled and quantized into a digital format, i.e. a sequence or higher-dimensional array of discrete numbers.

'Processing' is the act of altering, analyzing or characterizing the data to retrieve/modify information inherent in the signal in question.

NOTE: questions not related to the programming of signal processing implementations but the signal processing theory behind are off-topic for Stack Overflow and should be posted to the appropriate sister site, Signal Processing SE.

Common topics include:

  • digital filtering;
  • audio processing;
  • image processing;
  • sampling;
  • data compression;
  • spectral analysis;

Analysis tools commonly in practice are the Discrete Fourier Transform (DFT), especially its fast implementations (FFT); and wavelet transforms.

Common softwares used in the field include ; ; , especially with /; ///; ; and various languages.

5814 questions
18
votes
3 answers

I want to learn audio programming

At my high school we can take a class where we basically learn about a subject on our own for a semester. I was thinking that I want to learn about "sound programming," but I realized that I have no idea what that entails. I'm interested in learning…
Austin Gayler
  • 4,038
  • 8
  • 37
  • 60
18
votes
3 answers

Librosa pitch tracking - STFT

I am using this algorithm to detect the pitch of this audio file. As you can hear, it is an E2 note played on a guitar with a bit of noise in the background. I generated this spectrogram using STFT: And I am using the algorithm linked above like…
pavlos163
  • 2,730
  • 4
  • 38
  • 82
18
votes
1 answer

Make the matrix multiplication operator @ work for scalars in numpy

In python 3.5, the @ operator was introduced for matrix multiplication, following PEP465. This is implemented e.g. in numpy as the matmul operator. However, as proposed by the PEP, the numpy operator throws an exception when called with a scalar…
18
votes
3 answers

convert sound to list of phonemes in python

How do I convert any sound signal to a list phonemes? I.e the actual methodology and/or code to go from a digital signal to a list of phonemes that the sound recording is made from. eg: lPhonemes = audio_to_phonemes(aSignal) where for…
Roman
  • 8,826
  • 10
  • 63
  • 103
18
votes
3 answers

custom wave forms in web audio API

I'm working through this awesome article: https://jackschaedler.github.io/circles-sines-signals/dft_introduction.html I want to use the Web Audio API's PeriodicWave object to implement this demo: However, when I set a periodic wave with these…
Eliot Winder
  • 252
  • 2
  • 9
18
votes
2 answers

Does "16bit integer PCM data" mean it's signed or unsigned?

I'm using FMOD to develop an application which would immediately start playing the recording of the next/previous sentence exactly from its beginning in a MP3 file which contains speech, without music, when the user clicked the Next/Prev button. I…
xiaokaoy
  • 1,608
  • 3
  • 15
  • 27
17
votes
4 answers

Detecting wind noise

I want to develop an app for detecting wind according the audio stream. I need some expert thoughts here, just to give me guide lines or some links, I know this is not easy task but I am planning to put a lot of effort here. My plan is to detect…
Lukap
  • 31,523
  • 64
  • 157
  • 244
17
votes
4 answers

What is a good approach for extracting portions of speech from an arbitrary audio file?

I have a set of audio files that are uploaded by users, and there is no knowing what they contain. I would like to take an arbitrary audio file, and extract each of the instances where someone is speaking into separate audio files. I don't want to…
stef
  • 14,172
  • 2
  • 48
  • 70
17
votes
1 answer

How can I use smoothing techniques to remove jitter in pose estimation?

I get lots of jitter when using OpenPose to extract pose data from video. This is unnatural looking, and my results don't look natural and human. The data I get from the OpenPose model is what I have to work with, and I can't improve the quality of…
Stephen Meschke
  • 2,820
  • 1
  • 13
  • 25
17
votes
2 answers

How to find the fundamental frequency of a guitar string sound?

I want to build a guitar tuner app for Iphone. My goal is to find the fundamental frequency of sound generated by a guitar string. I have used bits of code from aurioTouch sample provided by Apple to calculate frequency spectrum and I find the…
Mircea
  • 915
  • 6
  • 12
17
votes
1 answer

Using numpy/scipy to identify slope changes in digital signals?

I am trying to come up with a generalised way in Python to identify pitch rotations occurring during a set of planned spacecraft manoeuvres. You could think of it as a particular case of a shift detection problem. Let's consider the…
stm4tt
  • 755
  • 1
  • 5
  • 22
17
votes
3 answers

Algorithm to determine fundamental frequency from potential harmonics

I am attempting to extract a fundamental frequency from a sound source. maybe someone is singing A3 into the microphone, so I want to be detecting ~ 110Hz my approach is: FFT 1024 floats use the phase of each bin to accurately determine its…
P i
  • 29,020
  • 36
  • 159
  • 267
17
votes
1 answer

FSK demodulation with GNU Radio

I'm trying to demodulate a signal using GNU Radio Companion. The signal is FSK (Frequency-shift keying), with mark and space frequencies at 1200 and 2200 Hz, respectively. The data in the signal text data generated by a device called GeoStamp Audio.…
Evan
  • 1,960
  • 4
  • 26
  • 54
17
votes
4 answers

Pitch detection in Python

The concept of the program I'm working on is a Python module which detects certain frequencies (human speech frequency 80-300hz) and by checking from a database shows the intonation of the sentence. I use SciPy to plot frequency of the sound files,…
17
votes
2 answers

How to capture audio samples in iOS with Swift?

I've found lots of examples online for working with audio in iOS, but most of them are pretty outdated and don't apply to what I'm trying to accomplish. Here's my project: I need to capture audio samples from two sources - microphone input and…
Hundley
  • 3,167
  • 3
  • 23
  • 45