Im using scipy.signal.stft
to get the stft of an audio. No porblem with that and I'm getting the results. But what I don't understand is, when an audio of 169600 samples
whose sample rate is fs=44100 Hz
is used to calculate stft
, I get 3 returns as f, t
and Zxx
. Here the shape of Zxx
is (2049, 84)
.
For calculating stft
I use a window of size 4096
, and as the window type the hanning window
is used. And by default, scipy.signal.stft
uses a window_size // 2
overlap between frames.
My question: Is there 2049
ovelapping frames? if so or if not, how to calculate the number of overlapping frames in stft? if 2049
is not the number of ovelapping samples, what does that number mean?