How is audio data down-sampled to 5512 Hz PCM? I have read some articles and the steps involved are decoding the audio to PCM, converting it to Mono and then downsampling it.
For converting to mono, are the channels of each frame averaged to get the mono signal?
Once the mono signal is obtained, how is it down-sampled?
if down-sampled sample rate = sample rate divided by integer factor, how is this applied to the audio data in the mono signal?