I have used FFMPEG to extract decibel (or rms? I am not familiar with the units) values of the audio volume from an mp4. I have 20 samples per frame.
How can I use these values (which are negative in almost all frames), to determine if the frame is silent or has audio (music, speech, etc)?