I have an AXIS IP camera (M1054) that sends an H264/RTP stream via RTSP.
Unfortunately, it does not send SPS and PPS NALUs at all; it only transfers (fragmented) coded slices.
I'm trying to decode that stream with the iOS VideoToolbox framework, which needs the H264 SPS and PPS tuple to correctly set up the CMFormatDescription.
How can I synthesize the necessary parameter sets by looking at the actual H264 slices?
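For anyone who wants to verify which NAL unit types actually arrive, here is a minimal Python sketch that labels one RTP payload. It assumes RFC 6184 packetization and that the 12-byte RTP header (plus any extensions) has already been stripped. The function name `classify_rtp_h264_payload` is made up for illustration and is not part of any library.

```python
# NAL unit type names per H.264 / RFC 6184 (only the ones relevant here).
NAL_TYPES = {
    1: "non-IDR slice",
    5: "IDR slice",
    6: "SEI",
    7: "SPS",
    8: "PPS",
    24: "STAP-A",
    28: "FU-A",
}


def classify_rtp_h264_payload(payload: bytes) -> str:
    """Label the NAL unit carried in one RTP payload (RTP header already removed)."""
    if not payload:
        raise ValueError("empty payload")

    # The low 5 bits of the first byte are the NAL unit type.
    nal_type = payload[0] & 0x1F

    if nal_type == 28:  # FU-A: one NAL unit split across several RTP packets
        if len(payload) < 2:
            raise ValueError("truncated FU-A payload")
        fu_header = payload[1]
        inner_type = fu_header & 0x1F  # type of the fragmented NAL unit
        if fu_header & 0x80:           # S bit: first fragment
            position = "start"
        elif fu_header & 0x40:         # E bit: last fragment
            position = "end"
        else:
            position = "middle"
        inner_name = NAL_TYPES.get(inner_type, "type %d" % inner_type)
        return "FU-A fragment (%s) of %s" % (position, inner_name)

    # Note: a STAP-A packet may itself contain SPS/PPS; this sketch only labels it.
    return NAL_TYPES.get(nal_type, "type %d" % nal_type)
```

If no payload in the stream is ever labeled `SPS`, `PPS`, or `STAP-A`, the parameter sets are not being delivered in-band.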
Update: I have captured an example session with Wireshark in which mplayer manages to display the stream. The capture file is here, and it shows the whole RTSP setup as well as a couple of seconds of RTP.
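One thing that could be checked in the RTSP setup part of such a capture: RFC 6184 defines an optional `sprop-parameter-sets` parameter on the SDP `a=fmtp` line, which carries base64-encoded parameter sets out of band. Whether this camera's DESCRIBE response includes it is an assumption, not something established above. A minimal Python sketch for extracting it, with a made-up function name, could look like this:

```python
import base64
import re
from typing import List


def parameter_sets_from_sdp(sdp: str) -> List[bytes]:
    """Return the raw NAL units listed in sprop-parameter-sets, or [] if absent."""
    # The value is a comma-separated list of base64 strings, ended by ';' or whitespace.
    match = re.search(r"sprop-parameter-sets=([^;\s]+)", sdp)
    if match is None:
        return []
    return [base64.b64decode(part) for part in match.group(1).split(",") if part]
```

Each returned byte string is a NAL unit without a start code; its first byte masked with `0x1F` gives the type (7 for SPS, 8 for PPS). If the list is empty, the SDP does not advertise parameter sets this way.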