IVR bots are audio-only, they do not support video. Video is supported with a real-time media bot.
A real-time media bot can record the audio and video streams (and, ideally, notify the Skype caller that it may be recording the call). We do not provide a full sample showing this scenario. There is an answer for similar question on the GitHub site:
In a 1:1 Skype video call with your real-time media bot, the bot has
access to the Skype caller's audio and video. The Huebot sample shows
a bot receiving the audio and video of the Skype caller. To create a
recording, the bot could save the contents of audio and video media
buffers to disk and use a tool like ffmpeg to process them into an MP4
file, for example. But there's no specific recording sample. Also note
that the bot cannot control what resolution it receives of the Skype
caller's video, and will likely get a low resolution such as 240p.
Hope this help!!