The design of this API is not taking into account that media files might have multiple audio tracks. You will need to use a different API offering better track and subtitle flexibility, such as high level WPF MediaElement
or some wrapper over low level media APIs like DirectShow or Media Foundation (respectively DirectShow.NET library, and less so MediaFoundation.NET).
If you expect similar ease of use equal to Microsoft.DirectX.AudioVideoPlayback
API, you would probably want to use .NET APIs and controls like WPF MediaElement, XAML MediaPlayerElement, staying within their limitations and types of media they offer support for.
Use of low level media APIs (which in turn are all expecting consumption from C++ and not .NET) with wrappers is offering better flexibility but is also notoriously associated with need to dive deep into very small details of media playback.