The best answer here is: Don't use DirectSound unless you are using Windows XP.
If you are playing only a single sound at a time and you don't care about any real-time mixing, you can use something as trivial as PlaySound. I'm assuming that you actually want real-time mixing and the ability to play multiple sounds that overlap.
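For the trivial case, a fire-and-forget call looks something like this minimal sketch (the file name is just a placeholder):

```cpp
// Minimal sketch: play a single .wav file with the Win32 PlaySound API.
#include <windows.h>
#include <mmsystem.h>
#pragma comment(lib, "winmm.lib")

int main()
{
    // SND_FILENAME: first argument is a path; SND_ASYNC: return immediately.
    PlaySound(TEXT("explosion.wav"), nullptr, SND_FILENAME | SND_ASYNC);

    Sleep(2000); // keep the process alive long enough for the async sound to finish
    return 0;
}
```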
For Windows 8 or Windows 8.1 you can use XAudio 2.8, and for Windows 10 you can use XAudio 2.9; both are built into the operating system. Otherwise, you can use XAudio 2.7, which is part of the legacy DirectX SDK.
See Learning XAudio2 for some educational resources.
See DirectX Tool Kit for Audio for a simple C++ wrapper for XAudio2.
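For reference, the raw XAudio2 path (without the Tool Kit wrapper) looks roughly like the sketch below. It assumes you've already loaded PCM data into memory; the format values and the helper name are just illustrative, and error cleanup is omitted:

```cpp
// Rough sketch of playing a preloaded PCM buffer with XAudio 2.8+.
#include <windows.h>
#include <xaudio2.h>
#pragma comment(lib, "xaudio2.lib")

HRESULT PlayPcm(const BYTE* pcmData, UINT32 pcmBytes)
{
    // XAudio2 expects COM to be initialized on this thread.
    CoInitializeEx(nullptr, COINIT_MULTITHREADED);

    IXAudio2* xaudio2 = nullptr;
    HRESULT hr = XAudio2Create(&xaudio2, 0, XAUDIO2_DEFAULT_PROCESSOR);
    if (FAILED(hr)) return hr;

    // The mastering voice is the final mix that feeds the audio device.
    IXAudio2MasteringVoice* masterVoice = nullptr;
    hr = xaudio2->CreateMasteringVoice(&masterVoice);
    if (FAILED(hr)) return hr;

    // Describe the source data (assumed here: 44.1 kHz, 16-bit, stereo PCM).
    WAVEFORMATEX wfx = {};
    wfx.wFormatTag      = WAVE_FORMAT_PCM;
    wfx.nChannels       = 2;
    wfx.nSamplesPerSec  = 44100;
    wfx.wBitsPerSample  = 16;
    wfx.nBlockAlign     = wfx.nChannels * wfx.wBitsPerSample / 8;
    wfx.nAvgBytesPerSec = wfx.nSamplesPerSec * wfx.nBlockAlign;

    // Each source voice is an independent 'voice' that gets mixed in real time.
    IXAudio2SourceVoice* sourceVoice = nullptr;
    hr = xaudio2->CreateSourceVoice(&sourceVoice, &wfx);
    if (FAILED(hr)) return hr;

    // Queue the whole sound as a single 'packet' and start the voice.
    XAUDIO2_BUFFER buffer = {};
    buffer.pAudioData = pcmData;
    buffer.AudioBytes = pcmBytes;
    buffer.Flags      = XAUDIO2_END_OF_STREAM;
    sourceVoice->SubmitSourceBuffer(&buffer);
    return sourceVoice->Start();
}
```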
RE: DirectSound buffers
To your original question: back in the old days of Windows 95, the 'primary buffer' was the actual audio buffer submitted to the hardware. The 'secondary buffers' were where you created your individual 'voices' for playing back more than one sound at a time. The system then mixed all the secondary buffers into the primary buffer for playback.
Since the transition to NT, however, the 'primary buffer' isn't really there anymore. There is something called a 'primary' buffer, but it's mostly there for BackCompat. All the buffers are mixed into a single buffer by DirectSound and then fed to the system for playback. On Windows Vista or later, that mix is fed to the Windows Audio Session API (WASAPI), which mixes the sounds from the system and all running applications before anything actually reaches the audio hardware.
You can use WASAPI directly, but the API is quite restrictive because it doesn't do any application-level mixing or source-rate conversion. Generally you only use WASAPI directly if you are writing an audio engine that has already done all the required conversions and mixing and just wants to play a final mix.
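To give a sense of how bare-bones it is, here is a rough shared-mode WASAPI sketch (error handling and cleanup omitted, and the function name is mine). Note that GetMixFormat dictates the format; your data must already be converted to match it:

```cpp
// Rough sketch: open the default shared-mode render endpoint with WASAPI.
#include <windows.h>
#include <mmdeviceapi.h>
#include <audioclient.h>
#pragma comment(lib, "ole32.lib")

HRESULT StartSharedModeRender()
{
    CoInitializeEx(nullptr, COINIT_MULTITHREADED);

    IMMDeviceEnumerator* enumerator = nullptr;
    CoCreateInstance(__uuidof(MMDeviceEnumerator), nullptr, CLSCTX_ALL,
                     __uuidof(IMMDeviceEnumerator), (void**)&enumerator);

    IMMDevice* device = nullptr;
    enumerator->GetDefaultAudioEndpoint(eRender, eConsole, &device);

    IAudioClient* client = nullptr;
    device->Activate(__uuidof(IAudioClient), CLSCTX_ALL, nullptr, (void**)&client);

    // The shared-mode engine dictates the format; you must supply audio in it.
    WAVEFORMATEX* mixFormat = nullptr;
    client->GetMixFormat(&mixFormat);
    client->Initialize(AUDCLNT_SHAREMODE_SHARED, 0,
                       10000000 /* 1 second, in 100-ns units */, 0,
                       mixFormat, nullptr);

    IAudioRenderClient* render = nullptr;
    client->GetService(__uuidof(IAudioRenderClient), (void**)&render);

    // Grab the endpoint buffer and fill it (here: silence), then release it.
    UINT32 frames = 0;
    client->GetBufferSize(&frames);
    BYTE* data = nullptr;
    render->GetBuffer(frames, &data);
    render->ReleaseBuffer(frames, AUDCLNT_BUFFERFLAGS_SILENT);

    return client->Start();
}
```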
In any case, the reason there are two sets of pointers when dealing with Lock is that the buffer is a "ring buffer", a.k.a. a "circular buffer". In the olden days of Windows 95, parts of the primary buffer would actually be played out by the hardware at the same time you were writing into the buffer ahead of the current playback position. You had this complicated two-pointer setup to avoid overwriting data that was still being played; otherwise you got the dreaded 'popping' or 'glitching' in your sound playback. Since this never happens anymore on modern versions of Windows, it's all just there for BackCompat with respect to the primary buffer. That said, the DirectSound mixer still makes use of the fact that secondary buffers are "ring buffers", so the same mechanism guards against the real-time mixer reading as you write 'ahead' if you happen to be updating a playing buffer. If a secondary buffer is not playing, you can safely just pass nullptr for the second pointer and size.
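A typical Lock/Unlock write on a secondary buffer therefore handles both spans, something like this sketch (error handling trimmed, helper name is mine):

```cpp
// Sketch of the two-pointer Lock/Unlock pattern on a DirectSound secondary
// buffer. Because the buffer is a ring, the locked region may wrap past the
// end, in which case the second pointer/size pair covers the wrapped portion.
#include <windows.h>
#include <dsound.h>
#include <cstring>

HRESULT WriteToSecondaryBuffer(IDirectSoundBuffer8* buffer,
                               DWORD writeOffset,
                               const BYTE* src, DWORD srcBytes)
{
    void* part1 = nullptr; DWORD part1Bytes = 0;
    void* part2 = nullptr; DWORD part2Bytes = 0;

    HRESULT hr = buffer->Lock(writeOffset, srcBytes,
                              &part1, &part1Bytes,
                              &part2, &part2Bytes, 0);
    if (FAILED(hr))
        return hr;

    // First span: from the lock offset up to (at most) the end of the ring.
    memcpy(part1, src, part1Bytes);

    // Second span: only non-null if the locked region wrapped back to the start.
    if (part2)
        memcpy(part2, src + part1Bytes, part2Bytes);

    return buffer->Unlock(part1, part1Bytes, part2, part2Bytes);
}
```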
This old-school "ring buffer" model was complicated to work with, and it mattered more when system memory was quite limited. Pretty much all modern sound APIs are 'packet' based instead: each playing voice has a queue of pending buffers, and you add more data by submitting a new buffer to the queue for processing. You can get a notification when a buffer completes, so you know the audio in that 'packet' has been processed.
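With XAudio2, for instance, the queue looks roughly like this (assuming a source voice created as in the earlier sketch; the helper is just illustrative):

```cpp
// Sketch of the 'packet' model with XAudio2: each SubmitSourceBuffer call
// appends a packet to the voice's queue, and GetState reports how many
// packets are still pending.
#include <windows.h>
#include <xaudio2.h>

void QueuePackets(IXAudio2SourceVoice* voice,
                  const BYTE* const packets[], const UINT32 packetBytes[],
                  UINT32 packetCount)
{
    for (UINT32 i = 0; i < packetCount; ++i)
    {
        XAUDIO2_BUFFER buffer = {};
        buffer.pAudioData = packets[i];
        buffer.AudioBytes = packetBytes[i];
        if (i == packetCount - 1)
            buffer.Flags = XAUDIO2_END_OF_STREAM; // last packet of this stream
        voice->SubmitSourceBuffer(&buffer);       // appended to the voice's queue
    }

    voice->Start();

    // The voice works through the queue in order; BuffersQueued tells you how
    // many packets have not yet finished playing.
    XAUDIO2_VOICE_STATE state = {};
    voice->GetState(&state);
    // state.BuffersQueued, state.pCurrentBufferContext, ...
}
```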
Also, in DirectSound you had to copy the audio data into the memory provided by Lock, but modern 'packet'-based APIs avoid the extra copy by reading the source data directly out of your application memory. This does add the complication that you need to ensure the source memory remains available until all playback has stopped (i.e., you can't free the memory while it's still being read by the real-time mixer, or your application will crash), but in return you avoid a lot of extra copying.
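In XAudio2 the usual way to handle that lifetime is the voice callback: pass the allocation as the packet's pContext and only free it in OnBufferEnd. A rough sketch under that assumption (the callback object must be passed to CreateSourceVoice when the voice is created; names are mine):

```cpp
// Sketch of the memory-lifetime rule: the bytes handed to SubmitSourceBuffer
// are read in place by the mixer, so they must stay valid until the packet has
// finished. Here pContext identifies the allocation so OnBufferEnd can free it.
#include <windows.h>
#include <xaudio2.h>

struct ReleaseOnBufferEnd : public IXAudio2VoiceCallback
{
    void STDMETHODCALLTYPE OnBufferEnd(void* pBufferContext) override
    {
        // The mixer is done with this packet; only now is it safe to free it.
        delete[] static_cast<BYTE*>(pBufferContext);
    }

    // The remaining notifications are not needed for this sketch.
    void STDMETHODCALLTYPE OnVoiceProcessingPassStart(UINT32) override {}
    void STDMETHODCALLTYPE OnVoiceProcessingPassEnd() override {}
    void STDMETHODCALLTYPE OnStreamEnd() override {}
    void STDMETHODCALLTYPE OnBufferStart(void*) override {}
    void STDMETHODCALLTYPE OnLoopEnd(void*) override {}
    void STDMETHODCALLTYPE OnVoiceError(void*, HRESULT) override {}
};

// 'ownedData' was allocated with new BYTE[bytes]; ownership passes to the
// callback above, which must have been registered via CreateSourceVoice.
void SubmitOwnedPacket(IXAudio2SourceVoice* voice, BYTE* ownedData, UINT32 bytes)
{
    XAUDIO2_BUFFER buffer = {};
    buffer.pAudioData = ownedData;
    buffer.AudioBytes = bytes;
    buffer.pContext   = ownedData;   // handed back to OnBufferEnd when done
    voice->SubmitSourceBuffer(&buffer);
}
```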