Simple answer: don't use C++. Sorry, joke.
But if you want to take this kind of absolute control over memory management in C++, across library/module boundaries, and in a completely generalized way, you can be in for some terrible grief. I'd suggest that most people look for reasons not to do it rather than ways to do it.
I've gone through many iterations of this same basic idea over the years (decades, actually), from naively trying to overload operator new/new[]/delete/delete[] at a global level, to linker-based solutions, to platform-specific solutions, and I've actually reached the point you're aiming for: I have a system that allows me to see the amount of memory allocated per plugin. But I didn't reach it through the kind of generalized mechanism you desire (and which I originally desired as well).
C++ complicates things a bit because allocation and initialization are usually fused.
I would offer a slight twist to this statement: C++ complicates things because initialization and allocation are usually fused. All I did was swap the order, but the most complicating part is not that allocation wants to initialize, but that initialization often wants to allocate.
Take this basic example:
struct Foo
{
std::vector<Bar> stuff;
};
In this case, we can easily allocate Foo through a custom memory allocator:
void* mem = custom_malloc(sizeof(Foo));
Foo* foo = new(mem) Foo;
...
foo->~Foo();
custom_free(foo);
... and of course we can wrap this all we like to conform to RAII, achieve exception-safety, etc.
Except now the problem cascades. That stuff member using std::vector will want to use std::allocator, and now we have a second problem to solve. We could use a template instantiation of std::vector with our own allocator, and if you need runtime information passed to the allocator, you can override Foo's constructors to pass that information along with the allocator to the vector constructor.
But what about Bar? Its constructor may also want to allocate memory for a variety of disparate objects, and so the problem cascades and cascades and cascades.
Given the difficulty of this problem, the alternative generalized solutions I've tried, and the grief associated with porting them, I've settled on a completely de-generalized, somewhat pragmatic approach.
The solution I settled on is to effectively reinvent the entire C and C++ standard library. Disgusting, I know, but I had a bit more of an excuse to do it in my case. The product I'm working on is effectively an engine and software development kit, designed to allow people to write plugins for it using any compiler, C runtime, C++ standard library implementation, and build settings they desire. To allow things like vectors or sets or maps to be passed through these central APIs in an ABI-compatible way required rolling our own standard-compliant containers in addition to a lot of C standard functions.
The entire implementation of this devkit then revolves around these allocation functions:
EP_API void* ep_malloc(int lib_id, int size);
EP_API void ep_free(int lib_id, void* mem);
... and the entirety of the SDK revolves around these two, including memory pools and "sub-allocators".
For third party libraries outside of our control, we're just SOL. Some of those libraries have equally ambitious things they want to do with their memory management, and to try to override that would just lead to all kinds of clashes and open up all kinds of cans of worms. There are also very low-level drivers when using things like OGL that want to allocate a lot of system memory, and we can't do anything about it.
Yet I've found this solution works well enough to answer the basic question, "who/what is hogging all this memory?", very quickly. That question is often much more difficult to answer than the analogous one about clock cycles, for which we can just fire up any profiler. The accounting only applies to code under our control that uses this SDK, but within that scope we can get a very thorough memory breakdown on a per-module basis. We can also set superficial caps on memory use to make sure that out-of-memory errors are actually handled correctly, without actually trying to exhaust all the contiguous pages available in the system.
So in my case this problem was solved via policy: by building a uniform coding standard and a central library conforming to it that's used throughout the codebase (and by third parties writing plugins for our system). It's probably not the answer you are looking for, but this ended up being the most practical solution we've found yet.