Is it allowed to write an instance of Derived over an instance of Base?

Question

Say, the code

    class Derived: public Base {....}

    Base* b_ptr = new( malloc(sizeof(Derived)) ) Base(1);
    b_ptr->f(2);
    Derived* d_ptr = new(b_ptr) Derived(3);
    b_ptr->g(4);
    d_ptr->f(5);

seems to be reasonable and LSP is satisfied.

I suspect that this code is standard-allowed when Base and Derived are POD, and disallowed otherwise (because vtbl ptr is overwritten). The first part of my question is: please point out precise precondition of such an overwrite.

There may exist other standard-allowed ways to write over.

The second part of my question is: is there any other ways? What are their precise preconditions?

UPDATE: I do NOT want to write code like this; I am interested in theoretical possiblity (or impossibiliy) of such a code. So, this is "standard nazi" question, not a question "how can I ...". (Has my question to be moved to other stackoverflow site?)

UPDATE2&4: What about destructors? Supposed semantics of this code is "Base instance is (destructively) updated by slice of Derived instance". Let us assume, for the sake of simplicity, that Base class has a trivial destructor.

UPDATE3: The most intersting for me is the validity of access via b_ptr->g(4)

What are you actually trying to accomplish? It's hard to suggest an alternative without knowing. — Mark Ransom, Feb 02 '12 at 17:59
The line `b_ptr->g(4)` is UB, since the original object `*b_ptr` no longer exists after the second `new`. To get a new base pointer, you have to say something like `static_cast(d_ptr)`. — Kerrek SB, Feb 02 '12 at 18:04
Yeah, you should at least use **new and delete** operators instead of **malloc and free** functions. — Sergey Vyacheslavovich Brunov, Feb 02 '12 at 18:05
@Serge: He is using the `new` operator. If you want to argue he should use an `void* operator new(size_t)` allocation function, that's something completely different from the `new` operator. — Ben Voigt, Feb 02 '12 at 18:07
I am trying to imagine how you've justified this to yourself, and I've come up with two possibilities: 1) You've got an incredibly tight memory budget and the space occupied by the base is the only memory you are sure you can successfully reuse. The logic here is poor, so I'm going to presume this is not the case. 2) You are trying to deceive some client by "upgrading" an object without invalidating other references to that object. This seems more likely. Am I close? — Tim, Feb 02 '12 at 18:09
I'm saddened by the fact that a rule explicitly covering this exact scenario was needed. — Ben Voigt, Feb 02 '12 at 18:21
@Tim Yes, I am trying to deceive some client by "upgrading" an object without invalidating other references to that object. Thanks for wording. (But again, I do not want to deceive. I am interested in "does standard allow to deceive".) — user1123502, Feb 02 '12 at 19:52

score 8 · Accepted Answer · edited Feb 02 '12 at 20:10

You really need to do b_ptr = d_ptr after placement-new of Derived, in case the Base subobject isn't first in the layout of Derived. As written, b_ptr->g(4) evokes undefined behavior.

The rule (3.8 basic.life):

If, after the lifetime of an object has ended and before the storage which the object occupied is reused or released, a new object is created at the storage location which the original object occupied, a pointer that pointed to the original object, a reference that referred to the original object, or the name of the original object will automatically refer to the new object and, once the lifetime of the new object has started, can be used to manipulate the new object, if:

the storage for the new object exactly overlays the storage location which the original object occupied, and

the new object is of the same type as the original object (ignoring the top-level cv-qualiﬁers), and

the type of the original object is not const-qualiﬁed, and, if a class type, does not contain any non-static data member whose type is const-qualiﬁed or a reference type, and

the original object was a most derived object (1.8) of type T and the new object is a most derived object of type T (that is, they are not base class subobjects).

You also should probably be destroying the old object before reusing its memory, but the standard doesn't mandate this. However failure to do so will leak any resources owned by the old object. The full rule is given in section 3.8 (basic.life) of the Standard:

A program may end the lifetime of any object by reusing the storage which the object occupies or by explicitly calling the destructor for an object of a class type with a non-trivial destructor. For an object of a class type with a non-trivial destructor, the program is not required to call the destructor explicitly before the storage which the object occupies is reused or released; however, if there is no explicit call to the destructor or if a delete-expression (5.3.5) is not used to release the storage, the destructor shall not be implicitly called and any program that depends on the side effects produced by the destructor has undefined behavior.

@DeadMG: I got the standard quote right there that says otherwise. It's conditionally undefined behavior. — Ben Voigt, Feb 02 '12 at 18:05
It is not mandated - the text is right there: "the program is not required to call the destructor". What it says is UB is assuming the destructor will somehow be magically be called anyway. — Alan Stokes, Feb 02 '12 at 18:05
@user1123502: No, it's not legal. Using a pointer after creating a new object when the new and old types have a common base class but are not the same type is specifically forbidden. See the bolded text above. — Ben Voigt, Feb 02 '12 at 19:43

score 1 · Answer 2 · answered Feb 02 '12 at 17:58

1

You are initializing the same piece of memory twice. That won't end well.

Suppose for example that the Base constructor allocates some memory and stores it in a pointer. The second time through the constructor, the first pointer will be overwritten and the memory leaked.

answered Feb 02 '12 at 17:58

Mark Ransom

299,747
42
398
622

Sure, rewrite means "we will not call destructor". I corrected my question. – user1123502 Feb 02 '12 at 19:36

score 1 · Answer 3 · answered Feb 02 '12 at 18:03

1

I think the overwrite is allowed.

If it was me I'd probably call Base::~Base before reusing the storage, so that the lifetime of the original object ended cleanly. But the standard explicitly allows you to reuse the storage without calling the destructor.

I don't believe your access via b_ptr is valid. The lifetime of the Base object has ended.

(See 3.8/4 in either standard for the rules on lifetime.)

And I'm also not entirely convinced that b_ptr must give the same address as was originally returned by the malloc() call.

answered Feb 02 '12 at 18:03

Alan Stokes

18,815
3
45
64

In the first placement new, `b_ptr` is given the same address as was returned from `malloc` (it's not an array, therefore padding is forbidden). – Ben Voigt Feb 02 '12 at 18:06
@Ben I agree padding is not allowed. I wasn't completely certain that converting a Base * to a void * necessarily gave you a pointer to the first byte of the Base object, but of course it does here. – Alan Stokes Feb 02 '12 at 18:14

score 1 · Answer 4 · answered Feb 02 '12 at 18:12

If you write this code more cleanly, it is easier to see what goes wrong:

void * addr = std::malloc(LARGE_NUMBER);

Base * b = new (addr) Base;
b->foo();                    // no problem

Derived * d = new (addr) Derived;
d->bar();                    // also fine  (#1)

b->foo();                    // Error! b no longer points to a Base!

static_cast<Base*>(d)->foo(); // OK
b = d; b->foo();              // also OK

The problem is that at the line marked (#1), b and d point to entirely separate, unrelated things, and since you overwrote the memory of the erstwhile object *b, b is in fact no longer valid at all.

You may have some misguided thoughts about Base* and Derived* being convertible pointer types, but that has nothing to do with the present situation, and for the sake of this example, the two types may as well be entirely unrelated. It is only one the very last two lines that we use the fact that the Derived* is convertible to a Base*, when we perform the actual conversion. But note that this conversion is a genuine value conversion, and d is not the same pointer as static_cast<Base*>(d) (at least as far as the language is concerned).

Finally, let's clean up this mess:

d->~Derived();
std::free(addr);

The opportunity to destroy the original *b has passed, so we may have leaked that.

Is it allowed to write an instance of Derived over an instance of Base?

4 Answers4

Linked

Related