endl and flushing the buffer

Question

In the C++ primer book, in chapter (1), it mentions the following:

endl is a special value, called a manipulator, that when written to an output stream has the effect of writing a newline to the output and ﬂushing the buffer associated with that device. By ﬂushing the buffer, we ensure that the user will see the output written to the stream immediately.

What is meant by "flushing the buffer" here?

check out: http://stackoverflow.com/a/9125587/1155650 – Rohit Vipin Mathews Feb 14 '12 at 09:28 — Rohit Vipin Mathews, Feb 14 '12 at 09:28

score 46 · Accepted Answer · answered Jan 20 '11 at 20:10

46

Output is generally buffered before it's written to the intended device. That way, when writing to slow to access devices(like files), it doesn't have to access the device after every single character.

Flushing means emptying the buffer and actually writing it to the device.

answered Jan 20 '11 at 20:10

Benjamin Lindley

101,917
9
204
274

1

`std::endl` invokes `flush()`? – John Aug 30 '21 at 12:11

wich · Answer 2 · 2011-01-20T20:20:12.673

C++'s iostreams are buffered, that means that when you output to an ostream, the content will not immediately go to what is behind the stream, e.g. stdout in the case of cout. The implementation of the stream determines when to actually send the buffered part of the stream out. This is done for reasons of efficiency, it would be very inefficient to write to a network or disk stream byte by byte, by buffering this problem is solved.

This does however mean that when you write say debug messages to a log file and your program crashes you may lose part of the data you wrote to the log file through the stream, as a part of the log may still be in the stream's buffer and not yet written to the actual file. To prevent this from happening you need to make the stream flush its buffers either by an explicit flush method call, or by using the convenience of endl.

If however you're just writing to a file regularly you should use \n instead of endl to prevent the stream from unnecessarily flushing the stream every line reducing your performance.

Edited to include this note:

cin and cout have a special relationship, where reading from cin will automatically flush cout beforehand. This makes sure that the e.g. the prompt you wrote to cout will actually be seen by the user before the read from cin is waiting for input. Hence, even in cout you don't normally need endl but can use \n instead. You can create such relationships between other streams as well by tying them together.

Also, in many Unices, standard output is line buffered if it goes to a terminal, which means that `\n` will flush the buffer just like `endl`. (Normal buffering takes over if standard output is not a terminal, which you can see if you first pipe the output through a couple of programs, like `./myProg | grep a | less`.) — Daniel Gallagher, Jan 20 '11 at 20:21
@Daniel The important part of course being "if it goes to a terminal", which you should of course not assume for cout as it could well go anywhere else. My point being that there's no good reason to use endl when writing to cout. But yes, excellent point that I completely forgot about. — wich, Jan 20 '11 at 20:31
@Daniel Gallagher It's amazing! `./myProg | grep a | less` is not line buffered even if it's outputed to a terminal? — John, Aug 30 '21 at 12:03
@John Standard output for myProg and grep will not be line buffered as they are going into a pipe. Standard output for less will (almost certainly) be line buffered as it goes to the terminal — wich, Aug 30 '21 at 13:37

score 12 · Answer 3 · edited May 23 '17 at 12:34

What is meant by "flushing the buffer" here?

std::endl causes the data in the stream's internal staging memory (its "buffer") to be "flushed" (transferred) to the operating system. The subsequent behavior depends on what type of device the stream is mapped to, but in general, flushing will give the appearance that the data has been physically transferred to the associated device. A sudden loss of power, however, might defeat the illusion.

This flushing involves some overhead (wasted time), and should therefore be minimized when execution speed is an important concern. Minimizing the overall impact of this overhead is the fundamental purpose of data buffering, but this goal can be defeated by excessive flushing.

Background information

The I/O of a computing system is typically very sophisticated and composed of multiple abstraction layers. Each such layer may introduce a certain amount of overhead. Data buffering is a way of reducing this overhead by minimizing the number of individual transactions performed between two layers of the system.

CPU/memory system-level buffering (caching): For very high activity, even the random-access-memory system of a computer can become a bottleneck. To address this, the CPU virtualizes memory accesses by providing multilple layers of hidden caches (the individual buffers of which are called cache lines). These processor caches buffer your algorithm's memory writes (pursuant to a writing policy) in order to minimize redundant accesses on the memory bus.
Application-level buffering: Although it isn't always necessary, it is not uncommon for an application to allocate chunks of memory to accumulate output data before passing it to the I/O library. This provides the fundamental benefit of allowing for random accesses (if necessary), but a significant reason for doing this is that it minimizes the overhead associated with making library calls -- which may be substantially more time-consuming than simply writing to a memory array.
I/O library buffering: The C++ IO stream library optionally manages a buffer for every open stream. This buffer is used, in particular, to limit the number of system calls to the operating system kernel because such calls tend to have some non-trivial overhead. This is the buffer which is flushed when using std::endl.
operating system kernel and device drivers: The operating system routes the data to a specific device driver (or subsystem) based on what output device the stream is attached to. At this point, the actual behavior may vary widely depending on the nature and characteristics of that type of device. For example, when the device is a hard disk, the device driver might not initiate an immediate transfer to the device, but rather maintain its own buffer in order to further minimize redundant operations (since disks, too, are most efficiently written to in chunks). In order to explicitly flush kernel-level buffers, it may be necessary to call a system-level function such as fsync() on Linux -- even closing the associated stream, doesn't necessarily force such flush.

Example output devices might include...
- a terminal on the local machine
- a terminal on a remote machine (via SSH or similar)
- data being sent to another application via pipes or sockets
- many variations of mass-storage devices and associated file-systems, which may be (again) locally attached or distributed via a network
hardware buffers: Specific hardware may contain its own memory buffers. Hard drives, for example, typically contain a disk buffer in order to (among other things) allow the physical writes to occur without requiring the system's CPU to be engaged in the entire process.

Under many circumstances, these various buffering layers tend to be (to a certain extent) redundant -- and therefore essentially overkill. However, the buffering at each layer can provide a tremendous gain in throughput if the other layers, for whatever reason, fail to deliver optimum buffering with respect to the overhead associated with each layer.

Long story short, std::endl only addressed the buffer which is managed by the C++ IO stream library for that particular stream. After calling std::endl, the data will have been moved to kernel-level management, and what happens next with the data depends on a great many factors.

How to avoid the overhead of `std::endl`

Method 1: Don't use std::endl -- use '\n' instead.
Method 2: Don't use std::endl -- use something like the following version instead...

inline std::ostream & endl( std::ostream & os )
   {
   os.put( os.widen('\n') ); // http://en.cppreference.com/w/cpp/io/manip/endl
   if ( debug_mode ) os.flush(); // supply 'debug_mode' however you want
   return os;
   }

In this example, you provide a custom endl which can be called with-or-without invoking the internal call to flush() (which is what forces the transfer to the operating system). Enabling the flush (with the debug_mode variable) is useful for debugging scenarios where you want to be able to examine the output (for example a disk-file) when the program has terminated before cleanly closing the associated streams (which would have forced a final flush of the buffer).

...It seemed worthwhile to provide some context for the general concepts involved (IO layers, overhead, and buffering) and then describe what `std::endl` does *in terms of what it doesn't do* (but that someone might naively assume that it does) -- notably, pointing out the, perhaps subtle, distinction between flushing to a *device driver* (which `std::endl` does), and flushing to a *physical device* (which it might not do). — Brent Bradburn, Apr 17 '15 at 21:15
Since assertions firing could be a primary cause for IO buffers to go unflushed prior to program termination, it would be meaningful to tie `debug_mode` to the existence of the standard preprocessor define `NDEBUG` (which also controls asserts) -- or just disable the flush directly with `#ifndef NDEBUG`. — Brent Bradburn, Apr 17 '15 at 21:32

score 1 · Answer 4 · answered Feb 23 '17 at 13:45

When using std::cout, the operand used after the output operator ( << ) are stored in a buffer and are not displayed onto the stdin (usually terminal, or the command prompt) until it comes across std::endl or std::cin, which causes the buffer to be flushed, in the sense, display/output the contents of the buffer onto the stdin.

Consider this program:

#include <iostream>
#include <unistd.h>

int main(void)
{
    std::cout << "Hello, world";
    sleep(2);
    std::cout << std::endl;

    return 0;
}

The output obtained will be:

after 2 seconds

Hello, World

score 0 · Answer 5 · edited Jun 22 '19 at 06:00

One simple code to show you the effects of buffered I/O in c++

Whatever input you provide is buffered and then passed on to the program variables in case of inputs.

Have a look at the code below:

//program to test how buffered  I/O can have unintended effects on our program

#include<bits/stdc++.h>
using namespace std;

int main()
{
    int a;
    char c;
    cin>>a;
    cin>>c;
    cout<<"the number is : "<<a;
    cout<<"\nthe character is : "<<c;
}

here we have declared two variables one int and one char if we input the number as "12d34" this will cause the int variable to accept only 12 as value and it will discard the rest which will still be there in the buffer. And in the next input the char variable will automatically accept the value "d" without even asking you for any input

endl and flushing the buffer

5 Answers5

Background information

How to avoid the overhead of `std::endl`

Linked

Related

endl and flushing the buffer

5 Answers5

Background information

How to avoid the overhead of std::endl

Linked

Related

How to avoid the overhead of `std::endl`