Memory Layout of Program

06/12/2022 Tags: C_C_plus_plus Embedded Programming

“In computing, a code segment, also known as a text segment or simply as text, is a portion of an object file or the corresponding section of the program’s virtual address space that contains executable instructions. The term “segment” comes from the memory segment, which is a historical approach to memory management that has been succeeded by paging. When a program is stored in an object file, the code segment is a part of this file; when the loader places a program into memory so that it may be executed, various memory regions are allocated (in particular, as pages), corresponding to both the segments in the object files and to segments only needed at run time.”

Brief

When we declare a variable in program, C++ allocates space for that variable from one of several memory regions: One region of memory is reserved for variable that persist throughout the lifetime of the program, such as constant. This information is called static data. One region of memory is reserved for allocating a new block of memory called a stack frame to hold its local variables. This information is called stack. One region of memory is reserved for allocating memory dynamically. This space comes from a pool of memory called the heap.

In this blog post, I would like to discuss the typical layout of a simple computer’s program memory.

Typical Code Segment

The typical layout of a simple computer’s program memory is with the text, various data, and stack and heap sections.

Text section: contains executable instructions and is sharable so that only a single copy needs to be in memory for frequently executed programs. It is often read-only and may be placed below the heap or stack in order to prevent heaps and stack overflows from overwriting it.
Data section: divided into two parts
- Initialized Data Segment: contains the global variables and static variables that are initialized by the programmer . It is not read-only since the values of the variables can be altered at run time.
- Uninitialized Data Segment: often called the “bss” segment and is initialized by the kernel to arithmetic 0 before the program starts executing uninitialized data starts at the end of the data segment . It contains all global variables and static variables that are initialized to zero or do not have explicit initialization in source code.
Stack section: contains the program stack, a LIFO structure and stores virtual pointer. Each time a recursive function calls itself, a new stack frame is used, so one set of variables doesn’t interfere with the variables from another instance of the function. When the program tries to use more memory space than the call stack has available, it will occur a stack overflow.
Heap section: managed by malloc, realloc, and free, which may use the brk and sbrk system calls to adjust its size. (brk, sbrk – change data segment size). It is shared by all shared libraries and dynamically loaded modules in a process.

<Remark> An industry group led by major Japanese central processing unit (CPU) manufacturers have addressed the shortcomings of C++ for embedded applications: While maximizing execution efficiency and making compiler construction simpler, the effort of programming is to preserve the most useful object-oriented features of the C++ language yet minimize code size.