Visualizing memory allocations

2025-02-17

I've been recently implementing a simple to region-based memory allocator to use in some future C projects. Regions are a nice way to simplify dynamic memory management since it's very similar to stack allocation and the headache of managing the lifetimes of each individual heap-allocated object is essentially eliminated.

I'm not going to go into how region-based allocators work as that has been thoroughly explored in multiple articles and blogs over the years, but I thought that I'd share a simple technique that I've been using to debug and study the behavior of my implementation. I have a definition of a region and a basic set of functions for creating one and allocating bytes from it:

typedef struct { char* start; char* end; char* bumpPtr; } Region; Region Region_New(ptrdiff_t capacity); void* Region_Alloc(Region*, ptrdiff_t bytes, ptrdiff_t alignment); void Region_Free(Region*);

Creating and freeing a region is trivial, so the allocation function is the one that I've been looking into. With each allocation the bump pointer is supposed to be moved such that it points to the start of the chunk of bytes allocated for the caller. As the caller is expected to specify an alignment along with the number of bytes needed, there's some calculation involved in figuring out how much the pointer should move. My allocator also allocates memory from the end of the region towards the start, so any padding bytes needed for alignment have to be added before the bytes allocated for the caller.

My first approach to following the movement of the bump pointer was to just print it's value along with the number of bytes requested and the number of bytes needed to pad the allocation according to the specified alignment, so pretty much what I'd expect most people would start with. This obviously worked fine initially, but I quickly found myself whipping out a calculator to ensure that everything was behaving as expected when trying out some more complicated allocation patterns. Manual calculations tend to get tedious after a while, so I decided to write a script to visualize these prints in a way that I can quickly understand.

I added an #ifdef to the allocation function for tracing which contains a comma-separated print for the pointer values and calculated byte amounts that are needed to verify that everything works correctly. Prior to printing these values, the first allocation also prints a corresponding CSV header, so when I pipe the output from my test program to a file, I'm left with a easily processable CSV file in the end. A generated file looks something like this:

start,end,bumpPtr,bytesRequested,bytesNeededForAlignment f0882a0,f0882e0,f0882dd,3,0 f0882a0,f0882e0,f0882bc,32,1 f0882a0,f0882e0,f0882b0,8,4 f0882a0,f0882e0,f0882af,1,0 f0882a0,f0882e0,f0882ae,1,0 f0882a0,f0882e0,f0882a8,4,2

The start and end pointer values are of course kind of superfluous after the print from the initial allocation, but with them available this approach works as it is for investigating multi-region scenarios as well, so I thought this was a nice format on top of which to build the visualizer script.

The script starts out with some data type definitions:

#!/usr/bin/python from collections import namedtuple from csv import DictReader from enum import Enum Region = namedtuple("Region", ["start", "end", "allocs"]) Alloc = namedtuple( "Alloc", [ "bump_ptr", "bytes_requested", "bytes_needed_for_alignment", ], ) Byte = Enum("Byte", [("FREE", 1), ("ALLOCATED", 2), ("PADDING", 3)])

These are mostly there to ensure that the script is not working with raw dictionaries everywhere, which seems to be the case far too often with Python. After that we have some code to transform the rows read from the CSV into these Region and Alloc tuples:

if __name__ == "__main__": main() def main(): with open("trace.csv", newline="") as trace_csv_file: reader = DictReader(trace_csv_file) regions = {} for row in reader: start, end, *alloc_params = parse_row(row) if start not in regions: regions[start] = Region(start, end, []) regions[start].allocs.append(Alloc(*alloc_params)) # ... def parse_row(row): return ( int(row["start"], 16), int(row["end"], 16), int(row["bumpPtr"], 16), int(row["bytesRequested"]), int(row["bytesNeededForAlignment"]), )

With the CSV parsed, the script can then move on to calculating how each byte within the region is used. For that there's a function and an enumeration defined:

Byte = Enum("Byte", [("FREE", 1), ("ALLOCATED", 2), ("PADDING", 3)]) def calculate_byte_use(region): region_bytes = {b: Byte.FREE for b in range(region.start, region.end)} for index, alloc in enumerate(region.allocs): alloc_start = alloc.bump_ptr alloc_end = alloc_start + alloc.bytes_requested for alloc_byte in range(alloc_start, alloc_end): region_bytes[alloc_byte] = (Byte.ALLOCATED, index % 2) padding_start = alloc_end padding_end = padding_start + alloc.bytes_needed_for_alignment for alignment_byte in range(padding_start, padding_end): region_bytes[alignment_byte] = Byte.PADDING return region_bytes

The calculations are very simple, which is nice. One noteworthy thing in calculate_byte_use() is the line where a byte is marked as allocated, as it includes 0s or 1s for the bytes within an allocation to allow the visulizing function to differentiate between adjacent allocations. I found this handy since some neighboring allocations obviously may not have any padding in between them.

Now that we have each byte within a region checked, creating various kinds of visualizations is possible. I started out with this function that just prints a character for each byte within the region:

def visualize_stdout(bytes_, bytes_per_line=16): byte_symbols = { Byte.FREE: ".", Byte.ALLOCATED: ["o", "x"], Byte.PADDING: "_", } for index, current_byte in enumerate(reversed(bytes_.values()), start=1): if isinstance(current_byte, tuple): symbol = byte_symbols[current_byte[0]][current_byte[1]] else: symbol = byte_symbols[current_byte] print(symbol, end="\n" if index % bytes_per_line == 0 else "") if len(bytes_) % bytes_per_line != 0: print()

The os and xs denote allocated bytes, while _ and . represent inserted padding bytes and free bytes, respectively. To me this is already quite a big improvement over using a calculator, but we can take it a step further to visualize allocations that may not easily fit on one screen in a terminal.

Visualizing all bytes in something like a 1KB region, which is ultimately still quite small, needs something a bit more colorful to make it easily comprehensible in a single glance. Initially I considered generating PPM images, but ultimately ended up implementing a function to output HTML since it was very simple to hack together in a couple of minutes. So I added this function that outputs an HTML file with a table where each cell represents a single byte:

from math import sqrt ... def visualize_html(bytes_): bytes_per_row = int(sqrt(len(bytes_))) html_start = f""" <!DOCTYPE html> <html> <head> <style> table {{ width: 90vh; margin-left: auto; margin-right: auto; }} td {{ width: {100.0 / bytes_per_row}%; position: relative; }} td div {{ aspect-ratio: 1 / 1; }} td .free {{ background: gainsboro; }} td .alloc0 {{ background: deepskyblue; }} td .alloc1 {{ background: lightseagreen; }} td .padding {{ background: salmon; }} </style> </head> <body><table>""" html_end = "</table></body></html>" cell = '<td><div class="{}"></div></td>' byte_cells = { Byte.FREE: cell.format("free"), Byte.ALLOCATED: [cell.format("alloc0"), cell.format("alloc1")], Byte.PADDING: cell.format("padding"), } table_content = "" for index, current_byte in enumerate(reversed(bytes_.values())): if index % bytes_per_row == 0: table_content += "<tr>" if isinstance(current_byte, tuple): table_content += byte_cells[current_byte[0]][current_byte[1]] else: table_content += byte_cells[current_byte] if index % bytes_per_row == bytes_per_row - 1: table_content += "</tr>" with open("trace.html", "w") as html_file: html_file.write(html_start + table_content + html_end)

In the generated HTML the cells denoting allocated bytes are colored green and blue, padding bytes are red and free bytes are grey:

I could definitely keep going with this idea, but I think this is plenty for now. The allocator seems to work fine after all.