Understanding Memory Management in Unity

ijemin · December 15, 2025, 5:55am

Hi! I’m Jemin Lee, a Partner Engineer at Unity.

Over the years, there are two questions I’ve heard many times whenever developers start profiling memory in Unity:

Why doesn’t my managed heap or virtual memory usage drop, even after the GC runs?
Is it true that calling GC.Collect() multiple times can reduce physical or dirty memory usage?

The short answers are:

Unity’s GC doesn’t free heap memory - it recycles it. This is intentional.
And while physical memory can decrease in some cases, manually triggering GC is almost always the wrong way to achieve it .

In this article, we’ll dive into Unity’s managed heap and how the Garbage Collector (GC) actually works, and tackle the myth of manually forcing memory cleanup.

How OS Memory Works

Before we dive into Unity, let’s take a quick refresher on how the operating system (OS) handles memory.

Physical vs. Virtual Memory

Your device has a fixed amount of Physical Memory (RAM). This is the hardware where data is loaded for real-time access. However, your application doesn’t interact with physical memory directly. Instead, it operates in its own sandboxed Virtual Memory (VM).

This is a crucial design that enables the OS to efficiently manage physical RAM, allowing your app to focus on its functionality without needing to worry about the details. For example, the OS can compress less-used heap regions in Physical Memory to make space, and your app will never know the difference.

Page and Frame

Virtual memory is divided into pages. The typical page size is 16KB on mobile (iOS, Android) and 4KB on desktop (Windows, Linux). Therefore, the total virtual memory usage of your app is equal to the number of pages multiplied by the page size.

The pages of virtual memory can be mapped to physical memory as shown in the following figure. The unit of physical memory that corresponds to a page is called a frame.

Paging in: Loading a page of virtual memory into a page frame of physical memory.
Paging out: Releasing an unnecessary page frame from physical memory.

Let’s look at a quick example. Imagine your code asks the OS for enough memory to hold a large array corresponding to 3 pages(48KB).

const int sizeOfPage = 16384;
const int numberOfPages = 3;

// Allocate an int array corresponding to 3 pages
int* array = (int *)malloc(numberOfPages * sizeOfPage);

Side note: This is a simplified example that assumes a malloc call mmap/VirtualAlloc - which reserves virtual memory. But malloc internally could use brk/sbrk or its own pool (User-space allocator) for small allocations.

When malloc returns, you’re getting virtual memory only. At this point, no physical frames have been assigned yet.

Now, what happens when we actually use that memory?

array[0] = 141;

When we set array[0] = 141, this triggers what’s called a page fault: OS detects that we’re accessing a virtual page that hasn’t been paged in yet. Then, OS will load the page into Physical Memory.

And that page becomes what we call ‘dirty’ - meaning it’s been modified.

Note : Windows vs Unix-like memory model

There are three concepts regarding mapping pages into Physical Memory.

Reservation : Reserve virtual address space without physical memory.
Committing : Assign physical pages to the reserved virtual address range.
Decommitting : Release physical pages back to the OS while keeping the virtual address space reserved.

And Windows gives you explicit APIs for all of this: VirtualAlloc and VirtualFree. Windows separates three concepts very clearly. The developer can call VirtualAlloc/Free to explicitly manage each step.

On Unix-like systems (Linux, macOS, iOS, Android), reservation and commit are not strictly separated.

mmap API from Unix-like OS reserves address space, but commit is lazy. There is no explicit commit or decommit. The closest equivalent to the Windows decommitting API is madvise(DONTNEED), which is only a suggestion. The OS ultimately decides when memory is reclaimed.

Clean vs. Dirty Page

When a page is brought into physical memory, the OS keeps track of whether your app has modified it. This creates two categories:

Clean Pages: Pages without any written value yet, or pages containing data that has not been changed.

Think of your game’s executable code, read-only data, or textures. If the OS is low on memory, it can safely discard a clean page because it can always be reloaded from the original file on disk. Clean pages have a low impact on your app’s stability.

Dirty Pages: Pages that your app has written to and modified.

The OS cannot discard this page from RAM, because the data would be lost. Dirty pages are the primary contributor to your app’s true memory footprint.

Understanding the Managed Heap Structure

Now let’s move into Unity-specific territory. The managed heap.

Managed Heap

The managed heap is where all your C# object allocations occur. This is managed by the garbage collector.

Segments

The managed heap doesn’t work with raw pages directly. Instead, it uses what we call segments - sometimes also referred to as blocks. Each segment consists of one or multiple pages, depending on the garbage collector implementation and platform configuration.

A segment serves as the GC’s internal management unit and can hold one or more managed objects. The GC allocates, marks, and sweeps memory within segments, while the operating system manages physical memory at the page level.

Additional Note: Object allocation does not occur on a per-page basis.

The diagram is intentionally drawn to illustrate the general concept of a segment being composed of multiple pages. However, in Unity’s current GC default configuration, the GC heap block size typically matches the OS page size. As a result, in practice, a segment often maps 1:1 to a single OS page.

That does not mean a segment can contain only one object.

While a memory segment is ultimately reserved and mapped at the OS page level, small object allocation does not occur on a per-page basis. Instead, small object allocations are subdivided and packed within a single segment, allowing multiple managed objects to share the same underlying OS page.

For example, assume a platform where the page size is 16 KB, and the GC heap block size is also configured to be 16 KB, resulting in a 1:1 mapping between a segment and an OS page.

If a single 4-byte float is allocated inside that segment, the OS will treat the entire 16 KB page as dirty, because memory is tracked at page granularity at the OS level.

However, this does not mean that the remaining space inside the page is unusable. From the GC’s perspective, only 4 bytes are occupied, and the remaining space within the same segment can still be subdivided and reused for additional small object allocations.

How allocation and garbage collection works in Managed Heap?

Let’s see what’s really going on on Managed Heap. Here you can see multiple segments containing multiple objects in the Managed Heap.

When you allocate a new object, it is placed in an existing segment if there’s room. If there’s not enough free space for the new allocation, the GC creates a new empty segment and puts the new object there.

When garbage collection executes, unreachable objects (also known as dead objects) are logically collected. The gray blocks below represent objects collected by GC.

The memory regions occupied by dead objects inside segments are recorded in a free list for future reuse. Note that dead objects are no longer referenced, but their pages are still mapped in physical memory and marked as dirty by the OS.

Later, when new objects are allocated:

(1) GC reuses the memory regions that previously held collected objects when a new allocation fits their available space.
(2) If the remaining free space within those segments isn’t large or contiguous enough, GC will search for another suitable region within the existing heap.
(3) If no such region is found, it expands the managed heap by reserving a new memory segment from the OS.

But why doesn’t the memory usage drop?

So far, we’ve seen how GC removes unused objects and how those freed spaces can be reused for new allocations.

However, here’s the key point: even after all objects inside pages have been collected, those pages are not immediately unmapped or released back to the operating system.

Heap pages that once held collected objects remain mapped in physical memory, still marked as dirty pages at the OS level. Even if Unity GC marks them as reusable, the OS still considers them dirty because they once contained modified data.

From the OS’s perspective, these pages remain part of the app’s memory footprint until they’re explicitly unmapped - something Unity’s GC rarely does in real time.

That’s why, in tools like Xcode Instruments, you’ll still see a large Dirty Size even after a GC pass - those pages aren’t cleared physically, only logically.

Empty but Dirty Segments

After a GC cycle, you might have segments that are logically empty but whose backing OS pages are still resident in physical memory. No live objects are inside, but they’re still occupying RAM because they contain dirty pages.

In most cases, this is actually fine. No problem at all. These segments are often reused soon for new allocations, so keeping them in memory makes sense.

But when does it become a problem? If a segment remains empty for a long time and still contains dirty pages, it unnecessarily consumes physical memory. That’s when the GC needs to take action.

What Unity GC does

Unity automatically unmaps heap segments that have remained completely empty for multiple GC cycles (every 6th GC.Collect, but this number may vary between engine versions). This process frees long-unused segments from physical memory while keeping the same virtual heap size.

Unity GC removes heap segments that have been empty for several GC cycles and remaps new segments of the same size and at the same virtual address space. The total virtual heap size stays the same, but resident memory may occasionally shrink.

This remapping behavior can also refresh memory locality. When Unity remaps the pages of a long-unused segment, the operating system provides newly allocated physical pages, often placed in more recently accessed memory regions, which could provide a tiny performance improvement.

Side note: Unmapping unused segments on Windows

On Windows, because Windows supports explicit decommitting through VirtualFree, the GC decommits the pages of an empty segment without dropping the segment entirely.

Important : Why You Shouldn’t Trigger This Manually

Although calling GC.Collect() multiple times can sometimes cause this unmapping to occur, doing so manually is not recommended.

This process is intended only for long-unused segments.

If you call GC.Collect() several times in the same frame, it will also unmap pages that were freed only a frame or two ago - pages that Unity would have reused in the next frame anyway.

This causes unnecessary CPU work, more frequent remapping, and possible heap fragmentation.

In short, Unity already handles long-unused segments automatically, and even optimizes memory locality. Forcing GC manually to achieve this doesn’t reduce real memory usage; it only interrupts Unity’s natural optimization cycle.

Large Object Heap

Everything discussed so far applies to the general managed heap. But in Unity’s current GC (based on Boehm GC), large managed objects that exceed a certain size threshold are handled through a separate allocation path. The GC places large objects on the large object heap(LOH).

Typical examples of large objects include large managed arrays (such as byte[], vector3[], etc.) or huge temporary buffers used during loading or processing. Large objects follow different allocation, reuse, and reclamation rules compared to small objects.

How large?

The definition of a large object is GC-implementation specific.
For example, in Microsoft’s .NET runtime:

If an object is greater than or equal to 85,000 bytes in size,
it’s considered a large object and allocated on the Large Object Heap.

In Unity, the large-object threshold is typically 8KB on mobile and 2KB on desktop, but the exact value depends on platform and build configuration.

Side Note

In Unity GC, an object equal to or larger than MAXOBJBYTES is treated as a large object. MAXOBJBYTES is defined as half of HBLKSIZE(which is GC’s internal heap block size).

For IL2CPP builds, you can verify this value directly in the generated Xcode project by inspecting gc_priv.h under the Il2CppOutputProject/IL2CPP/external/bdwgc.

How large objects are allocated

When a large object is allocated, the GC bypasses the regular small-object free lists. This means it does not attempt to reuse fragmented free space between small objects (which in practice would not be suitable for large allocations anyway).

Instead:

The requested size is rounded up to a whole number of heap blocks
A contiguous sequence of heap blocks is allocated.
That entire range is treated as a single, atomic object.

For example (assuming a 4 KB heap block size):

Requesting 6 KB → 8 KB (2 blocks) allocated
Requesting 17 KB → 20 KB (5 blocks) allocated

Once allocated, the large object exclusively occupies that contiguous block range.

Cross-allocation constraints

Large objects are tracked using dedicated large-object free lists, which are separate from the regular small-object free lists. As a result:

Free space within a live large-object block range is never reused for small-object allocations.
Likewise, fragmented free space between small objects is never combined to satisfy a single large object allocation.

At first glance, this behavior may seem wasteful. For example, consider a case where a 17 KB large object is allocated in 20KB block range (five contiguous blocks). Although there may be unused space remaining within that range, that space cannot be sub-allocated for smaller objects.

Similarly, once the 17 KB object becomes unreachable and is collected, the entire 20 KB block range becomes free. At that point, it might appear that the GC could reuse that space for multiple smaller allocations. In practice, it is most naturally reused by another large allocation of a similar size.

A freed large-object block is initially reused as a whole. While the GC can reuse regions previously occupied by large objects for other purposes, this requires the block to be explicitly repurposed and split, which does not happen automatically at collection time.

This is a design choice. Large objects are treated as indivisible units to avoid ambiguous liveness and bookkeeping complexity

LOH Fragmentation

Because of these characteristics, large objects tend to contribute more visibly to memory fragmentation:

Reuse requires a sufficiently large, contiguous block.
Smaller allocations cannot borrow space from large free blocks.
Large free regions may remain unused for long periods if no similarly sized allocation occurs.

However, completely free large blocks are not wasted forever. They may later be repurposed when allocation pressure requires it.

What happens when a large object is freed

When a large object becomes unreachable and is collected:

The block range is immediately added to the large object free list
Another large allocation can reuse this space directly
This is the fast path and happens without overhead

Adjacent free blocks are coalesced when possible, forming larger contiguous free regions. The freed block retains its full size and is managed purely based on size, not on its previous use.

At this stage, the block remains intact. It is not immediately made available to small-object allocation paths.

Allocation-time reuse and lazy splitting

If a future allocation requires heap space and no suitable block is available through the usual path, the GC may:

Select a completely free large heap block.
Split it on demand.
Use a portion of it as backing storage for the allocation.
Return the remaining portion to the free list.

This lazy splitting occurs at allocation time(not during collection).

Large objects monopolize their block ranges while alive, and freed blocks are reused conservatively. It’s split and repurposed only when future allocation pressure requires it.

Unity’s Current GC and Future Changes(CoreCLR)

The behavior described in this article reflects Unity 6’s current garbage collector implementation, which is based on a Boehm GC. Boehm GC is a conservative, non-moving garbage collector.

Conservative GC

In a conservative GC, objects are never relocated during collection. Live objects remain exactly where they were originally allocated inside a segment, and the collector only identifies which objects are reachable and which are not.

Memory is reclaimed by marking dead objects as free, not by rearranging live ones. Because objects are not moved, heap compaction does not occur.

Compaction is a GC strategy where live objects are moved closer together,
eliminating gaps between them and reducing fragmentation.

Over time, this naturally leads to fragmentation inside heap segments: free spaces may exist, but they are often too small or too scattered to be reused efficiently.

This design is intentional. Moving objects during GC is expensive. It requires relocating objects in memory and updating all references that point to them.

In practice, allocation speed is often prioritized over long-term fragmentation. Avoiding object movement keeps allocations simple and cheap.

Additionally, to move objects safely, the GC must know exactly which values are real object references. However, a conservative GC does not have that level of precision.

Instead, it treats any value that looks like a pointer into the managed heap as a potential reference, even if it is not an actual object reference. This approach deliberately prioritizes safety over precision.

CoreCLR - Moving GC

As Unity continues its transition toward newer .NET runtimes and CoreCLR, some of the constraints described above may change.

CoreCLR-based garbage collectors are precise and compacting by design. They are able to relocate live objects and update references, which reduces fragmentation.

However, a GC optimized for general-purpose workloads is not automatically a perfect fit for real-time game engines, as moving objects and compacting memory introduce additional CPU workloads.

Any use of compaction will need to be tuned specifically for real-time workloads. For this reason, Unity will not simply adopt CoreCLR’s GC behavior “as-is”.

Summary

Virtual Memory ≠ Physical Memory
Memory segments (blocks) are allocated and freed in page units
Small object allocations are subdivided and packed within a segment
- Multiple small objects can share the same underlying OS page
A single large object occupies one or more contiguous segments exclusively
Dead objects don’t reduce physical memory
GC performs logical cleanup, not physical unmapping.
Dirty pages remain mapped and visible in OS profilers until Unity remaps them.
The managed heap’s virtual size only grows, while resident memory may shrink slightly over time
Manual GC.Collect() calls to trigger remapping segments only hurt performance. Let Unity handle it.
Large managed objects are allocated through a separate large-object allocation path
Unity 6’s current GC is based on a conservative, non-moving GC, which prioritizes safety and predictable allocation over compaction.
CoreCLR introduces precise, moving, and compacting GC capabilities, which may change some of the behaviors described above.
CoreCLR in Unity will be carefully tuned for real-time game workloads

meredoth · December 15, 2025, 6:19am

This is a solid and much needed post, it explains Unity’s managed heap behavior clearly and dispels a lot of long standing misconceptions around GC and memory usage. I do think there are two important additions that would make it more complete and future proof:

It would be valuable to explicitly cover how the Large Object Heap behaves in Unity. LOH allocations follow different rules than the rest of the managed heap, are more prone to fragmentation, and are relatively easy to hit in Unity projects (large arrays, buffers, meshes, interop data, etc.). Because LOH segments are handled differently, they can materially affect both performance and observed physical/dirty memory usage. Calling this out would help readers understand cases where memory behavior still looks “wrong” even when they follow best practices.
It would also help to clearly anchor this article to the current Unity GC implementation and include a note that the behavior described may change as Unity continues moving toward newer .NET GC technologies. Without that context, readers coming to this article in a year or two may incorrectly apply these conclusions to a different GC with different segment, compaction, and decommit behavior.

Trisibo · December 15, 2025, 5:17pm

This is a bit confusing, maybe it’s a typo? Should it be something like “you might have segments that are logically empty but physically used/full”?

ijemin · December 16, 2025, 12:55am

I applied your feedback. Thanks

ijemin · December 16, 2025, 1:07am

Thanks for the feedback. I will add that part. It would take some time, though. I want to double-check that we modified some functionality from the original Boehm GC.

That’s a good point. I will add that too. We plan to migrate to CoreCLR in the future, and CoreCLR supports compacting, which is not supported by the Boehm GC, the basis of Unity GC.

wadim-al · December 16, 2025, 5:03pm

Can this article be extended for the case of the HTML5 exported builds? E.g. what memory structure, limits and usage pattern is with Unity game compiled to JS/WASM with Emscripten and il2cpp. What happens on the IOS (cause of the crashes)?

skyake · December 22, 2025, 4:19am

Have you ever considered the .NET experimental soft real-time GC “Satori GC”? It’s a moving GC but the single pausing time is guaranteed to be less than 1ms in most scenarios, without sacrificing the throughput performance compared to DATAS, while maintaining small memory footprint similar to WKS GC: GitHub - VSadov/Satori: Experimenting with dotnet runtime.

We have deployed Satori GC in our production environment and it now runs for months without any issue. In our real-world heavy workload we observed maximum GC pause of 11ms, and P99.9 GC pause of 0.8ms during handling heavy requests. See metrics below.

The game osu! also adopted Satori GC in their experimental channel and it performs very well.

Even in a GC-heavy benchmark BinaryTree, we observed the maximum latency of 27.004ms in WKS GC, 146.004 in Server GC, 61.818ms Server GC (DATAS), and 1.806ms in Satori GC. So I would say that Satori GC fits real-time scenario very well.

See more discussions on https://github.com/dotnet/runtime/discussions/115627

Saeed-Barari · December 22, 2025, 7:54am

This has been very helpful, and there are questions here that would make it even more insightful when they get answered! (the WebGL crashes on iOS for example has been a huge pain! We eventually got around it for %90 of the time by pre-allocating everything in the Player Settings memory section)
Now that things are going forward with more clarity with the community and we get to learn some of the internals and how we can best utilize them, I would like to propose a feature (or rather a set of them) that would greatly accelerate the “clarity” part. What if Unity let us utilize some of the internal heap management APIs, as part of UnityEngine.UnsafeUtility? For example

I can see a clear use-case for something like UnityEngine.UnsafeUtility.DropUnusedMemory() that we could potentially call after we return from the game to the main menu.
Or let us directly delete an object without having to drop all references to it manually (UnityEngine.UnsafeUtility.DropObject(System.Object)?). I’m sure there could be a nice IDisposable wrapper around manually handled objects that we could use, much like RAII objects in C++. (you may create a String and when it goes out of scope, it gets deleted without bothering the GC at all).

Additionally, I would like to know if it would be possible to utilize the “data segment” for arena allocation techniques. This could separate concern of a lot of the static objects from the GC. Moreover, it could introduce a new best-practice (on memory usage, like the how the collection pool system did) to try to use these data segments over heap allocations. In general, having more control over the memory management and taking control of some objects away from GC is what I’m talking about; more clarity, less restrictions, but the same old defaults so it won’t complicate things if we don’t want it.

Super excited for CoreCLR update!

Edit: wordings

ijemin · December 22, 2025, 8:21am

Yes, the memory behavior of HTML5 (WebGL) builds is very different from other desktop or mobile platforms.

In WebAssembly, freeing a memory block does not result in OS-level decommit or unmapping of memory. WebAssembly uses a single, grow-only linear memory model and currently does not provide a standard equivalent to OS-level mmap/munmap.

From the OS or browser’s perspective, this linear memory is treated as one large contiguous memory chunk. When free() is called, the memory is marked as available for reuse inside the allocator, but it is not returned to the OS.

The browser still considers the entire linear memory range as owned by the WebAssembly instance.

So, even when the GC frees entire heap blocks that could be decommitted on other platforms, that memory can only be reused internally by the allocator in WebAssembly builds. It cannot be returned to the operating system, so physical or dirty memory usage does not decrease.

There are ongoing proposals, such as memory.discard and virtual memory extensions, that aim to address this limitation in the future: memory-control/proposals/memory-control/Overview.md at main · WebAssembly/memory-control · GitHub

On WebGL, the garbage collector can only run when no managed code is executing and typically runs at the end of a frame. This leads to two important characteristics.

The Web platform does not support incremental garbage collection.

Because the GC cannot safely interleave with managed code execution, it must run in a single pass at a well-defined synchronization point.

Temporary allocations are not reclaimed gradually during execution.

Instead, short-lived objects created inside loops or per-frame logic can accumulate until the frame ends, causing a much higher peak memory usage than on other platforms. Even if those objects are immediately eligible for collection, they still contribute to memory pressure until the GC finally runs.

Unity’s official documentation covers these Web-specific memory in more detail here:

ijemin · December 22, 2025, 10:49am

I just added LOH section!

meredoth · December 22, 2025, 2:01pm

Really useful information about the 4 KB threshold for the LOH in Unity’s garbage collector. I had no idea about that.

You’re spoiling us, so we’ll be expecting a new post covering the intricacies of the CoreCLR GC modifications when that happens.

ijemin · December 22, 2025, 2:49pm

Thanks! Glad it was useful!

I just added a small clarification regarding the LOH 4KB threshold. I mentioned 4 KB because I’d previously seen the same question answered with “it’s probably around 4 KB”.

After double-checking gc_priv.h, I found the threshold is actually 2 KB.

MAXOBJBYTES(large-object routing threshold) is defined as HBLKSIZE / 2. With no special configuration, HBLKSIZE is 4KB on desktop, so MAXOBJBYTES will be 2KB(memory allocation for a large object is rounded up to HBLKSIZE, so 2KB large object will take 4KB blocks though).

ijemin · December 22, 2025, 9:31pm

This is actually the first time I’ve heard about Satori GC, but after reading through the GitHub discussion it does look very interesting.

I should clarify that I’m not part of the team working on Unity’s scripting backends, so I don’t have insight into whether alternatives GC like this were considered internally.

My personal understanding is that our current direction is primarily focused on supporting modern .NET standards via CoreCLR, so I would assume it’s less likely that Unity would adopt a GC that is outside the .NET runtime itself.

Still it’s genuinely fascinating to see different GC designs within the .NET ecosystem. I’ll definitely try experimenting with Satori GC myself. Thanks for the sharing!

equinecares · December 28, 2025, 7:08pm

thank you
It would be valuable to explicitly cover how the Large Object Heap behaves in Unity.

hooram950 · January 12, 2026, 3:15pm

Great write-up overall this really clears up why memory “sticking around” after GC isn’t actually a bug but a design choice. The explanation around dirty pages and Unity’s conservative GC makes a lot of real-world profiler confusion finally click.

ijemin · January 13, 2026, 8:15am

I’ve just added an additional note about LOH. In the previous version, the description could be read as implying that memory previously occupied by a large object was forever unreusable unless another similarly sized large object allocation occurred, which is not.

Memory inside a large object heap block can’t be used directly for small objects. But a completely freed LOH region can be reused for small objects by repurposing them and splitting them into blocks suitable for small objects.

For example, if a single 300 MB large object is allocated and later collected, subsequent small allocations may reuse that region by splitting it, even if no similarly sized large-object allocation occurs. I’ve updated the post to clarify this behavior.

sebascore · January 21, 2026, 4:35pm

I am old fashioned, how do I print this?

GiftedMamba · February 5, 2026, 3:35pm

It was very interesting to read. I hope we will have more articles like this in the future.

Topic		Replies	Views
Garbage Collector API Unity Engine 2018-3-beta	83	22856	November 26, 2018
4.4 etc - Fixed cost GC? Better instantiate? News & General Discussion	30	8331	November 22, 2013
Mono garbage collection, can I get a summary? News & General Discussion	173	28523	October 13, 2015
Memory voes, is there really no way to manually allocate/free objects? Unity Engine Scripting	5	4476	April 7, 2019
Very odd Huge Memory Usage 40GB ManagedHeap.UsedSize !!! Unity Engine Scripting , Profiling	8	1472	May 20, 2022