A Lesson in Memory Safety

A FreeBSD kernel bug lives in plain sight due to numeric representation.

A recently resolved OS bug was almost impossible to find. It was real, it lived in the FreeBSD kernel, and the code that contains it looks like the sort of defensive code a careful engineer might write. The video below walks through it and does an excellent job of breaking down the problem.

The function copies a bounded region of kernel memory out to a user-supplied destination. The kernel buffer kbuf is 1 kilobyte. A caller passes a destination and a length, and the code clamps that length so it can never exceed the buffer:

void *memcpy(void *dest, void *src, size_t n);

#define KSIZE 1024
char kbuf[KSIZE];

int copy_from_kernel(void *user_dest, int len) {
    int safe_len = KSIZE < len ? KSIZE : len;
    memcpy(user_dest, kbuf, safe_len);
    return safe_len;
}

The clamp looks airtight. If the caller asks for 2 kilobytes, safe_len becomes 1024, and memcpy copies exactly the buffer. The engineer checked the bound. The check is right there on line 7.

The ‘Secret’ Revealed

len is an int, a signed integer. memcpy takes its length as size_t, an unsigned integer. Nothing in the code converts between them; C does it silently at the call.

A caller passes len = -2048. The clamp asks whether 1024 < -2048. It is not, so the clamp leaves safe_len at -2048. That negative value is then handed to memcpy, which reads its argument as unsigned. The bit pattern for -2048 in a signed integer, read as unsigned, is an enormous positive number. memcpy sets out to copy that many bytes, reading past the end of the buffer and into the kernel memory that follows it, where secrets like passwords and keys are held, and out to the caller.

The bound was checked and deemed correct. It was compromised by a conversion that no line of the code performs: the signed-to-unsigned reinterpretation that C inserts, invisibly, at the boundary between the program’s logic and memcpy’s contract. The representation of the length was decided by the type names at each end, int on one side and size_t on the other, and the mismatch between those names is the vulnerability.

What makes the bug hidden is the implicit conversion. C changes a value’s type at a boundary without a word in the source, so the one operation that breaks the code is the one operation you cannot see. This was a deliberate choice in C’s design, made for convenience, not a limitation of its era: the alternative, requiring every type change to be written explicitly, was available at the time and is what structured languages take. Clef does not allow an implicit numeric conversion, and it is far from alone in that.

That design decision is why the safety of this code is human-carried. Because the language will silently convert the type, it offers no place to record the one fact the code depends on: that len must not be negative. Nothing in int says so, and nothing can. So the invariant ostensibly would have to live in the author’s head, subject to memory and silent distraction and error. Every contributor who touches this code afterward inherits it, along with the language decision that put it there. Everyone implicitly carries a language rule and a quiet fact the language will not hold. That fact wears other clothes in other bugs, a unit left out of a type, a dimension the compiler was never told, and Danger Close: Why Types Matter walks through what a Fahrenheit-for-Celsius or milligrams-for-micrograms mix-up costs when the type is silent on it. Here the fact is a sign. This is the information loss we describe in Deferred Inference: committing to int early discards the range that would have told the compiler the value is never negative, and once that range is gone, so is the safety it would have guaranteed.

Forcing It Into the Open

Simon Peyton Jones, one of the creators of Haskell, names the property at the heart of this in a recent interview. He is talking about shared mutable state, but the point is the same one:

Reading and writing shared variables that are visible outside the scope is a form of coupling between bits of code that is very invisible. Very invisible. So functional programming forces that to become visible. It forces it into the open.

That is exactly what the FreeBSD bug turns on. The dependency between len and memcpy, that the value must be non-negative for the bound to hold, is coupling between two bits of code, and it is invisible. C does not force it into the open. It lets the coupling stay implicit and the conversion stay silent, and the safety of the whole function rests on a relationship that appears nowhere in the source.

The lesson is not that the engineer should have been more careful. They were careful. The check was correct. The lesson is that a language which leaves the coupling invisible leaves the safety to vigilance, and vigilance does not survive contact with a codebase that many people touch over many years. SPJ’s own summary of the alternative is blunt: you want to know a program is safe, “rather than just say I’ve done my best.”

Correct By Construction

Clef forces that coupling into the open by making it part of the type. It does not choose a numeric representation from a type name; it chooses it from the value’s range, analyzed through the program’s dataflow, and carries that decision through compilation. This is the discipline of width inference: the bit width of an integer, and whether it is signed at all, follow from the interval a value actually occupies, not from a keyword written at a declaration. The relationship that lived invisibly between len and memcpy becomes a property the compiler holds and checks.

Trace the bug through that discipline and it has nowhere to form. A length into a 1-kilobyte buffer occupies the range [0, KSIZE]. That range is non-negative, so the value is unsigned by construction; signedness is derived from the range, not declared, and there is no signed form of this length for a caller to smuggle a negative value into. There is no -2048 to pass, because -2048 is not a value the length’s type can hold. And because the representation is selected once, from the range, and carried to the boundary as a fact the compiler still holds, there is no second, different type name at the far end for the value to be reinterpreted against. The boundary that reinterpreted int as size_t is not a boundary Clef crosses; both ends carry the same representation, chosen from the same range.

The bug is not caught by a check. A check is what failed in FreeBSD. The bug is absent because the state it depends on, a length that can be negative and a boundary that silently reinterprets it, is not representable. The range-derived representation is the Native Type Universe’s answer to exactly the class of failure that invisible coupling produces: it makes the coupling part of the type, so there is no longer a relationship for a contributor to know about and preserve.

That representation choice, once made, is not discarded on the way to the machine. It is carried through lowering as a fact the later stages hold, the same discipline that keeps a closure’s lifetime and a value’s dimensions intact from source to native code. A safety property established at design time is only worth having if it survives to the target, and information the compiler establishes is not discarded in lowering.

The Curse of Systems Archaeology

The video that opens this post is worth the watch in full; it takes a bug that hides behind a correct-looking check and makes every step of it legible. Bugs of this shape are not rare. They sit in production code everywhere, in the quiet gap between what a procedure intends and what the code assumes, and they reside in plain sight for decades precisely because the language makes them so hard to see. What is changing is that they are getting easier to surface. “AI” large language models are increasingly good at parsing details from code the way this video does, tracing a value across a boundary and noticing the assumption, and they will keep turning up anomalies wherever they lurk.

We would rather these bugs have nowhere to live than be found one at a time. That is much of why we are building full memory safety into the Fidelity Framework: not to catch this class of failure after the fact, but to make the semantic it depends on unrepresentable. Our goal is that a bug like this, someday, will be a forgotten vestige of the past.

Reading Further

Width Inference - the normative account of selecting representation, and signedness, from a value’s analyzed range
Numeric Selection - the real-valued half of the same discipline, for posit, IEEE-754, and fixed-point
Clef: From BCL to NTU - the Native Type Universe that resolves types to native representations at compile time
Wrapping C and C++ - building type and memory safety around legacy code that has these boundaries
Fewer Tests; Greater Safety - why a structural property beats a test that would have to anticipate the exact input
Information Is Not Discarded - how a design-time safety fact survives lowering to native code

The Gift of Deferred Inference Dotnet to Fidelity Concurrency