Width Inference

Normative specification for value-range analysis, minimal-width derivation, and representation selection in Clef compilation.

1. Overview

Clef does not fix numeric representations by declaration and then coerce between them. It infers the representation a value needs — the bit width of an integer, or the posit / IEEE-float / fixed-point form of a real — from the value’s range, traced through the program’s dataflow. Width inference is the spatial half of numeric lowering: how many bits does each value actually need?

The discipline is: representation follows from analyzed range, precision loss is explicit and tracked, and an unanalyzable range is a reported error, never a silent default. Signedness is part of this — a value’s range determines whether it is signed or unsigned (§3), so a representation is never selected by a target type name independent of the range the value actually occupies.

Lineage. Width inference derives from F*’s refinement-typed machine integers, where an integer carries its range as a refinement and operations are proven in-bounds. Clef takes the further step of inferring the range — and therefore the width — by interval analysis over the Program Semantic Graph, rather than requiring the developer to declare it. The position aligns with Clash (Haskell→FPGA), which sizes hardware to inferred widths rather than to machine-word defaults.

2. Value-Range Analysis

Width inference begins with interval analysis over the PSG dataflow. Each numeric node is assigned a value range [a, b]:

Literals are point intervals: 124 has range [124, 124].
Arithmetic propagates intervals: x + y has range [a_x + a_y, b_x + b_y]; products, shifts, and so on follow standard interval arithmetic.
Comparisons seed ranges: a value compared against a threshold is bounded by that threshold on the relevant branch, which in turn bounds any counter that must reach it.
Recurrences and counters are bounded by their modulus or termination condition: a free-running counter mod N has range [0, N-1].

For example, in a working FPGA design a wave-chase counter is free-running mod (defaultPeriodMs * ticksPerMs * 2) ≈ 10⁹, giving range [0, ~10⁹].

3. Width Derivation (Integers)

From a range $[a, b]$ , the minimal integer width is derived directly:

\mathrm{width}([a, b]) = \begin{cases} \lceil \log_2(b + 1) \rceil & a \ge 0 \quad (\text{unsigned}) \\ 1 + \lceil \log_2(\max(|a|,\, b + 1)) \rceil & a < 0 \quad (\text{signed}) \end{cases}

A value uses exactly the bits its range requires — no more. The following are inferred widths from a working FPGA design (Arty A7-100T):

Value	Inferred width	Range
`Counter`	31 bits	free-running mod ~10⁹
`StepTick`	21 bits	counts to ~781,250
`Phase`	10 bits	cycles `0..511`
`PeriodMs`	13 bits	latched period ≤ 4000

Each register uses exactly the bits it needs; on the FPGA each seq.compreg flip-flop is narrowed accordingly, shortening carry chains. No width is declared by hand.

4. Representation Selection (Reals)

For real-valued quantities, the analyzed range selects a representation, not merely a width. Following the dimensional-type model, representation selection is a deterministic compile-time function of the range and the target’s available formats:

r^* = \operatorname*{arg\,min}_{r \,\in\, R(\text{target})} \; \max_{x \,\in\, [a, b]} \; \frac{|x - \mathrm{round}_r(x)|}{|x|}

— the representation minimizing worst-case relative error over the range. IEEE-754 distributes precision uniformly (≈ $2^{-p}$ ); posits taper precision toward 1.0; fixed-point fixes a scale. The choice is per target (e.g. IEEE-754 on CPU, posit on FPGA, fixed-point on a neuromorphic core) and is surfaced at design time. See NTU Dimensional Architecture and Units of Measure; the dimensional range of a value is the primary input to this function.

The bare argmin above is the shape of the objective, not its sound form: for reals it is ill-posed without two side-conditions — a coverage constraint that excludes representations whose dynamic range does not cover [a, b], and a zero-crossing floor on the error metric where the range straddles zero. The full, sound objective and both side-conditions are specified in Numeric Selection §2; this chapter states only the integer-width half of the discipline.

5. Width and Representation as a Coeffect

The inferred width/representation is a coeffect: a requirement settled during design-time analysis and carried on the PSG, read (not recomputed) by every later lowering pass. The width-inference coeffect holds the range; the lowering pass that fixes the integer representation reads it and writes the chosen width into the operations it produces. This is the same coeffect discipline that governs allocation and lifetime (see Incremental Computation §10 and the coeffect model), so width inference composes with dimensional resolution and target reachability in a single analysis rather than a separate pass.

6. Unobservable Ranges

When interval analysis cannot bound a value’s range, the compiler does not guess a width or fall back to a machine-word default. It reports an error asking the source for an annotation. This preserves the decidable-by-construction property: every width is either inferred from an observed range or supplied explicitly; none is silently assumed.

7. Explicit Conversion

A genuine change of representation — narrowing a value to fewer bits than its range requires, or moving between incompatible formats (e.g. posit ↔ fixed-point) — is explicit, and:

Fidelity is tracked. A representation change that loses information records the loss; the language server surfaces it at design time (a lossless transfer is fidelity 1.0; a lossy one is < 1.0 with a bound derived from the range).
Overflow is a type error, not undefined behavior. A conversion whose target cannot hold the source range is rejected at compile time. Where a runtime-dependent value may exceed the target, the conversion must specify a rounding or saturation discipline; it may not silently truncate.
There is no polymorphic 'T -> Target conversion. Each explicit conversion names its source and target representations.

Not yet specified. The surface syntax for an explicit conversion (the operator form, how a rounding/saturation mode is written, and how the fidelity coeffect is named in source) is open. When specified, it SHALL make the representation change and its precision consequence visible at the call site.

8. Target Lowering

Target	Effect of width inference
FPGA (e.g. Arty A7-100T)	Registers / `seq.compreg` flip-flops sized to exact inferred width; shorter carry chains, less area. Posit or fixed-point selected per range.
CPU	Inferred width rounded up to the nearest native integer size for arithmetic; the analyzed range still drives representation choice and overflow checking.
NPU / GPU	Inferred width/representation informs tile/lane packing and accumulator selection.

Width inference is the spatial dimension of hardware lowering. It is necessary but not sufficient: a design may meet its width budget yet still violate timing through combinational depth, which a companion pipeline (temporal) inference addresses. The two are distinct analyses; this chapter specifies only the spatial one.

9. Relationship to Other Features

Dimensional Type System — the dimensional range of a value is the principal input to representation selection (§4); width inference is the value-level realization of the dimensional discipline.
Native Type Universe — inferred widths and representations are NTU representation choices carried through lowering.
Conversion — width inference, together with explicit conversion (§7), specifies numeric conversion in full; the Convert and NTU Conversion chapters are not part of this specification.

10. Normative Requirements

Range-driven width: Integer representation width SHALL be derived from the value’s analyzed range, not from a target type name.
Exact width: An integer value SHALL be representable in the minimal width its range admits; lowering MAY widen to a native size for arithmetic but SHALL NOT narrow below the range.
Representation selection: For real-valued quantities, representation (IEEE-754 / posit / fixed-point) SHALL be selected per target as a function of the analyzed and dimensional range.
Coeffect carriage: The inferred width/representation SHALL be recorded as a coeffect on the PSG and preserved through lowering without recomputation.
No silent default: An unanalyzable range SHALL be reported as an error requesting annotation; the compiler SHALL NOT assume a default width.
Explicit conversion: A representation change that loses information SHALL be explicit, SHALL record fidelity, and SHALL NOT be expressed as a polymorphic 'T -> Target coercion.
No undefined overflow: A conversion whose target cannot hold the source range SHALL be a compile-time error; runtime-narrowing conversions SHALL specify a rounding or saturation discipline.

References

Swamy, N., et al. F* — refinement-typed machine integers and verified low-level code. https://www.fstar-lang.org
Baaij, C., et al. CλaSH: structural descriptions of synchronous hardware (Haskell→FPGA).
Gustafson, J. (2017). Posit Arithmetic; Standard for Posit Arithmetic (2022).
Petricek, T., Orchard, D., & Mycroft, A. (2014). Coeffects: A Calculus of Context-Dependent Computation. ICFP ‘14.

Reactive Signals Numeric Selection