Numeric Selection

Normative specification for compile-time selection of the numeric representation of real-valued quantities — posit, IEEE-754, or fixed-point — from a value’s dimensional range, via the same coeffect machinery and Program Semantic Graph carriage that width inference uses for integers.

1. Overview

Numeric selection is the real-valued sibling of Width Inference. Width inference sizes an integer from the bit count its value range requires; numeric selection chooses the representation of a real — whether a value is best carried as a posit, an IEEE-754 float, or a fixed-point number — from that value’s dimensional range. The two are halves of one discipline: representation follows from analyzed range, never from a target type name, and an unanalyzable range is a reported error, never a silent default.

The objective is a single, deterministic, compile-time function: for a real value with range [a, b] on a target offering a set of representations R, select the representation that minimizes worst-case relative error over the range. This is the objective sketched in Width Inference §4, made sound here by two side-conditions the bare form requires; the present chapter specifies it in full, including those side-conditions, the tiered provenance model for the range input, the unobservable-range contract split on dimensionedness, the capability-coeffect treatment of performance, the concrete/parameterized representation scope split, and the quire pass that realizes exact accumulation.

The single observation that unifies the design: the dimensional range, not the dimension, is the input to selection. Knowing that a value carries dimension meters does not distinguish nanometers from astronomical units. The dimensional algebra (see Units of Measure and NTU Dimensional Architecture) establishes the kind; the concrete [a, b] establishes the representation. Authority over the range is therefore authority over the representation, because the objective is deterministic once the range is fixed.

Governing principle — design-time range selection provisions the runtime envelope. A representation is selected whose dynamic range and precision profile cover the value’s range with margin (§2.1), so that runtime — or training-time — behavior cannot out-run the representation’s ability to preserve precision. For an evolving computation (e.g. an adaptive model whose distribution shifts during training), the expected drift is part of the range; the margin is sized to it. Preservation of representation is therefore a design-time guarantee, not a runtime mechanism: nothing re-selects at runtime. When behavior does reach the envelope edge, that surfaces as a loud, located condition — a reported error or an explicit saturation (§2.1, §6, §7) — never a silent precision collapse. Explicit developer selection is sovereign over all inference (§3.3, §5): a developer who names a precision and range gets exactly that.

Lineage. Numeric selection inherits the range-as-refinement discipline from F*, whose refinement-typed machine integers carry their range as a refinement and whose representation choices are proven sound against it; Clef takes the further step of inferring the range and selecting a real representation from it. The tiered provenance model and its Fidelity.Physics mechanism (§4) are expressed with three features of F# lineage: units of measure (Robert Kennedy’s measure types, which key a range to its dimension), quotations (Don Syme’s Expr<_>, which carry a range-law expression the compiler interval-evaluates symbolically), and active patterns (Syme’s pattern abstraction, which classify an evaluated range into a representation regime). These are load-bearing F#-origin features and are never relabeled as ML-family-generic. The posit error model and the full-gamut quire follow Gustafson’s posit arithmetic and the 2022 Posit Standard; the b-posit bounded-regime variant and its fixed 800-bit quire follow Jonnalagadda, Thotli, and Gustafson (arXiv:2603.01615).

1.1 Status discipline

This chapter distinguishes requirements that follow directly from prior art and standards from requirements that are this specification’s own design construction. Where a requirement is a forced consequence of the objective plus the integer-twin infrastructure, it is stated as normative without qualification. Where a requirement is a design decision this specification adopts — implementable and recommended, but not compelled by external canon — it is marked [Design decision]. Genuinely unresolved items are marked [Not yet specified] rather than invented.

2. The Selection Objective

For a real value with range [a, b] on target T offering representation set R(T):

r^* = \operatorname*{arg\,min}_{r \,\in\, R_{\mathrm{cov}}(T,\, [a, b])} \; \max_{x \,\in\, [a, b]} \; \mathrm{err}_r(x)

Three properties are load-bearing:

Accuracy-only objective. The score is worst-case error over the range. There is no cost, latency, or area term. Performance is handled by filtering the candidate set (§6), never by perturbing the score.
Deterministic and compile-time computable once [a, b] is fixed.
Per-target. Both R(T) and the winner vary by target; the range does not.

The error models differ by representation family and decide the winner by where the range sits:

IEEE-754 distributes relative error approximately uniformly (≈ 2⁻ᵖ) across its normal range — a representation that makes no bet on where values cluster.
Posits taper: precision is maximal near magnitude 1.0 and degrades toward the regime extremes — a bet on locality.
Fixed-point fixes a scale, trading dynamic range for uniform absolute spacing.

The bare argmin as written above is ill-posed without two side-conditions, specified in §2.1 and §2.2. The objective form shown already incorporates them: R_cov is the feasibility-filtered candidate set (§2.1), and err_r(x) is the ULP-floored error metric (§2.2).

2.1 The feasibility (coverage) constraint

[Design decision.] A candidate whose dynamic range does not cover [a, b] is not excluded by the raw argmin. For x beyond r’s maximum representable magnitude, round_r(x) saturates and the relative-error term approaches a finite value near 1.0 — so a non-covering representation produces a bounded score and can pathologically win when every candidate scores near 1.0. A diagnostic bolted on after selection does not protect the objective. The candidate set is therefore filtered before the argmin:

R_{\mathrm{cov}}(T, [a, b]) = \{\, r \in R(T) \;:\; \mathrm{dynrange}(r) \supseteq [a, b] \,\}

with the explicit rule: if R_cov = ∅, that is a hard error (“no available representation on this target covers [a, b]; widen the range source, rescale dimensionally, or seal a representation and accept saturation”). A non-covering format SHALL NOT be selected. The Tier-3 seal check (§5) is a special case of this coverage check applied to a singleton R, not a separate mechanism. The coverage constraint is part of the objective, not a post-hoc warning.

2.2 The zero-crossing error metric

[Design decision.] The relative-error term |x − round_r(x)| / |x| diverges as x → 0. Signed dimensioned ranges routinely straddle zero (e.g. a membrane potential range [−80, +40] mV). For any range containing a neighborhood of zero, the unmodified worst-case is unbounded for every candidate — IEEE included, since near zero IEEE precision is governed by the subnormal floor, not by 2⁻ᵖ — and the argmin is undefined.

The error metric is therefore a mixed absolute/relative form with an ULP floor:

\mathrm{err}_r(x) = \frac{|x - \mathrm{round}_r(x)|}{\max(|x|,\, \mathrm{ulp}_{\min}(r))}

where ulp_min(r) is the magnitude of r’s smallest representable positive normal (IEEE) or smallest representable magnitude in the regime covering the range (posit / fixed-point). Below ulp_min(r) the metric becomes effectively absolute, which is the correct near-zero semantics: absolute spacing is what matters near zero, and the contest becomes “whose smallest representable magnitude best resolves the near-zero cluster.” An equivalent formulation excludes an [−δ, +δ] neighborhood of zero from the relative-error worst-case and scores near-zero behavior by ulp_min(r) directly.

A consequence of correct posit modeling: posits taper toward 1.0, not toward zero. Zero is a regime extreme where posits, like every representation, lose precision. Any claim that posits deliver high precision near zero is arithmetically false and SHALL NOT appear in selection diagnostics. The genuine near-zero argument for posits is the exact accumulation the quire provides (§8), not tapered near-zero precision.

Posit32/es2 anchors. Dynamic range ≈ [10⁻³⁶, 10³⁶]; relative error ≈ 2⁻²⁷ near 1.0, degrading toward ≈ 2⁻⁸ at regime extremes. These two figures are illustrative endpoints of a continuous taper, not a two-point error model. The worst-case over [a, b] depends on where [a, b] sits on the taper — which is the entire point of the objective. An implementation SHALL compute the taper at the actual range endpoints and SHALL NOT hard-code the endpoint figures.

3. The Tiered Authority Model

The range input [a, b] has exactly three provenances. These are not three selection algorithms — they are three provenances and bindingness levels of the one objective’s input. All three feed the same selector and produce identical lowering: the pipeline treats every range claim identically regardless of provenance. The three tiers map onto the Level 1 / Level 2 / Level 3 inference ladder used throughout Clef (see Incremental Computation §12 and Width Inference §5).

Tier	Range source	Inference Level	Developer writes
1 — Intrinsic	interval analysis + profiling evidence	Level 1 (inferred)	nothing (rides the width-inference frame; see §3.1 caveat)
2 — Library-assisted	library constants / physical laws	Level 2 (bounded)	`open Fidelity.Physics.X`
3 — Direct	explicit annotation at the site	Level 3 (explicit)	a seal form (syntax [Not yet specified], §11)

3.1 Tier 1 — Intrinsic

The compiler obtains [a, b] from interval analysis over the PSG composed with dimensional inference — the same kind of analysis width inference runs for integers, but over a real/dimensional interval domain that does not yet exist in the integer twin (§9). Tier 1 succeeds automatically only for computations whose range is bounded by dataflow alone: closed-form expressions; division only by a quantity with a known non-zero lower bound; profiled values; or already-annotated values. For real physics with division by a quantity whose lower bound is not in dataflow (e.g. r² in a denominator), Tier 1 cannot bound the range and falls through to Tier 2 or Tier 3, or to the §7 error. This scope limit is honest and load-bearing: the headline gravitation example (§4) is a Tier 2 success, not a garden-path success.

3.2 Tier 2 — Library-assisted

A domain library supplies the range that intrinsic analysis cannot derive. Tier 2 has authority over the range and is advisory over the representation: the library states that membrane potentials live in [−80, +40] mV; the compiler runs the argmin against the target’s format set. The domain expert knows the physics range; the compiler knows the target’s formats and their error profiles. The mechanism is specified as a design sketch in §4; the precise binding contract is [Not yet specified].

3.3 Tier 3 — Direct

The developer writes a concrete representation; the compiler runs selection in reverse — verifying that the chosen representation’s dynamic range covers the inferred or annotated value range, and emitting a coverage diagnostic if not. Tier 3 is sealing: it converts the argmin from a chooser into a checker (the §2.1 coverage check on a singleton R). A Tier-3 choice that is accuracy-suboptimal but covering still compiles, and is witnessed at design time (“you sealed posit16; the inferred range would select posit8 at equal accuracy and half the footprint”). The seal syntax is [Not yet specified] (§11) — so Tier 3, the one tier buildable on today’s infrastructure, is not yet expressible.

3.4 Tier composition — precedence-override

[Design decision.] When more than one tier produces a range claim, composition is a single, total, precedence-ordered refinement, not a set intersection:

Each present tier yields a range claim: R₁ (dataflow, if Tier-1 analysis terminates with a bound), R₂ (library), R₃ (a sealed annotation, which fixes a representation and implies its dynamic range as a range claim).
The binding range is the claim of the highest present tier (R₃ if present, else R₂, else R₁). Higher tiers override; they do not silently merge. This is the entire meaning of “precedence.”
Lower-tier claims become consistency obligations, not inputs to the selected range. When a lower tier also produced a claim, the compiler checks containment R_lower ⊆ R_binding (within a tolerance; see §11). If the observed dataflow range R₁ is not contained in a higher tier’s declared range, that is a diagnostic — the dataflow observed values the domain library or seal claimed impossible (the real-valued analogue of integer overflow, a genuine bug signal). The diagnostic does not change the binding range.
There is no empty-intersection state, because there is no intersection. Disagreement is always binding-range-wins, lower-claim-flagged, which makes composition total and decidable.

This mirrors the Level-1/2/3 ladder (inferred → bounded → explicit, each level authoritative over the one below) as monotone override with disagreement-witnessing, not as a lattice meet.

4. The `Fidelity.Physics` Mechanism (design sketch)

Fidelity.Physics is planned, not built. This section is a design sketch, not a settled specification; the binding contract is [Not yet specified] (§11).

The mechanism is quotation-symbolic: the library carries a range law, and the compiler derives the range by evaluating it. The derived capability — obtaining a range such as a force interval from the gravitation law — is the reason the law, rather than a precomputed range, is the carried object.

A type-level obstacle shapes how the law is carried. F# [<Measure>] types are a separate kind; they are not first-class type arguments placeable in arbitrary generic position, and quotations reflect over value-level Expr<'T>, not measure-level indices. A naive Expr<DomainRange<N>> keyed by a unit-of-measure N does not type-check. The range law is therefore carried as Expr<float -> ... -> float> — an unmeasured function expression over float — tagged with its dimension by a separate companion attribute rather than by a measure type-argument (matching the documented Farscape Expr<PeripheralDescriptor> precedent, a value-level descriptor expression with dimension carried separately). PSG elaboration symbolically interval-evaluates the quotation AST over its input ranges to produce [a, b]; regime classification then runs over the resulting [a, b] value — the active pattern is a post-evaluation value classifier, not a match on the quotation AST.

This is a design sketch of a planned, unbuilt mechanism; the input-range leaves are tabulated, and the export interface and terminating sub-language are open (§14).

Decidability precondition. The decidability of numeric selection rests on a two-stage structure, and it is important not to conflate the stages. Selection emits a categorical abstraction, not a scalar. What the selector produces and freezes onto the PSG is a point in a bounded, enumerable representation lattice — the regime classification of §4 (NearUnityTaper / WideDynamic / MLActivation) resolved to a concrete element of the finite codomain of §8 (Posit8/16/32/64, IEEE f32/f64, or fixed-point). Every quantity that downstream passes consume — the dynamic range dynrange(r), the near-zero floor ulp_min(r), and the quire width (a function of the selected format: a fixed 800 bits for a b-posit, n²/2 for a full-gamut posit; §10.2) — is a function of that category, inferred from the regime the way integer width is inferred from an integer range. Downstream obligations (coverage §2.1, tier containment §3.4, quire-adequacy §10.2.1) are therefore finite membership and comparison over this lattice, which is the framework’s native decidable regime — not arithmetic over a substituted number. The scalar bound is upstream of all of this; it is an input to the classifier, never the carried object.
Two consequences for the bound computation that feeds the classifier:
It must be total and terminating. The quotation range-law is restricted to a total, terminating, closed-form sub-language, and its bound [a, b] is obtained by directly evaluating the expression in the outward-rounded interval domain of §9.1 — an image computation, not a satisfiability query handed to a decision procedure. Algebraic operations (+, −, ×, reciprocal, sqrt) evaluate to image intervals by the standard outward-rounded rules (including the sign-crossing reciprocal split); the r²-in-denominator case (§3.1) is bounded here because the Tier-2 contract supplies r ≥ r_min > 0, so r²’s interval excludes zero. Transcendentals (exp, log, sin) evaluate on pre-split non-oscillatory segments where each is monotone (for sin, split at the half-period extrema), so the image interval is read from the endpoint images. They are deliberately not admitted as a logical theory — the first-order theory of the reals with sin is undecidable, which is exactly why the design evaluates rather than decides over them. The bound computation forms no logical formula at all; it is an image computation in the interval domain. Every downstream obligation that is discharged as a formula lands in the linear-integer / bit-vector fragment (QF_LIA / QF_BV) the framework already commits to.
Because only a category escapes the bound computation, the method used to compute the bound (interval arithmetic, affine forms, or any other sound image method) is an implementation detail of one terminating evaluation: it is observable downstream only through which lattice element it selects. Its sole correctness obligation is tightness — an over-estimated bound is sound (it can only select a more-covering representation, never an under-covering one, per the conservative-widening discipline used throughout the framework), but it bears on completeness of selection: an interval over-estimate of an expression where a variable recurs (e.g. r in r·r, the dependency problem) can push the bound across a regime boundary, flip which lattice element wins, or in the limit manufacture a spurious R_cov = ∅ error where a tighter method would have found a covering element. Tightness thus affects which category is chosen, never the linearity of the obligations over it.
The remaining open detail is the precise segment-splitting convention for each admitted transcendental (§14).

The library author writes (F#-lineage features named per the Lineage note):

Units of measure (Kennedy) — the dimension that keys the range (registered-under, not type-parameterized-by): [<Measure>] type N = kg m / s^2.
Quotations (Syme) — a range-law expression Expr<float -> float -> float -> float> over unmeasured floats, mirroring Farscape’s Expr<PeripheralDescriptor>, plus a rescale hint enabling the “rescale to AU” diagnostic. Symbolically interval-evaluated (within the terminating sub-language) to a dimensioned [a, b].
Active patterns (Syme) — classify the resulting [a, b] value into a representation regime (NearUnityTaper / WideDynamic / MLActivation). The pattern operates on the evaluated range value; the symbolic step is upstream.
Registration glue — an attribute (e.g. [<DomainRange("OrbitalMechanics")>]) attaches each range-law expression to its measure, so that opening the module is all the developer does.

The developer experience, under the honest scope of §3.1:

open Fidelity.Physics.OrbitalMechanics
let gravForce (m1: float<kg>) (m2: float<kg>) (r: float<m>) : float<N> =
    GravConst * m1 * m2 / (r * r)   // dimension inferred N (free, HM+ℤ unification);
                                    // range SUPPLIED BY Tier 2 — Tier 1 alone
                                    // cannot lower-bound r² in the denominator

The compiler infers N for free; the range comes from Tier 2, precisely because Tier 1 dataflow cannot lower-bound r. This is a Tier 2 success that depends on a library that is planned, not built.

ML activation tuning. A Fidelity.ML sibling ships an asymmetric posit configuration (asymmetric es/regime, exponent bias shifted to center precision on the activation mode, narrow range ≈ [10⁻¹⁴, 10¹]) so an ML author obtains the tuned representation without hand-deriving it. The asymmetry is supplied as input, not as a second objective. Rather than replacing §2’s symmetric worst-case argmin with a distribution-weighted (expected-error) objective in the general selector, Fidelity.ML feeds the existing selector a range already skewed to the activation density (the narrow, bias-shifted interval above): the symmetric argmin run over that skewed range naturally selects the asymmetric, bias-shifted configuration, because the range endpoints it scores already sit where the distribution’s mass is. This keeps the general selector single-objective and exact (Requirement 4), and confines the distribution knowledge to the Tier-2 library that owns it. A genuinely distribution-weighted objective — integrating expected error against an activation density rather than scoring a skewed range’s worst case — remains a possible future refinement scoped strictly to the ML domain library (§14), but is not required for the shipped behavior.

5. Sealing and Reverse Selection

A Tier-3 seal fixes a concrete representation at the site. The compiler runs selection in reverse and discharges exactly the §2.1 coverage check on the singleton candidate set:

If the sealed representation covers the value range, the seal compiles, even if accuracy-suboptimal. A suboptimal-but-covering seal SHALL be witnessed at design time with the representation the open argmin would have chosen.
If the sealed representation does not cover the range, that is the §2.1 hard error.

The seal form (operator, attribute, or quotation; and the rounding/saturation discipline for any accompanying lossy conversion) is [Not yet specified]; it inherits the open explicit-conversion syntax of Width Inference §7.

6. The Default and Unobservable Case

[Design decision.] Numeric selection inherits width inference’s contract — report an error, never silently default — split on dimensionedness:

Dimensioned real with unobservable range → ERROR. A float<newtons> whose range cannot be bounded by any provenance is reported, asking for an annotation or a Fidelity.Physics import. The dimension is a promise that a domain range exists; silently picking IEEE would discard the dimensional information the developer paid for. This is the same contract as the integer unobservable-range error of Width Inference §6.
Bare float with unobservable range → IEEE f64, no error. This is not a forbidden silent default. IEEE-754 is the correct representation for an unknown range: its uniform 2⁻ᵖ error is exactly “no commitment about where values cluster” (the maximum-entropy, no-bet choice). Understood precisely, f64 is what the argmin selects for a wide or unobservable range on a target whose binding offers it — not a bypass of selection. Bare-float code keeps working as it does today.

6.1 The bare/dimensioned seam

[Design decision.] Ranges are composition-dependent, so the bare/dimensioned boundary needs explicit handling. Consider:

let x : float = bareInput in
let y : float<newtons> = x * oneNewton

Here x is bare (no-error IEEE path), but y is dimensioned and would error on an unobservable range — and y’s range derives from x’s, which was never bounded. The resolution:

Bare floats carry interval analysis. There is no second analysis to switch off; the same real interval domain (§9) runs on every real value. “Bare works like today” means precisely and only: a bare float with an unobservable range incurs no error and lowers to f64. It does not mean bare floats are exempt from range propagation.
The dimensioning site is where the range obligation crystallizes. When a bare value with an unobservable range flows into a dimensioned context, the error fires at the dimensioning boundary (here, x * oneNewton), with a diagnostic that names the upstream bare source as the unbounded input ("y : float<newtons> requires a bounded range; its range derives from x (bare float, unbounded at ); annotate x, import a domain library, or seal y’s representation").
The pit-of-success boundary is therefore: dimensioning is the trigger for the range obligation; the obligation is discharged by bounding the dimensioned value’s range, which may require bounding the bare inputs that flow into it. The seam does not leak, because the obligation attaches to the dimensioned boundary and the diagnostic traces provenance across it.

This revises the treatment in Native Type Universe: float and float32 remain the bare, no-locality-claim types defaulting to IEEE as lowering targets selected for wide or unobservable ranges, not as a universal declared representation. The float = f64 / float32 = f32 NTU mappings remain correct as lowering targets; representation selection engages for dimensioned or ranged reals.

7. Performance as a Capability Gate

[Design decision.] Performance never enters the score. It enters as a capability-coeffect filter on the candidate set, with a three-valued capability per format: native, emulated, or unavailable.

R(T)      = { r : capability(T, r) ≠ unavailable }            -- the target's offered formats
R_cov     = { r in R(T)  : dynrange(r) ⊇ [a, b] }             -- §2.1 feasibility
R_eff(T)  = R_cov filtered by the per-build emulation policy:
  native-only         → { r in R_cov : capability = native }
  allow-emulated      → R_cov                                 (default)
  allow-emulated-warn → R_cov, with a perf diagnostic if r* is emulated
r* = argmin (over r in R_eff)  max (over x in [a, b])  err_r(x)   -- §2.2 metric

The relationship to the bare objective is explicit: R(T) is the offered set (capability not unavailable); R_cov and the emulation policy are filters layered above it. The argmin score is untouched.

Two reasons performance is a filter and not a cost term:

Worst-case relative error is closed-form and target-microarchitecture-independent; cost is not. Folding a λ·cost term into the score would smuggle delay-model guessing into a function that is otherwise exact.
Binary, witnessed outcomes, not silent degradation. A blended objective produces a silently degraded choice. A capability gate produces a witnessed outcome: a format is in R_eff or it is not, and an emulated accuracy-optimal choice yields a diagnostic, not a silent swap. This generalizes the quire contract — a target lacking quire support triggers a capability-coeffect failure (§8), never a fallback to lossy accumulation.

b-posit hardware. b-posits close the float/posit hardware-efficiency gap by bounding the regime field to 6 bits, so one decoder and a five-way MUX serve 16/32/64-bit operands (the non-significand field is identical across precisions, which IEEE structurally cannot do). Jonnalagadda, Thotli, and Gustafson (arXiv:2603.01615) report a 32-bit b-posit decoder at 79% less power, 71% smaller area, and 60% less latency than a standard posit decoder, matching or beating IEEE-754 float, with the margin widening at 64-bit. On targets that carry b-posit hardware the capability gate reports capability = native and reduces to an accuracy-vs-range choice. The design stays robust to how each target actually ships: where hardware is absent the gate reports capability = emulated and filters under native-only, and the accuracy-vs-range conclusions do not change; only the size of the native candidate set does.

The allow-emulated-warn policy lets emulated-ness influence which diagnostic fires — cost re-enters as a reporting side channel only, never as a selection input. The pure-accuracy invariant is scoped to the choice, not to what is reported about it.

8. Concrete vs. Parameterized Representations

[Design decision.] The representation space lives on three layers, split across the CPU/FPGA fork so that surface syntax and synthesis configurations never collide:

Surface (Tiers 1–2): float<dim> or a ranged real. No posit width appears on the garden path. A concrete posit<n, es> is the explicit Level-3 escape hatch only (seal syntax [Not yet specified]).
Lowering codomain on fixed-ISA (CPU / SIMD / RISC-V): four concrete types Posit8 / 16 / 32 / 64. Direct struct layout, clean SRTP dispatch, clean hardware mapping, clean error messages, clean quire pairing (a Quire32.fma takes Posit32 by construction). Selection chooses among these concrete types plus IEEE and fixed-point. Concrete types are the codomain of selection, not the surface syntax.
Parameterized posit<n, es, rs, bias>: FPGA / reconfigurable-only synthesis search. The (rs, es) grid is enumerable (rs ∈ [2, 6], es ∈ [1, 5], ≤ 25 points); bias and asymmetry are a bounded but continuous parameterization explored heuristically, not enumerated. Calling the full space “enumerable” would be wrong. This is type-directed hardware synthesis, not a type the developer instantiates, and it is future work.

ML routing follows from this split: the Tier-2 library supplies the narrow range and the asymmetric-bias recommendation, which lower to a parameterized b-posit config on FPGA and to the nearest concrete Posit8 on fixed-ISA. The 5-bit accuracy floor with a cliff below it is surfaced as a coverage warning.

9. Carriage and the Real Interval Domain

Numeric selection rides the same Program Semantic Graph coeffect frame that width inference uses for integers. The pattern to replicate, with honest cost annotations:

PSG coeffect computed pre-emission. Interval analysis runs once per graph before transfer; the result is carried as a coeffect. Numeric selection adds a sibling RepresentationSelection coeffect beside the width-inference coeffect, computed during the same elaboration. (Cheap: a new field and a new producer.)
Abstract sentinel at type-lowering. Just as platform-word integers lower to an abstract width sentinel resolved at narrowing, reals lower to an abstract real/float sentinel resolved by a single selectRepresentation choke point into posit<n, es, bias>, IEEE f32/f64, or fixed-point. (Moderate: a new sentinel and a new resolver; this hook does not exist for reals today.)
Single resolver choke point, analogous to integer narrowing.
Hard error on unobservability, inherited and split on dimensionedness (§6).
Check-time diagnostics, emitted in the same shape as the existing width-inference and FPGA diagnostics (§10).
Platform capability facts, the natural seat for “does this target have native posit hardware, an extended posit instruction, or quire support?” (§7).

9.1 The real interval domain is a new abstract domain, not a port

[Design decision.] The integer interval domain is int64-only (a min/max pair with two’s-complement bit-counting transfer functions). A real/dimensional interval domain reuses the PSG traversal skeleton, the coeffect-carriage discipline, the sentinel/resolver pattern, and the diagnostic plumbing — but it does not reuse the transfer functions, which must be written from scratch over a continuous interval domain. Minimally it requires:

Outward-rounded floating-point interval arithmetic (sound interval endpoints round outward to remain a superset).
Reciprocal/division with sign-crossing: 1/[lo, hi] where the interval contains zero splits into two unbounded pieces — exactly the r²-in-denominator case that makes Tier 1 fall through (§3.1). This is one instance of the general rule that a nonlinear interval operation is a sign-case analysis (interval multiplication is the bounded case; see Rounding §4.3); reciprocal is the case whose pieces are unbounded.
Transcendentals (sqrt, log, exp, sin) — the physics use cases need them.
A terminating widening over a continuous lattice — the integer monotone-widening fixpoint does not transfer; real interval widening needs an explicit widening-with-thresholds operator (∇) to guarantee termination. The thresholds SHALL be the dynamic-range boundaries ±dynrange(r) of the target’s concrete format set: when an interval loop variable expands past a threshold, the widening snaps it to that format boundary, so the analysis terminates in a number of steps bounded by the (finite) count of format boundaries rather than ascending a continuous chain. This couples termination to the same per-target format set that the §2 objective and §7 capability gate already range over — the widening introduces no thresholds of its own.

This is a new abstract interpreter of research-grade weight, materially harder than the integer one it rides. “Same traversal” means the same graph walk and carriage, not the same transfer functions.

9.2 The third reading of one PSG traversal

Width inference is the spatial reading (bits per value); pipeline/combinational-depth inference is the temporal reading (chained operations per register boundary). Numeric selection is the third reading of the same PSG traversal: it consumes the dimensional range and selects a representation. As in §9.1, “same traversal” is the graph walk and carriage, not the transfer functions.

10. The Preservation Chain and the Quire Pass

10.1 Preservation chain

Representation selection occupies the second arrow of the lowering preservation chain:

Dimension --(range)--> Representation --(width)--> Footprint --(escape)--> Allocation
   DTS        Tier 1/2/3     selection coeffect      quire pass        escape analysis
            authority        (argmin / seal)

Each step consumes the preceding inference’s output; the chain is verified during elaboration and surfaced at design time. Every arrow is a coeffect on the PSG, deferred to target-binding, because later stages have strictly more information. The representation choice is recorded as codata on the PSG beside the dimension, the grade, and the escape class — a lowering pass that does not touch representation leaves the annotation as it found it. Certified proof-transformer passes preserve it by construction; uncertified passes receive a per-edge re-check.

Cast fidelity. A posit32 → f64 widening is labeled cast fidelity 1.0 (one-way widen): posit32’s significand near 1.0 fits within f64’s mantissa, so the widening is lossless. The round-trip f64 → posit32 → f64 is < 1.0. The fidelity label applies to the one-way widening cast only and SHALL NOT be read as implying a posit32 value is f64-accurate.

Cross-target transfer fidelity. When a value crosses a target boundary and is re-represented (e.g. posit32 on an FPGA lane transferred to f64 on a host lane), the transfer edge SHALL carry a transfer-fidelity annotation computed at compile time from the source and target representation specifications: the annotation is 1.0 if and only if the target representation’s dynamic range covers the source range, and < 1.0 with a range-derived loss bound otherwise. The direction matters — posit32 → f64 is provably lossless (1.0) while f64 → posit32 is < 1.0 — so the annotation is directional per edge. This is a distinct annotation from cast fidelity above and from the representation spec; it is derived, not declared, and is surfaced at design time.

10.2 The quire pass

[Design decision — placement.] The quire pass is a Composer nanopass, downstream of selection (it depends on the selected posit width) and before the target-lowering fork. It recognizes accumulation, sizes the quire, writes a coeffect bundle, and defers lowering to target-binding — annotations, not instructions, the same coeffect discipline selection uses.

Recognition. An active pattern over PSG nodes recognizes fma/fold/reduce-of-products over a selected posit type, e.g. Array.fold (fun acc (a, b) -> acc + a*b) zero, and fuses it into a quire MAC.
Sizing. The quire width Q follows the selected format. A b-posit (arXiv:2603.01615) uses a fixed universal quire of 800 bits (25 32-bit integers) for any width n > 12, independent of n. A full-gamut posit (Posit Standard 2022) uses n²/2 bits, precision-dependent — 512 bits for posit32.
Coeffect emission — four coeffects on one PSG node: allocation (100 B for a b-posit quire; 64 B for a full-gamut posit32 quire; stack or arena by escape class); lifetime (loop scope unless escaping, via the existing escape analysis); capability (is exact accumulation available on this target?); dimension (an fma of newtons × meters accumulates as joules, with a single rounding at Quire.toPosit and the dimension verified at output).
Per-capability lowering at the fork. The quire is substrate-portable: it is placed on whichever processor the accumulation is advantaged, and each substrate carries it at its own cost. The b-posit’s fixed 800-bit quire is the enabling property — one accumulator width for every precision, so the same recognition and coeffect emission lower to any of these without re-sizing per format.

Target	b-posit 800-bit quire residency	Local cost character
x86_64 (AVX-512)	two 512-bit `zmm` registers (1024 bits, 224 bits slack); 25 32-bit lanes	register-resident vectorized shift-add; the quire is coherent to GPU/NPU across unified memory
Xilinx FPGA	a fabric accumulator (fixed-point shift-add tree); many quires bank in parallel across the LUT budget	fabric; ≈ 1 cycle/FMA, parallel across interactions
RISC-V + extended posit	an architectural quire register	≈ 1 cycle/FMA
Neuromorphic	not available	capability failure

Where the quire sits is a placement decision, not a fixed target: the accumulation is a sequential, stateful reduction, so the design weighs each substrate’s local accumulate cost against the transport cost of the boundary it would otherwise cross. A quire co-located with the ALUs that feed it moves one rounded result across a slow boundary; a quire on the far side of that boundary moves every operand. For a full-gamut posit the residency figures scale with n²/2 (a full-gamut posit32 quire is 512 bits, half a b-posit quire); the placement reasoning is identical.

10.2.1 The quire-adequacy invariant

[Design decision.] The quire is sized at the format’s fixed width Q precisely so that the accumulation length k is not bounded by the accumulator. This is a deliberate resource spend, not a saving: the pass allocates the full fixed Q-bit accumulator unconditionally so that k may be unbounded — it is not a minimization of accumulator size. A k-cap invariant of the form k · bits-per-product ≤ Q would re-impose exactly the bound the quire was designed to eliminate, and is rejected. The invariant the pass discharges is instead:

Quire-adequacy. For the selected format, the allocated quire is exactly the format’s fixed width Q (800 bits for a b-posit of width n > 12; n²/2 bits for a full-gamut posit of width n), and the pass verifies (as a bit-vector obligation) that the per-product intermediate (full-width product plus carry/guard bits) fits within the quire field layout — independent of k. The accumulation length k does not enter the bound.

The fixed width follows the format’s standard (the b-posit’s 800-bit universal quire from arXiv:2603.01615; the full-gamut n²/2 from the Posit Standard 2022); the Composer sizing-pass obligation (verify per-product fit, allocate, emit the coeffect bundle) is this specification’s extension.

10.2.2 Why the quire matters

IEEE rounding makes the structural zeros of Clifford/Cayley algebra numerically non-zero (phantom grades). The quire’s exact accumulation keeps structural zeros structurally zero through training. Exact accumulation also defeats catastrophic cancellation — the loss of significance when differencing or summing nearly-equal large quantities, as in the pairwise coordinate differences and force sums of an n-body integration — by deferring all rounding to a single final conversion. The quire’s value here is exactness of accumulation — not near-zero posit precision, which does not exist (§2.2): a close-encounter accuracy win comes from exact accumulation against cancellation and from design-time range-matched selection, never from “tapered precision near zero.”

11. Design-Time Surfacing and Accuracy Preservation

The selection objective is computable at design time, so the compiler can show — before anything runs — how much relative accuracy each candidate representation preserves across the value’s actual range. This is the capability with the most direct leverage for users, and the clearest demonstration that the compiler is materially different from an IEEE-only framework: the tapered-versus-uniform tradeoff is made visible and quantitative at authoring time, not discovered empirically after a long run drifts. IEEE-754 spends precision uniformly because it makes no bet on where values cluster; a posit concentrates precision where the values are. Where a domain’s values live near magnitude 1.0 — true after natural-unit normalization for most physics, and for normalized ML activations — a posit preserves materially more accuracy than IEEE-754 at equal width, and the developer sees exactly that, per target, at the point of writing the code. Surfacing that bet and its payoff at design time is the showcase capability of numeric selection.

Numeric selection emits check-time diagnostics in the same shape as the existing width-inference and FPGA diagnostics (severity, source range, related nodes, reachability), from a pass inside program checking, surfaced as squiggles and hovers. “Design time” here is continuous Lattice elaboration, distinct from ML “compile time.” Canonical readouts (taper figures are illustrative continuous-taper anchors, not a fixed two-point model; see §2.2):

force: float<newtons>
  Dimensional range: [1e-11, 1e30] (from gravitational constant and stellar masses; Tier 2)
  ├── x86_64:  float64         (worst-case rel error: 1.11e-16, uniform)      [WideDynamic → IEEE]
  ├── xilinx:  posit<32, es=2> (~2.3e-8 at range extremes, ~1.5e-9 near 1.0)  [NearUnityTaper]
  └── Note: posit gives ~10x better precision in [0.01, 100] where most forces reside

Warning: posit<32, es=2> dynamic range [1e-36, 1e36] does NOT cover the full
  dimensional range [1e-11, 1e72] of astronomicalDistance<meters>     (R_cov filter, §2.1)
  Consider: float64 (covers the full range) or scaling to AU (fits posit range)

The “rescale to AU” suggestion is itself a dimensional operation (1 AU ≈ 1.5 × 10¹¹ m; re-dimensioning brings the range into posit32 bounds). New diagnostic codes form a numeric-selection family (siblings of the width-inference and FPGA codes) and SHALL include at least: coverage-empty (R_cov = ∅), near-zero degeneracy (a range straddling zero with no representation resolving it under the ULP floor), bare-source-unbounded-at-the-dimensioning-seam (§6.1), tier disagreement (R_lower ⊄ R_binding, §3.4), suboptimal seal (§5), and quire capability failure (§10.2).

12. Relationship to Other Features

Width Inference — numeric selection is the real-valued sibling: width inference sizes integers from their value range; numeric selection chooses the representation of reals from their dimensional range, via the same coeffect frame. The unified objective is stated in both chapters; see Width Inference §4.
Dimensional Type System — the dimensional range is the principal input; see Units of Measure and NTU Dimensional Architecture. The dimension establishes the kind; the range establishes the representation.
Native Type Universe — selection refines the IEEE-default treatment of float/float32, which remain the bare lowering targets for wide or unobservable ranges (§6.1); see Native Type Universe.
Incremental Computation — the tiered authority model reuses the Level-1/2/3 inference ladder; see Incremental Computation §12.
Negative types and reversibility (non-normative) — a typed reversibility contract (negative types) is orthogonal to representation selection: it checks structurally and is representation-agnostic, so IEEE and posit alike satisfy it. Representation selection makes no promise of numerical reversibility; it provisions the precision envelope within which a reversible computation’s residual stays bounded, and the quire’s exact accumulation extends that horizon. A symplectic integrator typed via negative types and lowered with the quire is a client of both disciplines; that composition lives in a Fidelity.Numerics library, not in this chapter.

13. Normative Requirements

Range-driven representation. For a real-valued quantity, the representation (IEEE-754 / posit / fixed-point) SHALL be selected per target as a function of the analyzed and dimensional range, not from a target type name.
Feasibility constraint. Selection SHALL be over the coverage-filtered candidate set R_cov; a non-covering representation SHALL NOT be selected, and R_cov = ∅ SHALL be a compile-time error.
Zero-crossing soundness. The error metric SHALL be the ULP-floored form of §2.2 (or its equivalent neighborhood-exclusion form); a range straddling zero SHALL NOT yield an undefined argmin.
Accuracy-only objective. The selection score SHALL be worst-case error over the range and SHALL NOT include a cost, latency, or area term; performance SHALL be expressed only as a candidate-set filter (§7).
Tier composition. When multiple tiers produce a range claim, the binding range SHALL be the claim of the highest present tier; lower-tier claims SHALL become consistency obligations whose violation is a diagnostic, not a change to the binding range. There SHALL be no empty-intersection state.
Unobservable range, dimensioned. A dimensioned real whose range cannot be bounded by any provenance SHALL be reported as an error requesting annotation; the compiler SHALL NOT assume a representation.
Unobservable range, bare. A bare float with an unobservable range SHALL lower to IEEE f64 without error; this is the argmin outcome for an unknown range, not a bypass of selection. Bare reals SHALL still carry range propagation.
Dimensioning seam. When an unbounded bare value flows into a dimensioned context, the range error SHALL fire at the dimensioning boundary and SHALL name the upstream bare source.
Sealing. A Tier-3 seal SHALL be honored when it covers the range; a covering-but-suboptimal seal SHALL compile and SHOULD be witnessed at design time with the representation the open argmin would have selected.
Coeffect carriage. The selected representation SHALL be recorded as a coeffect (codata) on the PSG beside the dimension and preserved through lowering without recomputation.
Capability gating. A representation unavailable on the target SHALL NOT be selected; an accuracy-optimal but emulated choice under an emulation-permitting policy SHALL be selected and MAY be accompanied by a performance diagnostic; a required exact-accumulation capability that the target lacks SHALL be a capability failure, never a silent fallback to lossy accumulation.
Quire adequacy. A recognized accumulation over a selected posit SHALL be lowered with a quire of exactly the format’s fixed width: 800 bits for a b-posit of width n > 12 (arXiv:2603.01615), or n²/2 bits for a full-gamut posit of width n (Posit Standard 2022). Per-product fit SHALL be verified independent of the accumulation length k; no k-cap invariant SHALL be imposed.

14. Genuinely-Open Items

Not yet specified. The following are open and SHALL be resolved before the corresponding capability is claimed shippable. None is invented here as settled.

Seal syntax. The Tier-3 seal form (operator, attribute, or quotation) and any accompanying lossy-conversion rounding/saturation discipline are open; this inherits the open explicit-conversion syntax of Width Inference §7. This is the critical-path gap, since Tier 3 is the only tier buildable on today’s infrastructure.
Fidelity.Physics binding contract. The export interface that composes a quotation-symbolic module (§4) into a Tier-2 range claim is unspecified, and the library is planned, not built. The terminating range-expression sub-language is now resolved in §4: it is a closed-form image computation in the interval domain (§9.1) that feeds the regime classifier, so the carried object is a categorical regime (a point in the bounded representation lattice), not a scalar, and downstream obligations are finite lattice operations over that categorical codomain. The residual gaps are (a) the per-transcendental segment-splitting convention (where exactly sin and other non-monotone functions are split into monotone pieces) and (b) the tightness target for the bound method — since interval over-estimation under the dependency problem (r·r) bears on which lattice element is selected (selection completeness), it must be pinned how tight the bound has to be, and whether affine/Taylor refinement is required when an over-estimate would straddle a regime boundary.
Tier-disagreement tolerance. Whether containment R_lower ⊆ R_binding is exact or within an ε, and whether the tolerance is per-dimension, is open; measurement reality means observed ranges may legitimately exceed declared ranges by epsilon.
Profiling-evidence provenance. The third range source has no recording, versioning, or cross-build trust model; whether a profiling range is Tier 1 (with an evidence-provenance tag) or its own tier is unresolved.
Asymmetric/biased ML objective. Resolved in §4 toward the shifted-range approach: ML selection stays on §2’s symmetric worst-case argmin, run over a range pre-skewed to the activation density by the Fidelity.ML library, so the general selector remains single-objective and exact (Requirement 4). What remains open is only the optional future refinement — a genuinely distribution-weighted (expected-error) objective that integrates error against an activation density. This is the one place an expectation term might legitimately re-enter; it is scoped strictly to the ML domain library, never to the general selector, and is not required for the shipped behavior.
Quire sizing-pass obligation. The fixed quire width per format is settled (800 bits for a b-posit; n²/2 for a full-gamut posit); the per-product-fit verification, allocation, and coeffect-emission pass (§10.2.1) is downstream and must be fully defined.
Type-directed posit hardware synthesis. The FPGA parameterized-config search (§8, layer 3) is future work; the bias/asymmetry portion of the search is bounded-but-continuous, not enumerable, and the synthesis pipeline is not built.
Error precedence. The ordering between R_eff = ∅ (no native-policy-permitted format) and R_cov = ∅ (no covering format) when a target both lacks native support and offers no covering format needs a stated precedence.
ULP-floor definition. The exact ulp_min(r) per representation family (IEEE subnormal floor vs. posit smallest-regime magnitude vs. fixed-point LSB), and whether the ULP-floor form or the [−δ, +δ] exclusion form is canonical, must be pinned. Normative once chosen.

References

Swamy, N., et al. F* — refinement-typed machine integers and verified low-level code. https://www.fstar-lang.org
Kennedy, A. (1996). Programming Languages and Dimensions. PhD thesis, University of Cambridge. (Units of measure; later realized in F#.)
Syme, D. (2006). Leveraging .NET Meta-programming Components from F#. ML Workshop ‘06. (Quotations.)
Syme, D., Neverov, G., & Margetson, J. (2007). Extensible Pattern Matching via a Lightweight Language Extension. ICFP ‘07. (Active patterns.)
Gustafson, J. L. (2017). Posit Arithmetic. (Tapered-precision real representation.)
Standard for Posit Arithmetic (2022). (Quire, n²/2 accumulator width.)
Jonnalagadda, A. A., Thotli, R., & Gustafson, J. L. (2026). Closing the Gap Between Float and Posit Hardware Efficiency. arXiv:2603.01615. (Bounded-regime b-posit; a 6-bit regime cap and five-way decode MUX shared across widths; fixed 800-bit quire; the source for this chapter’s b-posit hardware figures and quire sizing.)
Cousot, P., & Cousot, R. (1977). Abstract Interpretation: A Unified Lattice Model for Static Analysis of Programs by Construction or Approximation of Fixpoints. POPL ‘77. (Interval domains, widening.)
Moore, R. E. (1966). Interval Analysis. Prentice-Hall. (Outward-rounded interval arithmetic.)
Petricek, T., Orchard, D., & Mycroft, A. (2014). Coeffects: A Calculus of Context-Dependent Computation. ICFP ‘14.
Baaij, C., et al. CλaSH: structural descriptions of synchronous hardware (Haskell→FPGA). (Sizing hardware to inferred representations.)
IEEE Standard for Floating-Point Arithmetic, IEEE 754-2019.

Width Inference Rounding and Directed Rounding