Hardware-Aware Quantisation Explorer

Visualise how neural network weights map to fixed-point and low-bit formats

Select Number Format

Bit Layout

Decoded Value

0.0
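The decoded value shown above follows the standard two's-complement fixed-point rule: reinterpret the bit pattern as a signed integer, then scale by 2^-n to place the binary point. A minimal sketch (the Qm.n split and the function name are our illustration, not the tool's internals):

```python
def decode_fixed_point(bits: int, total_bits: int, frac_bits: int) -> float:
    """Decode a two's-complement fixed-point bit pattern (Qm.n format)."""
    # Reinterpret the raw pattern as a signed integer.
    if bits >= 1 << (total_bits - 1):
        bits -= 1 << total_bits
    # Scale by 2^-frac_bits to place the binary point.
    return bits / (1 << frac_bits)

# 8-bit Q4.4 example: 0b00010100 is integer 20, so 20 / 16 = 1.25
print(decode_fixed_point(0b00010100, total_bits=8, frac_bits=4))
```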

Representable Number Line
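The number line can be reproduced by enumerating every bit pattern of the chosen format; a sketch assuming a signed two's-complement fixed-point layout (names are illustrative):

```python
def representable_values(bits: int, frac_bits: int) -> list[float]:
    """Every value of a signed two's-complement fixed-point format."""
    lo, hi = -(1 << (bits - 1)), (1 << (bits - 1)) - 1
    return [i / (1 << frac_bits) for i in range(lo, hi + 1)]

# 4-bit Q2.2: 16 evenly spaced values from -2.0 to 1.75, step 0.25
vals = representable_values(4, 2)
```

The uniform spacing (2^-n) is what distinguishes fixed-point from float formats, whose representable points cluster near zero.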

Distribution Settings

Error Metrics

MSE: --
SNR (dB): --
Max Error: --
Clipped %: --
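The four metrics above can be computed in a few lines of NumPy; a sketch assuming a symmetric uniform quantiser that saturates to [-clip, clip] (the helper names are ours):

```python
import numpy as np

def quantise(x, bits=8, clip=1.0):
    """Symmetric uniform quantiser saturating to [-clip, clip]."""
    qmax = 2 ** (bits - 1) - 1
    scale = clip / qmax
    return np.clip(np.round(x / scale), -qmax, qmax) * scale

def error_metrics(x, bits=8, clip=1.0):
    xq = quantise(x, bits, clip)
    err = x - xq
    mse = float(np.mean(err ** 2))
    snr_db = 10 * np.log10(np.mean(x ** 2) / mse)     # signal power / noise power
    max_err = float(np.max(np.abs(err)))
    clipped_pct = 100 * float(np.mean(np.abs(x) > clip))
    return mse, snr_db, max_err, clipped_pct

rng = np.random.default_rng(0)
w = rng.normal(0.0, 0.3, 10_000)          # stand-in for FP32 weights
mse, snr, mx, clipped = error_metrics(w, bits=4, clip=1.0)
```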

Weight Histogram (FP32 vs Quantised)

FP32 Original
Quantised

Scheme Settings

Quantisation Parameters

Quantisation Mapping
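The mapping can be sketched as the usual affine (scale and zero-point) scheme, with both parameters derived from the tensor's range; the helper names are ours:

```python
import numpy as np

def affine_params(x, bits=8):
    """Scale and zero-point mapping [min, max] onto [0, 2^bits - 1]."""
    qmax = 2 ** bits - 1
    lo, hi = float(x.min()), float(x.max())
    scale = (hi - lo) / qmax if hi > lo else 1.0
    zero_point = int(round(-lo / scale))
    return scale, zero_point

def quant(x, scale, zp, bits=8):
    return np.clip(np.round(x / scale) + zp, 0, 2 ** bits - 1).astype(np.int64)

def dequant(q, scale, zp):
    return (q.astype(np.float64) - zp) * scale

rng = np.random.default_rng(0)
w = rng.normal(0.0, 0.3, 1000)
scale, zp = affine_params(w)
w_int8 = quant(w, scale, zp)
w_rec = dequant(w_int8, scale, zp)    # round-trip error bounded by scale / 2
```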

GPTQ-Style Column-wise Effect

Quantising one column at a time introduces error that propagates to the columns not yet quantised. Watch how compensating updates to those columns reduce the total output error.
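The compensation step can be sketched in the spirit of OBQ/GPTQ: fix weights left to right and spread each weight's quantisation error over the not-yet-fixed weights via the inverse Hessian of the remaining coordinates. This toy version recomputes that inverse at every step, whereas GPTQ batches the same updates with a Cholesky factorisation; every name and number here is illustrative:

```python
import numpy as np

rng = np.random.default_rng(1)
d, n = 8, 256
X = rng.normal(size=(d, n))              # calibration inputs (d features)
w = rng.normal(size=d)                   # one row of the layer's weights
scale = 0.25
quantise = lambda v: np.round(v / scale) * scale   # toy uniform grid

H = X @ X.T / n + 1e-6 * np.eye(d)       # proxy Hessian of the layer loss

# Baseline: round-to-nearest, every weight quantised independently.
w_rtn = quantise(w)

# Column-wise with compensation: after fixing weight j, fold its error
# into the remaining weights using the inverse Hessian (OBQ update).
w_c = w.copy()
for j in range(d):
    Hinv = np.linalg.inv(H[j:, j:])      # inverse over the active block
    wq = quantise(w_c[j])
    err = (w_c[j] - wq) / Hinv[0, 0]
    w_c[j] = wq
    w_c[j + 1:] -= err * Hinv[1:, 0]

output_mse = lambda v: float(np.mean(((w - v) @ X) ** 2))
# output_mse(w_c) is typically below output_mse(w_rtn)
```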

Toy Network: 2D Classification

A 3-layer MLP trained on a spiral dataset. Quantise its weights and watch how the decision boundary degrades.

FP32 Decision Boundary

Accuracy: --

Quantised Decision Boundary

Accuracy: --
Degradation: --

Layer Sensitivity Analysis

Hardware Cost Comparison

Model Size Calculator
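The calculator's arithmetic is just parameters times bits, summed per layer; a sketch with hypothetical layer sizes (a 2→32→32→2 MLP, weights plus biases):

```python
def model_size_bytes(layers):
    """layers: list of (param_count, bits_per_weight) pairs."""
    total_bits = sum(n * b for n, b in layers)
    return total_bits / 8

# Hypothetical 3-layer MLP at uniform INT8: 2->32, 32->32, 32->2
layers = [(2 * 32 + 32, 8), (32 * 32 + 32, 8), (32 * 2 + 2, 8)]
size = model_size_bytes(layers)                       # 1218.0 bytes
fp16 = model_size_bytes([(n, 16) for n, _ in layers])
ratio = size / fp16                                   # 0.5: half of FP16
```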

Area-Accuracy Tradeoff

Relative multiplier area versus estimated accuracy retention. The Pareto frontier marks the best achievable tradeoffs; any point off the frontier is dominated by a format that is both cheaper and at least as accurate.
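The frontier itself is cheap to compute: sort by area and keep each point whose accuracy beats every cheaper one. A sketch with made-up (area, accuracy) pairs:

```python
def pareto_frontier(points):
    """points: (area, accuracy) pairs; lower area and higher accuracy win.
    Returns the non-dominated points sorted by ascending area."""
    best_acc = float("-inf")
    frontier = []
    for area, acc in sorted(points):
        if acc > best_acc:          # strictly better than any cheaper point
            frontier.append((area, acc))
            best_acc = acc
    return frontier

# Illustrative (relative multiplier area, accuracy retention %) pairs
formats = [(1.0, 99.1), (0.25, 97.8), (0.09, 92.0), (0.30, 95.0), (0.06, 80.0)]
front = pareto_frontier(formats)
# (0.30, 95.0) is dominated: (0.25, 97.8) is cheaper and more accurate
```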

Layer Precision Assignment

Assign a precision to each layer. Sensitive layers benefit most from higher precision; tolerant layers can absorb aggressive quantisation with little accuracy cost.
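One way such an assignment could be automated is a greedy budgeted upgrade: start every layer at the lowest precision, then spend the remaining size budget on the most sensitive layers first. Everything here (layer names, sensitivities, budget) is made up for illustration:

```python
def assign_precision(layers, budget_bytes, bit_options=(4, 8, 16)):
    """Greedy sketch. layers: list of (name, param_count, sensitivity);
    higher sensitivity means the layer is upgraded earlier."""
    bits = {name: bit_options[0] for name, _, _ in layers}
    size = lambda: sum(n * bits[name] for name, n, _ in layers) / 8
    for name, n, _ in sorted(layers, key=lambda t: -t[2]):
        for b in bit_options[1:]:
            # Upgrade only if the swap still fits the budget.
            if size() - n * bits[name] / 8 + n * b / 8 <= budget_bytes:
                bits[name] = b
    return bits

# Hypothetical 3-layer MLP with per-layer sensitivities
layers = [("in", 96, 0.9), ("hidden", 1056, 0.2), ("out", 66, 0.8)]
bits = assign_precision(layers, budget_bytes=1500)
total = sum(n * bits[name] for name, n, _ in layers) / 8   # stays under budget
```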

Summary

Model Size: --
Est. Accuracy: --
vs FP16 Size: --