What is Quantisation?
Reducing the numerical precision of a model's weights, for example from 32-bit floats to 8-bit integers, to make the model smaller and faster with only a small loss in accuracy.
Why It Matters
Quantisation allows large AI models to run on consumer hardware and mobile devices.
Real-World Example
Converting a model from 32-bit to 8-bit precision to run it on a laptop instead of a data centre.
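To make the 32-bit to 8-bit conversion concrete, here is a minimal sketch of symmetric 8-bit quantisation using NumPy. The weight values are made up for illustration; real models quantise millions of weights per layer, but the arithmetic is the same: pick a scale, round each float to the nearest small integer, and multiply back by the scale at inference time.

```python
import numpy as np

# Hypothetical "model weights": one layer's parameters in 32-bit floats.
weights_fp32 = np.array([0.12, -0.83, 0.51, -0.02, 0.97], dtype=np.float32)

# Symmetric 8-bit quantisation: map the float range onto integers -127..127.
scale = np.abs(weights_fp32).max() / 127.0
weights_int8 = np.round(weights_fp32 / scale).astype(np.int8)

# At inference time, the integers are scaled back to approximate floats.
weights_restored = weights_int8.astype(np.float32) * scale

print(weights_int8)      # each weight now fits in 1 byte instead of 4
print(weights_restored)  # close to the originals, with small rounding error
```

The stored model shrinks to roughly a quarter of its original size, and each restored weight differs from the original by at most half the scale, which is why accuracy barely drops.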
“Understanding terms like Quantisation matters because it helps you have better conversations with developers and make smarter decisions about your software. You do not need to be technical. You just need to know enough to ask the right questions.”
Learn More at buildDay Melbourne
Want to understand these concepts hands-on? Join our one-day workshop and build a real web application from scratch.
Related Terms
Large Language Model (LLM)
An AI system trained on massive amounts of text that can understand and generate human language.
Model Distillation
Creating a smaller, faster AI model that mimics the behaviour of a larger, more capable one.
Inference
The process of using a trained AI model to generate predictions or outputs from new input data.
Edge AI
Running AI models directly on local devices like phones or sensors rather than in the cloud.
Transformer
A type of AI architecture that processes text by paying attention to relationships between all words at once, rather than one word at a time.
Attention Mechanism
A technique that lets AI models focus on the most relevant parts of the input when generating output.