What is Inference?
The process of using a trained AI model to generate predictions or outputs from new input data.
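The distinction can be sketched in a few lines of code: training produces a set of fixed parameters, and inference simply applies them to input the model has never seen. The tiny linear "model" and its weights below are hypothetical stand-ins for what real training would produce.

```python
# A minimal sketch of inference: applying already-trained, fixed
# parameters to new input. The weights here are made up for illustration.

def predict(features, weights, bias):
    """Run inference: combine new input with trained parameters."""
    return sum(f * w for f, w in zip(features, weights)) + bias

# Parameters learned during training (frozen at inference time).
trained_weights = [0.4, 0.6]
trained_bias = 0.1

# New input the model has never seen before.
new_input = [2.0, 3.0]

print(round(predict(new_input, trained_weights, trained_bias), 2))  # → 2.7
```

Real models have billions of parameters instead of three, but the shape of the operation is the same: new data in, fixed weights applied, prediction out.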
Why It Matters
Inference is what happens every time you use an AI tool, and its speed and cost directly affect user experience.
Real-World Example
When you ask ChatGPT a question, the model runs inference to generate the answer.
“Understanding terms like Inference matters because it helps you have better conversations with developers and make smarter decisions about your software. You do not need to be technical. You just need to know enough to ask the right questions.”
Learn More at buildDay Melbourne
Want to understand these concepts hands-on? Join our one-day workshop and build a real web application from scratch.
Related Terms
Latency
The time delay between a request and its response.
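In practice, latency is just the wall-clock time around a call. A minimal sketch, using a sleep as a stand-in for a real model call:

```python
import time

def model_call(prompt):
    """Hypothetical stand-in for an AI model request; sleeps to
    simulate the time inference takes."""
    time.sleep(0.05)
    return "answer"

start = time.perf_counter()
model_call("What is inference?")
latency = time.perf_counter() - start

print(f"latency: {latency * 1000:.0f} ms")  # roughly 50 ms here
```

For a chat product, latency like this is what the user feels between pressing Enter and seeing a reply.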
Large Language Model (LLM)
An AI system trained on massive amounts of text that can understand and generate human language.
Quantisation
Reducing the precision of numbers in an AI model to make it smaller and faster without losing much accuracy.
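The idea behind quantisation can be shown with a handful of example weights: map 32-bit floats onto 8-bit integers using a shared scale factor, then convert back. The recovered values are close to, but not exactly, the originals — a small accuracy cost for a much smaller model. The weight values below are invented for illustration.

```python
# A minimal sketch of quantisation: floats -> 8-bit integers -> floats.

weights = [0.12, -0.53, 0.98, -0.07]  # hypothetical model weights

# Choose a scale so the largest weight maps to the edge of int8 range.
scale = max(abs(w) for w in weights) / 127

quantised = [round(w / scale) for w in weights]   # small integers
dequantised = [q * scale for q in quantised]      # approximate floats

print(quantised)    # → [16, -69, 127, -9]
print(dequantised)  # close to, but not exactly, the original weights
```

Storing an 8-bit integer instead of a 32-bit float cuts memory use by about four times, which is why quantised models run faster and fit on smaller hardware.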
Training Data
The dataset used to teach an AI model patterns and knowledge during its initial training.
Transformer
A type of AI architecture that processes text by paying attention to relationships between all words at once, rather than one word at a time.
Attention Mechanism
A technique that lets AI models focus on the most relevant parts of the input when generating output.