What are Attention Heads?
Parallel attention mechanisms within a transformer that each focus on different types of relationships in the input.
Why It Matters
Multiple attention heads let AI models capture different types of patterns simultaneously, giving them a richer understanding of the text.
Real-World Example
One attention head might focus on grammar while another tracks which pronouns refer to which nouns.
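For readers who want to peek under the hood, here is a minimal sketch of the idea in Python with NumPy. It is an illustration, not how any particular library implements attention: every size, weight, and name below is invented for the example. Two heads each apply their own projections to the same input, compute their own attention weights, and their results are combined.

import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax: each row of scores becomes weights that sum to 1.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_head(x, w_q, w_k, w_v):
    # One head: project the input into queries, keys, and values,
    # score every position against every other, then mix the values.
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    return softmax(scores) @ v

rng = np.random.default_rng(0)
seq_len, d_model, d_head = 4, 8, 4              # four "words", toy dimensions
x = rng.normal(size=(seq_len, d_model))         # stand-in for word embeddings

# Two heads with independently initialised random projections. In a trained
# model, each head learns to track a different kind of relationship, such as
# grammar in one head and pronoun references in another.
heads = [
    attention_head(x, *(rng.normal(size=(d_model, d_head)) for _ in range(3)))
    for _ in range(2)
]
output = np.concatenate(heads, axis=-1)         # heads run in parallel, results combined
print(output.shape)                             # (4, 8)

The key point is in the last few lines: the heads do not take turns. They run side by side over the same input, and the model combines what each one found.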
“Understanding terms like Attention Heads matters because it helps you have better conversations with developers and make smarter decisions about your software. You do not need to be technical. You just need to know enough to ask the right questions.”
Learn More at buildDay Melbourne
Want to understand these concepts hands-on? Join our one-day workshop and build a real web application from scratch.
Related Terms
Transformer
A type of AI architecture that processes text by paying attention to relationships between all words at once, rather than reading sequentially.
Attention Mechanism
A technique that lets AI models focus on the most relevant parts of the input when generating output.
Self-attention
A mechanism where each word in a text considers its relationship to every other word in the same text (see the short sketch at the end of this page).
Large Language Model (LLM)
An AI system trained on massive amounts of text that can understand and generate human language.
Tokenisation
The process of breaking text into smaller pieces called tokens that an AI model can process.
Embeddings
A way of representing words, sentences, or other data as lists of numbers that capture their meaning.
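Three of the terms above, tokenisation, embeddings, and self-attention, fit together as a pipeline. Below is a deliberately tiny sketch in Python with NumPy. Splitting on spaces stands in for a real tokeniser, and the embeddings are random numbers rather than learned ones, so treat this as a diagram in code, not a working model.

import numpy as np

rng = np.random.default_rng(1)

# Tokenisation: break the text into pieces the model can process.
tokens = "the cat sat on the mat".split()

# Embeddings: represent each distinct token as a list of numbers.
vocab = {tok: i for i, tok in enumerate(dict.fromkeys(tokens))}
table = rng.normal(size=(len(vocab), 8))   # one 8-number vector per token
x = np.array([table[vocab[t]] for t in tokens])

# Self-attention: every token scores its relationship to every other
# token in the same text, and the scores are normalised into weights.
scores = x @ x.T / np.sqrt(x.shape[-1])
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights /= weights.sum(axis=-1, keepdims=True)

# Row i shows how strongly token i attends to every token, itself included.
print(np.round(weights, 2))

In a real large language model the embeddings are learned during training and the attention uses separate query, key, and value projections (as in the sketch earlier on this page), but the flow is the same: text becomes tokens, tokens become numbers, and attention relates every token to every other.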