Embeddings

Definition

An embedding is a fixed-length vector of floating-point numbers (typically 768 or 1536 dimensions) produced by an embedding model — usually a transformer trained on contrastive objectives. Two pieces of text with similar meaning produce embeddings that are close in vector space. Embeddings power semantic search, recommendation systems, clustering, deduplication and the retrieval step in RAG.

Why it matters

Embedding-model choice and re-indexing strategy are easy to underestimate. Switching embedding models means re-embedding the entire corpus. Plan for that upgrade path from day one.

Definition

Why it matters

See also

RAG (Retrieval-Augmented Generation)

Vector Database

LLM (Large Language Model)

Working on Embeddings?