POST /v1/embeddings converts text into a vector of floating-point numbers. These vectors capture semantic meaning, so similar texts produce similar vectors regardless of exact wording.
Common uses:
- Semantic search — find the most relevant documents for a query
- RAG (retrieval-augmented generation) — retrieve context before calling a chat model
- Clustering — group related content without predefined labels
- Similarity scoring — measure how closely two pieces of text relate
Code examples
Response
embedding field is an array of floats. Store it in any vector database (Pinecone, pgvector, Weaviate, Chroma, etc.) and use cosine similarity or dot product to compare vectors at query time.
Embedding models
| Model ID | Dimensions | Notes |
|---|---|---|
text-embedding-3-small | 1536 | Fast and efficient — good default |
text-embedding-3-large | 3072 | Higher quality for precision-sensitive tasks |
text-embedding-ada-002 | 1536 | Legacy model |
text-embedding-3-small is a good starting point for most use cases. Upgrade to text-embedding-3-large if retrieval quality needs improvement.