Documentation Index
Fetch the complete documentation index at: https://docs.lemondata.cc/llms.txt
Use this file to discover all available pages before exploring further.
For coding agents, discover the current recommended embedding shortlist first with
GET /v1/models?recommended_for=embedding, then send the selected model explicitly to this endpoint.Request Body
Synchronous request timeout: This non-chat endpoint waits for the routed model to finish. Large inputs, long audio, or large batches can exceed common 30s client defaults, so set your HTTP client timeout to at least120s.
ID of the embedding model to use (e.g.,
text-embedding-3-small).Input text to embed. Can be a string or array of strings.
Format for the embeddings:
float or base64.Number of dimensions for the output (model-specific).
A unique identifier representing your end-user for abuse monitoring.
Available Models
| Model | Dimensions | Description |
|---|---|---|
text-embedding-3-large | 3072 | Best quality |
text-embedding-3-small | 1536 | Balanced |
text-embedding-ada-002 | 1536 | Legacy |
Response
Always
list.Array of embedding objects.Each object contains:
object(string):embeddingindex(integer): Index in the input arrayembedding(array): The embedding vector
Model used.
Token usage with
prompt_tokens and total_tokens.