Baseten
BasetenEmbedding #
Bases: OpenAIEmbedding
Baseten class for embeddings.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
model_id
|
str
|
The Baseten model ID (e.g., "03y7n6e3"). |
required |
api_key
|
Optional[str]
|
The Baseten API key. |
None
|
embed_batch_size
|
int
|
The batch size for embedding calls. |
DEFAULT_EMBED_BATCH_SIZE
|
additional_kwargs
|
Optional[Dict[str, Any]]
|
Additional kwargs for the API. |
None
|
max_retries
|
int
|
The maximum number of retries to make. |
10
|
timeout
|
float
|
Timeout for each request. |
60.0
|
callback_manager
|
Optional[CallbackManager]
|
Callback manager for logging. |
None
|
default_headers
|
Optional[Dict[str, str]]
|
Default headers for API requests. |
None
|
Examples:
from llama_index.embeddings.baseten import BasetenEmbedding
# Using dedicated endpoint
# You can find the model_id by in the Baseten dashboard here: https://app.baseten.co/overview
embed_model = BasetenEmbedding(
model_id="MODEL_ID,
api_key="YOUR_API_KEY",
)
# Single embedding
embedding = embed_model.get_text_embedding("Hello, world!")
# Batch embeddings
embeddings = embed_model.get_text_embedding_batch([
"Hello, world!",
"Goodbye, world!"
])
Source code in llama-index-integrations/embeddings/llama-index-embeddings-baseten/llama_index/embeddings/baseten/base.py
15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 |
|