Completions

Creates a new chat completion response using the specified AI model.

post

Sends a chat completion request to generate AI responses based on the provided messages. Supports both streaming and non-streaming modes with automatic token billing.

Authorizations
AuthorizationstringRequired

Bearer authentication with JWT

Path parameters
modelIdentifierstring · uuidRequired

The unique identifier of the AI model to use for completion.

Body
streambooleanOptional
max_tokensinteger · int32 · nullableOptional
Responses
chevron-right
200

Successfully generated chat completion response

application/json
post
/inference/completions/{modelIdentifier}

Last updated