Responses
Endpoints
Responses
Create a model response using the Responses API format.
POST
Responses
Creates a model response using the Responses API request and response format. Like all inference
endpoints, requests are served through the attested TEE gateway and return an
The
x-receipt-id header
for verification.
Request Body
Model id, for example
qwen/qwen3-30b-a3b-instruct-2507 or openai/gpt-4o.The input prompt. A string, or an array of input items.
Tool definitions the model may call.
Sampling temperature.
Nucleus sampling.
Example
Response
output array contains items by type. A message item holds the assistant content; a
reasoning item appears for reasoning models. Read assistant text from
output[].content[].text on the message item.