Optional app
Optional chat_
Optional frequency_
Optional max_
Optional min_
Optional presence_
Optional repetition_
Optional response_
    Per-request structured-output constraint (OpenAI-compatible).
Optional sogni_
Optional sogni_
Optional stop
Optional stream
Optional task
Optional temperature
Optional token
Optional tool_
Optional tools
Optional top_
Optional top_
Per-request chat template arguments (e.g. { enable_thinking: false } for llama.cpp).
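
The following is a minimal sketch of how per-request chat template arguments and a structured-output constraint might be passed alongside the other optional parameters. The property names are truncated in the listing above, so chat_template_kwargs, response_format, and messages are assumptions borrowed from common OpenAI-compatible conventions, and ChatRequestSketch and buildRequest are hypothetical names used only for illustration. Only stop, stream, and temperature appear in full in this reference.

```typescript
// Hypothetical request shape. Field names marked "assumed" are NOT confirmed
// by this reference, where the corresponding names are truncated.
interface ChatRequestSketch {
  // assumed: conversation payload in the usual OpenAI-compatible form
  messages: { role: "system" | "user" | "assistant"; content: string }[];
  temperature?: number;
  stream?: boolean;
  stop?: string[];
  // assumed name for the per-request chat template arguments
  chat_template_kwargs?: Record<string, unknown>;
  // assumed name and shape for the structured-output constraint
  response_format?: { type: "json_object" } | { type: "text" };
}

// Builds a request that disables thinking in the chat template and asks
// for JSON output. All optional fields may be omitted entirely.
function buildRequest(prompt: string): ChatRequestSketch {
  return {
    messages: [{ role: "user", content: prompt }],
    temperature: 0.2,
    stream: false,
    chat_template_kwargs: { enable_thinking: false },
    response_format: { type: "json_object" },
  };
}

const request = buildRequest("List three primes as JSON.");
console.log(JSON.stringify(request, null, 2));
```

Because every one of these parameters is optional, a caller can set only the ones relevant to the backend in use. The enable_thinking flag is the example this reference gives for llama.cpp.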