QuantizationLoRA AdaptersTool CallingStructured OutputsAutomatic Prefix CachingDisaggregated Prefilling (experimental)Speculative DecodingCompatibility Matrix