Supported ModelsAdding a New ModelEnabling Multimodal InputsEngine ArgumentsUsing LoRA adaptersUsing VLMsSpeculative decoding in vLLMPerformance and Tuning