vllm.model_executor.model_loader.dummy_loader ¶
DummyModelLoader ¶
Bases: BaseModelLoader
Model loader that will set model weights to random values.
Source code in vllm/model_executor/model_loader/dummy_loader.py
_process_online_quant_layer ¶
_process_online_quant_layer(
layer: Module, info: LayerReloadingInfo
) -> None
Materialize, apply dummy weights, and run quantization processing.