vllm.v1.simple_kv_offload.metadata ¶
Metadata for SimpleCPUOffloadConnector.
SimpleCPUOffloadMetadata dataclass ¶
Bases: KVConnectorMetadata
Metadata passed from scheduler to worker for CPU offload operations.
The worker receives flat block lists keyed by a monotonic event_idx. Job->req_id translation is handled by the scheduler-side manager (via inverse maps), so the worker never knows about request identities.
Source code in vllm/v1/simple_kv_offload/metadata.py
SimpleCPUOffloadWorkerMetadata dataclass ¶
Bases: KVConnectorWorkerMetadata
Worker -> Scheduler metadata for completed store events.
Each worker reports {event_idx: 1} for newly completed stores. aggregate() sums counts across workers within a step. The scheduler-side manager accumulates across steps and processes a store completion only when count reaches world_size.