vllm.v1.worker.gpu.model_states.interface
ModelSpecificAttnMetadata
Base class for model-specific attention metadata.
vllm/v1/worker/gpu/model_states/interface.py
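As a rough illustration of how a base class like this is typically used, the sketch below subclasses it to carry one model-specific value. It does not import vLLM: `ModelSpecificAttnMetadata` here is a local stand-in assumed to declare no fields, and `SlidingWindowAttnMetadata`, `window_size`, and `describe` are hypothetical names invented for the example, not part of the vLLM API.

```python
from dataclasses import dataclass


class ModelSpecificAttnMetadata:
    """Local stand-in for the vLLM base class (assumed to declare no fields)."""


@dataclass
class SlidingWindowAttnMetadata(ModelSpecificAttnMetadata):
    """Hypothetical subclass carrying a single model-specific value."""

    window_size: int


def describe(meta: ModelSpecificAttnMetadata) -> str:
    # Dispatch on the concrete type; fall back for anything unrecognized.
    if isinstance(meta, SlidingWindowAttnMetadata):
        return f"sliding window of {meta.window_size} tokens"
    return "no model-specific fields"
```

Code that only needs to pass the metadata along can annotate against the base type, while code that depends on a particular model's fields checks for the concrete subclass first.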