Skip to content

vllm.entrypoints.pooling.utils

encode_pooling_output_float_or_ndarray

encode_pooling_output_float_or_ndarray(
    output: PoolingRequestOutput,
) -> Any

Return an ndarray when the response renderer can serialize NumPy.

Source code in vllm/entrypoints/pooling/utils.py
def encode_pooling_output_float_or_ndarray(output: PoolingRequestOutput) -> Any:
    """Return an ndarray when the response renderer can serialize NumPy."""
    try:
        data = output.outputs.data
        if not data.is_contiguous():
            data = data.contiguous()
        return data.numpy()
    except (RuntimeError, TypeError):
        return output.outputs.data.tolist()