vllm.kernels.aiter_ops ¶
AITER_SUPPORTED module-attribute ¶
Most kernels in this file are supported if AITER is installed.
aiter_lib module-attribute ¶
aiter_lib = Library('vllm_aiter', 'FRAGMENT')
This library holds torch custom ops for wrapped AITER ops. Many AITER ops want to remain invisible to torch.compile even after lowering. They are thus wrapped into torch custom ops inside the IR op implementations.
direct_register_aiter_op module-attribute ¶
direct_register_aiter_op = partial(
direct_register_custom_op, target_lib=aiter_lib
)
Syntactic sugar for registering AITER custom ops.
rms_no_var_16bit_only module-attribute ¶
rms_no_var_16bit_only = (
lambda x, weight, epsilon, variance_size=None: (
variance_size is None
and dtype in (float16, bfloat16)
)
)
AITER rms_norm only supports float16 and bfloat16 acts and no var_size override.