b2b_causal_conv1d
- b2b_causal_conv1d(x, weight_proj, weight_mixer, skip_bias)
Back-to-back fused causal 1D convolution, implemented by the subq_ops CUDA kernel.
The fused kernel performs a projection convolution, a pre-gate, a mixer convolution with a skip connection, and a post-gate. See the upstream docstring for the exact algorithm.
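To make the four stages concrete, here is a minimal pure-Python sketch on single-channel sequences. It is not the subq_ops kernel and does not use its API. Several things are assumptions made only for illustration: the helper names `causal_conv1d` and `b2b_reference_sketch`, the split of `x` into three streams, the choice of which streams act as gates, and the form of the skip term (`skip_bias` times the mixer input). The upstream docstring remains the authority on the actual tensor layout and operand order.

```python
def causal_conv1d(x, w):
    """Causal 1D convolution (cross-correlation form) of one channel.

    The input is left-padded with len(w) - 1 zeros, so output[t] depends
    only on x[0..t]. The last tap w[-1] multiplies the current sample x[t].
    """
    k = len(w)
    padded = [0.0] * (k - 1) + list(x)
    return [sum(w[j] * padded[t + j] for j in range(k)) for t in range(len(x))]


def b2b_reference_sketch(x, weight_proj, weight_mixer, skip_bias):
    """Hypothetical reference for the four stages, one channel per stream.

    x           : three equal-length sequences (assumed streams x1, x2, v)
    weight_proj : three short filters, one per stream (assumed layout)
    weight_mixer: one filter applied to the gated signal
    skip_bias   : scalar weight of the skip connection (assumed form)
    """
    # 1. Projection conv: a short causal filter on each stream.
    x1, x2, v = (causal_conv1d(ch, w) for ch, w in zip(x, weight_proj))
    # 2. Pre-gate: elementwise product of two streams (assumed pairing).
    z = [a * b for a, b in zip(x2, v)]
    # 3. Mixer conv with skip: filtered signal plus a scaled copy of its input.
    mixed = causal_conv1d(z, weight_mixer)
    y = [m + skip_bias * zi for m, zi in zip(mixed, z)]
    # 4. Post-gate: elementwise product with the remaining stream.
    return [a * b for a, b in zip(x1, y)]


if __name__ == "__main__":
    streams = [[1.0, 1.0, 1.0], [1.0, 2.0, 3.0], [2.0, 2.0, 2.0]]
    identity = [0.0, 1.0]   # passes the current sample through unchanged
    delay = [1.0, 0.0]      # outputs the previous sample
    print(b2b_reference_sketch(streams, [identity] * 3, delay, 0.5))
```

A fused kernel would compute all four stages without writing the intermediates `z` and `y` back to global memory; the sketch keeps them as separate lists purely for readability.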