Wavefront Operations
These intrinsics provide efficient operations across wavefronts.
AMDGPU.wfred — Functionwfred(op::Function, val::T) where T -> TPerforms a wavefront-wide reduction on val in each lane, and returns the result. A limited subset of functions are available to be passed as op. When op is one of (+, max, min, &, |, ⊻), T may be <:Union{Cint, Clong, Cuint, Culong}. When op is one of (+, max, min), T may also be <:Union{Float32, Float64}.