gpu_neon: new intrinsics-only implementation