gpu_neon: brand new x86 SSE2+ implementation