gte_neon: implement MVMVA, some fixes