target/arm: Fix VUDOT/VSDOT (scalar) on big-endian hosts

The helper functions for performing the udot/sdot operations against a scalar were not using an address-swizzling macro when converting the index of the scalar element into a pointer into the vm array. This had no effect on little-endian hosts but meant we generated incorrect results on big-endian hosts. For these insns, the index is indexing over group of 4 8-bit values, so 32 bits per indexed entity, and H4() is therefore what we want. (For Neon the only possible input indexes are 0 and 1.) Signed-off-by: Peter Maydell <peter.maydell@linaro.org> Reviewed-by: Richard Henderson <richard.henderson@linaro.org> Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org> Message-id: 20201028191712.4910-3-peter.maydell@linaro.org
author: Peter Maydell <peter.maydell@linaro.org> 2020-11-02 16:52:15 +0000
committer: Peter Maydell <peter.maydell@linaro.org> 2020-11-02 16:52:15 +0000
commit: d1a9254be5cc93afb15be19f7543da6ff4806256 (patch)
tree: 0b3ba2b93095b519746dc2664fd26ebdc2ebe1b9 /target/arm
parent: 552714c0812a10e5cff239bd29928e5fcb8d8b3b (diff)
download: qemu-d1a9254be5cc93afb15be19f7543da6ff4806256.zip
1 files changed, 2 insertions, 2 deletions
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
index 30d76d05be..0f33127c4c 100644
--- a/target/arm/vec_helper.c
+++ b/target/arm/vec_helper.c
@@ -293,7 +293,7 @@ void HELPER(gvec_sdot_idx_b)(void *vd, void *vn, void *vm, uint32_t desc)
     intptr_t index = simd_data(desc);
     uint32_t *d = vd;
     int8_t *n = vn;
-    int8_t *m_indexed = (int8_t *)vm + index * 4;
+    int8_t *m_indexed = (int8_t *)vm + H4(index) * 4;
 
     /* Notice the special case of opr_sz == 8, from aa64/aa32 advsimd.
      * Otherwise opr_sz is a multiple of 16.
@@ -324,7 +324,7 @@ void HELPER(gvec_udot_idx_b)(void *vd, void *vn, void *vm, uint32_t desc)
     intptr_t index = simd_data(desc);
     uint32_t *d = vd;
     uint8_t *n = vn;
-    uint8_t *m_indexed = (uint8_t *)vm + index * 4;
+    uint8_t *m_indexed = (uint8_t *)vm + H4(index) * 4;
 
     /* Notice the special case of opr_sz == 8, from aa64/aa32 advsimd.
      * Otherwise opr_sz is a multiple of 16.
author	Peter Maydell <peter.maydell@linaro.org>	2020-11-02 16:52:15 +0000
committer	Peter Maydell <peter.maydell@linaro.org>	2020-11-02 16:52:15 +0000
commit	d1a9254be5cc93afb15be19f7543da6ff4806256 (patch)
tree	0b3ba2b93095b519746dc2664fd26ebdc2ebe1b9 /target/arm
parent	552714c0812a10e5cff239bd29928e5fcb8d8b3b (diff)
download	qemu-d1a9254be5cc93afb15be19f7543da6ff4806256.zip