Visual Studio Code – Simple AVX512 dot-product loop only 10.6x faster, expected 16x
The task is to sum the products of multiplying each float in array A with the corresponding element in array B. The arrays could have tens of thousands of elements, and must run say 100,000x sec to handle a real-time…