Slicing a 300MB CuPy array is ~5x slower than NumPy – CentOS
My code involves slicing into 432x432x400 arrays a total of ~10 million times to generate batches of data for neural network training. As these are fairly large arrays (92 million data points/300MB), I was hoping to speed this up using…