Amazon web services – Unzipping multiple files from 1 zip files using EMR
I have multiple zip files called f1.zip, f2.zip, ... f7.zip each contain around 200k xml files and I am using this code to multiprocess to unzip them in parallel, but even if each file is very small, the read/write process…