Why is there a difference in filesize between using standalone bzip2 and pandas to_csv(, compression='bz2') function? – Debian
I write out a csv using pandas, and apply bzip2 compression as follows: df.to_csv('/home/user/file.bz2', index=False, mode=writemode, header=header) According to the documentation, to_csv infers from the filename that it needs to compress it using the bzip2 method. This ensures my ~100…