Postgresql – The most fast and efficient way to clean ~100GB csv from duplicates by one column
I have ~100GB csv file with following columns: sex;name;dob;hash This files was created after some processing of another .csv file. And it can contain tuples, that's why there is this hash column. What I need is to delete duplicates from…