I have two CSV files, each about 4 GB with about 1 million rows. The first column is a unique integer key.
Most of the contents should be the same, but a few percent will differ. I could settle for comparing only the unique keys and getting a report of the orphans on each side.
I tried a regular table compare, but it seems to take "forever". It has been "thinking" for about an hour now.
Any better way to do this?
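
To make the keys-only comparison concrete, here is a minimal sketch of what I have in mind. It assumes the key is the first column, that it parses as an integer, and that any header row can be skipped because its first field is not an integer. The names `read_keys` and `orphan_report` are made up for illustration. It streams each file once and keeps only the keys in memory, not the full rows.

```python
import csv


def read_keys(path):
    """Collect the integer key from the first column of every row."""
    keys = set()
    with open(path, newline="") as f:
        for row in csv.reader(f):
            if not row:
                continue  # skip blank lines
            try:
                keys.add(int(row[0]))
            except ValueError:
                continue  # assumption: non-integer first field is a header
    return keys


def orphan_report(left_path, right_path):
    """Return (keys only in left, keys only in right), each sorted."""
    left = read_keys(left_path)
    right = read_keys(right_path)
    return sorted(left - right), sorted(right - left)
```

With about 1 million integer keys per file, the two sets should fit comfortably in memory. The run time would then mostly be the cost of reading each file once.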