I very commonly run into this issue:
- I have a CSV file with a list of data in it
- I need to remove duplicates (or sometimes, find the values that are duplicated)
The CSV is easy to bring into Excel, but I can't find (or never remember) a good method to find/remove/count duplicate values.
I can export the data to a database and run some simple SQL queries to do all this, but then the database gets in the way of most other operations, and I wind up exporting the data back out to Excel to do cell-level work.
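For concreteness, here is a rough sketch of the kind of "simple SQL queries" I mean, using Python's built-in `csv` and `sqlite3` modules with an in-memory database. The sample data, the table name `items`, the column name `value`, and the helper names `load_values` and `duplicate_report` are all made up for illustration; my real files vary.

```python
import csv
import io
import sqlite3

# Hypothetical sample data standing in for a real CSV file.
SAMPLE_CSV = "value\napple\nbanana\napple\ncherry\nbanana\napple\n"


def load_values(csv_text):
    """Read the first column of a CSV that has a header row."""
    reader = csv.reader(io.StringIO(csv_text))
    next(reader)  # skip the header row
    return [row[0] for row in reader if row]


def duplicate_report(values):
    """Return (distinct values, [(value, count)] for values seen more than once)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE items (value TEXT)")
    conn.executemany("INSERT INTO items (value) VALUES (?)", [(v,) for v in values])

    # Remove duplicates: one row per distinct value.
    distinct = [
        row[0]
        for row in conn.execute("SELECT DISTINCT value FROM items ORDER BY value")
    ]

    # Find and count duplicates: only values that appear more than once.
    counts = conn.execute(
        "SELECT value, COUNT(*) FROM items "
        "GROUP BY value HAVING COUNT(*) > 1 ORDER BY value"
    ).fetchall()

    conn.close()
    return distinct, counts


distinct, counts = duplicate_report(load_values(SAMPLE_CSV))
print(distinct)
print(counts)
```

This does the job, but it is exactly the round trip I would like to avoid: the data leaves the spreadsheet, and the results have to be brought back in before I can do any cell-level work.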
Is there a tool that makes working with tabular data as easy as Excel, but also includes the more powerful features of a SQL query language?