I am parsing large amounts of complex files (mostly CSV files but some are not) and I need to structure/parse them into some standard formats. This not only involves row wise cleanup of data but some simple individual cell-based logic. I want a tool that a non-programmer can use also so a business team member can write simple drag and drop logic and not take up engineering time. So far, I have looked at Google Refine and Data wrangler and the last one looks great. Are there any other such tools out there?
Asked
Active
Viewed 3,930 times
1 Answers
2
ETL tools are oriented more towards relational databases, but also have support for XML and CSV file input/output. Examples:
Could easily be too complicated for your requirements though. Also, see this similar question on SO (with additional links): What software is availible for data quality checking .
-
Another ETL tool with data cleansing abilities mentioned usually with the two above is [CloverETL](http://www.cloveretl.com) – Jan Tichy Feb 13 '12 at 09:09