I'm new to SSIS Development. I need some guidance from experts on SSIS. Following are the list of questions :
We are having files with sizes from 1GB to 25 GB of type txt or dat files with tab delimited . Some times,these file might contain invalid rows, invalid data types or large of amount of string value which results in failure while importing file data into SQL Tables
Can any one provide use how to perform Data Profiling on the File before loading into actual SQL Tables like
- No of Invalid Row which are having NULL values in a columns
No of Rows present in Files
No of Row with wrong data type value in a column for example: there are chance of file holding
string data in int field - Some time Row Delimiter in between columns for example : If there are 10 columns in a files, If there is row delimiter in between 5-6 column, which leads to current row data from 6 - 10 columns moved to next row. This scenario should be checked in Data Profiler.
- Is there way in SSMS to identify all the columns in each row are of Equal Column length. Incase if it not present throw error. But identifying those rows needs to be faster