I'm developing an SSIS package to extract data from a table in an Oracle 11g database and load it to a table in a SQL Server 2014 database.
The table in the Oracle database has string columns defined as VARCHAR2 (single byte / codepage 8859-1); the table in the SQL Server database has string columns defined as VARCHAR (single byte).
However, when I add an OLE DB source / ADO .NET source to the SSIS package and configure it to select columns from the table in the Oracle source, the External Columns (Advanced Editor) shows the string columns as DT_WSTR (Unicode).
This Oracle article (see section 'How Oracle Unicode Support Works') advises that "When the client character set is not a superset of the server character set or the database character set is a multibyte character set, OraOLEDB automatically enables the Unicode mode."
How do I get the OraOLEDB provider / SSIS OLE DB data source to expose the string columns in the source as single byte?
What I've tried
- Setting the AlwaysUseDefaultCodePage property (of the OLE DB Source) to true and setting the DefaultCodePage property to 1252 (8859-1 is a sub set of this).
- Setting the AlwaysUseDefaultCodePage property (of the OLE DB Source) to true and setting the DefaultCodePage property to 28591 (the Windows equivalent of 8859-1).
- Reinstalled the Oracle client.
Constraints
- I'm transferring 2M+ rows and the duration the package takes to execute is an issue. Ideally, I don't want to have to add a Unicode > Non-Unicode Data Conversion transformation to the data flow that could negatively impact performance.
- I have no control of the Oracle server / database configuration.
Environment Configurations
Windows Client
- Windows 7 Pro
- BIDS / Visual Studio 2013
- Oracle client 11g Revision 5 (32 bit only - 64 bit client not installed)
- Oracle NLS_LANG = ENGLISH_UNITED KINGDOM.WE8MSWIN1252 (Set in the Wow6432Node registry - no overriding environment variable)
SSIS Package / Data Flow Task / OLE DB Data Source
LocaleID = English (United Kingdom)
Client Session Connection Info
SELECT * FROM V$SESSION_CONNECT_INFO
- CLIENT_CHARSET = UTF8 (When query called from SSIS using Full Instant Client)
- CLIENT_CHARSET = Unknown (When query called from SQL Developer using jdbcthin)
Oracle Server
SELECT * FROM SYS.NLS_DATABASE_PARAMETERS
- NLS_LANGUAGE = ENGLISH
- NLS_TERRITORY = UNITED KINGDOM
- NLS_CHARACTERSET = WE8ISO8859P1