Questions tagged [datastage]

DataStage is the ETL (Extract, Transform, Load) component of the IBM InfoSphere Information Server suite. It is a GUI-based client tool that lets users integrate various data sources and targets in an enterprise environment.

Data sources and targets can be database tables, flat files, data sets, CSV files, and so on. The basic design paradigm is a unit of work called a DataStage job. Multiple jobs can be controlled and conditionally sequenced using 'Sequences'.

IBM® InfoSphere® DataStage® integrates data across multiple systems using a high-performance parallel framework, and it supports extended metadata management and enterprise connectivity. The scalable platform provides more flexible integration of all types of data, including big data at rest (Hadoop-based) or in motion (stream-based), on distributed and mainframe platforms.

InfoSphere DataStage provides these features and benefits:

  • Powerful, scalable ETL platform
  • Support for big data and Hadoop
  • Near real-time data integration
  • Workload and business rules management
  • Ease of use

Support for big data and Hadoop

  • Includes support for IBM InfoSphere BigInsights, Cloudera, Apache and Hortonworks Hadoop Distributed File System (HDFS).
  • Offers Balanced Optimization for Hadoop capabilities to push processing to the data and improve efficiency.
  • Supports big-data governance, including features such as impact analysis and data lineage.

Powerful, scalable ETL platform

  • Manages data arriving in near real-time as well as data received on a periodic or scheduled basis.

  • Provides high-performance processing of very large data volumes.

  • Leverages the parallel processing capabilities of multiprocessor hardware platforms to help you manage growing data volumes and shrinking batch windows.

  • Supports heterogeneous data sources and targets in a single job including text files, XML, ERP systems, most databases (including partitioned databases), web services, and business intelligence tools.

Near real-time data integration

  • Captures messages from message-oriented middleware (MOM) queues using Java Message Service (JMS) or WebSphere MQ adapters, allowing you to combine data into conforming operational and historical analysis perspectives.

  • Provides a service-oriented architecture (SOA) for publishing data integration logic as shared services that can be reused across the enterprise.

  • Can simultaneously support the high-speed, high-reliability requirements of transactional processing and the large-volume bulk data requirements of batch processing.

Ease of use

  • Includes an operations console and interactive debugger for parallel jobs to help you enhance productivity and accelerate problem resolution.

  • Helps reduce the development and maintenance cycle for data integration projects by simplifying administration and maximizing development resources.

  • Offers operational intelligence capabilities, smart management of metadata and metadata imports, and parallel debugging capabilities to help enhance productivity when working with partitioned data.

609 questions
1 vote, 1 answer

Datastage XML Generation Issue

To generate a new XML file, I first created an XSD and then built the job as follows: Oracle connector > XML > XML_Output. In the XML stage's Edit Assembly, in the XML Composer step, I chose the "Write to File" option and provided the output file directory and…
NMB • 39 • 7

1 vote, 1 answer

Unicode character not visible while doing cat

I have a CSV file generated by a Windows system. The file is then moved to Linux. The Linux environment is NAME="Red Hat Enterprise Linux Server", VERSION="7.3 (Maipo)", ID="rhel". When I use the vi editor, all characters are visible. For example, one…
adhithiyan • 168 • 1 • 9

1 vote, 1 answer

Abnormal termination of stage in Datastage

I have a DX server job in DataStage v8.1. It has a very simple flow: DRS stage --> Transformer --> seq file stage. In the DRS stage I have an Oracle SQL query (a complex join query). I am able to view data through the VIEW DATA option in the DRS stage, but when I…
fairplay • 67 • 1 • 1 • 8

1 vote, 1 answer

Talend equivalent of Datastage "Dataset" or store intermediate result in talend

I'm trying to find the Talend equivalent of the IBM InfoSphere DataStage "Dataset" component. In other words, what is the best way to store intermediate results in Talend? The purpose of storing the result is to use it in some other job as a…
user7343922 • 316 • 4 • 17

1 vote, 1 answer

ORA-01410: invalid ROWID

When I try to fetch updates on the source table with the code below, I get this error: Error code: 1,410, Error message: ORA-01410: invalid ROWID ORA-06512: at "DS2ODS_DW_PRODUCT", line 43. The updated rows cannot be mirrored to the target. (NOTE:…
Lyrk • 1,936 • 4 • 26 • 48

1 vote, 1 answer

In Datastage, in oracle connector stage can I use parameters in external sql file?

I am using the Oracle connector stage and have selected the "Read select statement from file" option. In the SQL file I am using a parameter, as in where eff_start_date = #eff_start_date#. I have defined the eff_start_date parameter in the job and I am also…
Shiva Sharma • 121 • 1 • 6

1 vote, 1 answer

Convert Varchar2(24-JAN-2016) to number (YYYYMMDD)

I want to convert varchar2(24-JAN-16) to number(20160124) in Oracle and Datastage. Can you help me? Thanks in advance.
Wiz • 113 • 12
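
As an illustration of the conversion asked about in the question above, here is a minimal Python sketch, not an Oracle or DataStage solution. It assumes the input always looks like the body's example (24-JAN-16: day, English month abbreviation, two-digit year) and that the default C/English locale is in effect. The helper name to_yyyymmdd is hypothetical.

```python
from datetime import datetime


def to_yyyymmdd(text: str) -> int:
    """Parse a DD-MON-YY string and return the date as a YYYYMMDD integer.

    Assumes English month abbreviations; %b matching is case-insensitive.
    """
    parsed = datetime.strptime(text, "%d-%b-%y")
    return int(parsed.strftime("%Y%m%d"))


print(to_yyyymmdd("24-JAN-16"))  # 20160124
```

The same two-step idea (parse the string into a date, then format the date as digits) carries over to other tools; only the format-mask syntax differs.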

1 vote, 3 answers

Figuring out what this Trim() function is doing?

I have a constraint on a transformer with this: Trim(CollectFrom.collect_from,"-","A")<=TheDate Here is what collect_from looks like: '2017-02-27' And here is what TheDate looks like: '20170227' I am unsure exactly how this Trim() function works.…
camohreally • 149 • 3 • 4 • 17
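
The constraint in the question above can be read through a small Python emulation. This is a sketch that assumes the "A" option of Trim() means "remove all occurrences of the given character"; the helper trim_all is hypothetical and is not a DataStage API.

```python
def trim_all(value: str, char: str) -> str:
    """Emulate Trim(value, char, "A") under the stated assumption:
    drop every occurrence of char from value."""
    return value.replace(char, "")


collect_from = "2017-02-27"
the_date = "20170227"

normalized = trim_all(collect_from, "-")
print(normalized)              # 20170227
print(normalized <= the_date)  # True
```

With the dashes removed, both values are fixed-width YYYYMMDD strings, so a plain string comparison orders them the same way a date comparison would.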

1 vote, 0 answers

ODBC Connection to Microsoft Azure SQL "The server does not support SSL"

I am trying to establish a connection to a Microsoft Azure SQL database using an ODBC connection (IBM Datastage v11.5) but I get the following error: Connection failed ODBC function "SQLConnect" reported: SQLSTATE=HY000: Native Error Code=0;…
Jopela • 5,415 • 2 • 18 • 19

1 vote, 1 answer

Error using datastage to connect to informix database

When I use DataStage to connect to an Informix database, I get an error: main_program: PATH search failure: main_program: Error loading "orchinformix": Could not load "orchinformix": libifasf.so: wrong ELF class: ELFCLASS32. main_program: Could…
Jason • 73 • 2 • 17

1 vote, 2 answers

IBM DataStage: Evaluate string as code/expression

I have a complex transformation where a lookup stage specifies one of approximately 30 different/specific string operations that has to be done on a row. I am wondering how to do this efficiently in DataStage? The requirement is something like…

1 vote, 0 answers

DataStage gives error using partitioned reads to Netezza

Very often we are unable to use partitioned reads in the Netezza connector. Example: when partitioned read = Yes and Generated SQL at Runtime = Yes, this works: SELECT "Firma", "KundeNr", "ArtikkelNr"," LagerstedNr" FROM dwhusr."TI_FT_Salg" When…
gfredhei • 11 • 1

1 vote, 1 answer

DataStage - issue with an input sequential file with pipe delimited

Source: Text file in UNIX box
DS Stage: Sequential file
Sample record (Line 1): Div03|Fac-12|Labor|2,543.30
Short desc: Stage is pipe delimited and all columns are VarChar, connected to a transformer to convert to Decimal(19,2); output is a table in…
boombox2014 • 65 • 2 • 8
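
The excerpt above is truncated, so the actual failure is unknown. Purely as an illustration of the sample record, the Python sketch below splits the pipe-delimited line and converts the last field to a two-decimal value. It assumes the comma in 2,543.30 is a thousands separator that must be stripped before conversion; parse_record is a hypothetical helper, not a DataStage function.

```python
from decimal import Decimal


def parse_record(line: str):
    """Split one pipe-delimited record and convert the amount field.

    Assumes exactly four fields and a comma thousands separator.
    """
    division, facility, category, amount = line.rstrip("\n").split("|")
    cleaned = amount.replace(",", "")  # "2,543.30" -> "2543.30"
    value = Decimal(cleaned).quantize(Decimal("0.01"))
    return division, facility, category, value


print(parse_record("Div03|Fac-12|Labor|2,543.30"))
```

Stripping the separator before the numeric conversion is the key step; a string that still contains the comma is not a valid decimal literal for most converters.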

1 vote, 0 answers

Reading hex value in datastage

We have a mainframe file which we are trying to read using Complex Flat File Stage. The column has data type PIC X(1) which we are reading as char(1) and assigning to char (10). Problem is it converts to value "26" when the value should be 30. The…
dna • 483 • 3 • 10 • 32

1 vote, 1 answer

Datastage 8.1 Change temporary directories for SORT operations

Does anyone know how to change the temporary directory location in the DataStage configuration? My problem is that while a SORT operation runs, the UNIX directories /var and /tmp fill beyond their limits and the whole process fails. I tried changing dsenv…
Osy • 1,613 • 5 • 21 • 35