Questions tagged [datastage]

DataStage is the ETL (Extract, Transform, Load) component of the IBM InfoSphere Information Server suite. It allows the user to integrate various data sources and targets in an enterprise environment as a GUI based client tool.

DataStage is the ETL (Extract, Transform, Load) component of the IBM InfoSphere Information Server suite. It allows the user to integrate various data sources and targets in an enterprise environment as a GUI based client tool. Data Sources/Targets could be database tables, flat files, datasets, csv files etc. Basic design paradigm consists of a unit of work called as DataStage job. Multiple jobs can be controlled and conditionally sequenced using 'Sequences'.

IBM® InfoSphere® DataStage® integrates data across multiple systems using a high performance parallel framework, and it supports extended metadata management and enterprise connectivity. The scalable platform provides more flexible integration of all types of data, including big data at rest (Hadoop-based) or in motion (stream-based), on distributed and mainframe platforms.

Read more here

InfoSphere DataStage provides these features and benefits:

  • Powerful, scalable ETL platform
  • Support for big data and Hadoop
  • Near real-time data integration
  • Workload and business rules management
  • Ease of use

Support for big data and Hadoop

  • Includes support for IBM InfoSphere BigInsights, Cloudera, Apache and Hortonworks Hadoop Distributed File System (HDFS).
  • Offers Balanced Optimization for Hadoop capabilities to push processing to the data and improve efficiency.
  • Supports big-data governance including features such as impact analysis and data lineage

Powerful, scalable ETL platform

  • Manages data arriving in near real-time as well as data received on a periodic or scheduled basis.

  • Provides high-performance processing of very large data volumes.

  • Leverages the parallel processing capabilities of multiprocessor hardware platforms to help you manage growing data volumes and shrinking batch windows.

  • Supports heterogeneous data sources and targets in a single job including text files, XML, ERP systems, most databases (including partitioned databases), web services, and business intelligence tools.

Near real-time data integration

  • Captures messages from Message Oriented Middleware (MOM) queues using Java Message Services (JMS) or WebSphere MQ adapters, allowing you to combine data into conforming operational and historical analysis perspectives.

  • Provides a service-oriented architecture (SOA) for publishing data integration logic as shared services that can be reused over the enterprise.

  • Can simultaneously support high-speed, high reliability requirements of transactional processing and the large volume bulk data requirements of batch processing.

Ease of use

  • Includes an operations console and interactive debugger for parallel jobs to help you enhance productivity and accelerate problem resolution.

  • Helps reduce the development and maintenance cycle for data integration projects by simplifying administration and maximizing development resources.

  • Offers operational intelligence capabilities, smart management of metadata and metadata imports, and parallel debugging capabilities to help enhance productivity when working with partitioned data.

609 questions
0
votes
2 answers

Datastage -hierarchial stage-mutiple spaces are trimmed to single space in varchar feild

In Datastage, I am using XML hierarchial to parse the source XML files. For the Varchar columns, the leading and trailing spaces are getting trimmed and as well as the mutiple spaces are trimmed to single spaces. But I need the source data as it is…
0
votes
0 answers

DataStage server jobs are loading only 6-10 rows per second using DB2 stage in Database

I am facing a weird error since last 1 month of with 4 specific load jobs which are not dependent on each other. Lets consider them job1, job2, job3 and job4 which has the target table as T1, T2, T3 and T4. All four jobs has file as the source and…
0
votes
1 answer

IBM DataStage - unstructured data stage imports NULL from excel cells that contain formulas

I'm loading an excel file into a dataset using DataStage 11.7.1.1. The excel file has some columns which contain formulas: column L=D4+E4+F4. All the cells that contain formulas are imported as NULL in the dataset. The import option is set as value.…
C.B.Luca
  • 1
  • 2
0
votes
1 answer

How to add 0 before in the month field using transformer in datastage

I would like to add 0 for month field For example from source I receiving 1 and I would like to add 0 before 1 Example: Source - 1 Target - 01 Field Data type - Integer
Krishna
  • 25
  • 4
0
votes
0 answers

Sort - U The system cannot find the file specified

We are using this line of unix commands inside a Datastage Command Stage. It used to work fine on an older setup but keeps failing on this new server we stood up. Server is Windows but MKS toolkit allows us to run unix commands inside Datastage. We…
irshan syed
  • 89
  • 1
  • 6
0
votes
0 answers

DataStage integration with Gitlab

I Need help in how we can integrate GITLAB with ETL (Data stage) tool. Currently the developers are exporting the file from data stage and pushing it manually to the Gitlab repositories. Need automate this process. I tried checking out metalci but…
0
votes
0 answers

SQLSTATE = 42S02: Native Error Code = -204: Msg = [IBM][CLI Driver][DB2/AIX64] SQL0204N "SYSIBMADM.ENV_INST_INFO" is an undefined name

I am trying to run a DataStage job code of 8.7 version on 11.7. job has below stages: output file1 output File2 ^ ^ | | Db2 --> transform <-- hash file…
Naresh
  • 1
0
votes
1 answer

how to call DataStage job from an API

I have a requirement to call /run a DataStage job from an API. Please provide reference links/websites/steps API-->DS job
0
votes
0 answers

Datastage 11.7 odbc connector error loading data into SQL Server

I am trying to Extract data from Snowflake,put it in a dataset and use that to Load the same table in SQL Server. Extract part executed without any issues but when i am trying to load into SQl server i am getting this error. "Designed Schema field…
John Paul
  • 1
  • 1
0
votes
0 answers

Migrating from IBM Datastage to SSIS

Good Afternoon, Does anyone know how to migrate IBM Datastage jobs into SSIS? OR has anyone done it and can offer any tips and tricks or a walkthrough. Thank you.
Steven
  • 11
  • 5
0
votes
1 answer

Where those variable come from in datastage

I have a User Variables Activity in datastage job like this. enter image description here Name Expression START_DT If START_DT = "" Then Ereplace(Oconv(@DATE-1, "D-YMD[4,2,2]","-","") ELSE START_DT START_DT2 If START_DT = "" Then…
buzzy
  • 15
  • 3
0
votes
1 answer

in datastage, parameter of job acitivity in job and job properties parameter of that job activity are always get same parameters?

I have a question... (sorry about i'm a beginner at datastage) First Image, show seq_1000_mas job. and in that seq_1000_mas_job, there is a seq_1100_src job(second image). I found that parameters of seq_1100_src in seq_1000_mas_job have same…
buzzy
  • 15
  • 3
0
votes
1 answer

Calculate Age based on DOB

I am using IBM DataStage Below is transformer stage expression, where I have date from Link_4.DOB with format of %yyyy%mm%dd (20231221). I want to calculate the age: Below is my simple try, but it's not an accurate DOB YearFromDate(CurrentDate()) -…
william007
  • 17,375
  • 25
  • 118
  • 194
0
votes
2 answers

User Variable Activity Scope in datastage

for example, enter image description here if I set parameter (ex: index = 3) at UserVariable and then, If i want to use this parameter in Job_Acitivity_312, I have to set paramter also like this(in Job_Activity_312 job), right? enter image…
buzzy
  • 15
  • 3
0
votes
1 answer

Pulling Data using Oracle View in Data Stage

I have to pull data using VIEW on Oracle and insert them into one table on Oracle. Even though the source connector for the View in Data Stage can show Data by [View Data] option, ETL job fails when I run it. Target connector also responds when I…
llearner
  • 37
  • 5