Questions tagged [datastage]

DataStage is the ETL (Extract, Transform, Load) component of the IBM InfoSphere Information Server suite. It is a GUI-based client tool that lets users integrate various data sources and targets in an enterprise environment.

Data sources and targets can be database tables, flat files, data sets, CSV files, and so on. The basic design paradigm consists of a unit of work called a DataStage job. Multiple jobs can be controlled and conditionally sequenced using 'Sequences'.

IBM® InfoSphere® DataStage® integrates data across multiple systems using a high performance parallel framework, and it supports extended metadata management and enterprise connectivity. The scalable platform provides more flexible integration of all types of data, including big data at rest (Hadoop-based) or in motion (stream-based), on distributed and mainframe platforms.


InfoSphere DataStage provides these features and benefits:

  • Powerful, scalable ETL platform
  • Support for big data and Hadoop
  • Near real-time data integration
  • Workload and business rules management
  • Ease of use

Support for big data and Hadoop

  • Includes support for IBM InfoSphere BigInsights, Cloudera, Apache and Hortonworks Hadoop Distributed File System (HDFS).
  • Offers Balanced Optimization for Hadoop capabilities to push processing to the data and improve efficiency.
  • Supports big-data governance, including features such as impact analysis and data lineage.

Powerful, scalable ETL platform

  • Manages data arriving in near real-time as well as data received on a periodic or scheduled basis.

  • Provides high-performance processing of very large data volumes.

  • Leverages the parallel processing capabilities of multiprocessor hardware platforms to help you manage growing data volumes and shrinking batch windows.

  • Supports heterogeneous data sources and targets in a single job including text files, XML, ERP systems, most databases (including partitioned databases), web services, and business intelligence tools.

Near real-time data integration

  • Captures messages from Message Oriented Middleware (MOM) queues using Java Message Services (JMS) or WebSphere MQ adapters, allowing you to combine data into conforming operational and historical analysis perspectives.

  • Provides a service-oriented architecture (SOA) for publishing data integration logic as shared services that can be reused over the enterprise.

  • Can simultaneously support the high-speed, high-reliability requirements of transactional processing and the large-volume bulk data requirements of batch processing.

Ease of use

  • Includes an operations console and interactive debugger for parallel jobs to help you enhance productivity and accelerate problem resolution.

  • Helps reduce the development and maintenance cycle for data integration projects by simplifying administration and maximizing development resources.

  • Offers operational intelligence capabilities, smart management of metadata and metadata imports, and parallel debugging capabilities to help enhance productivity when working with partitioned data.

609 questions
-1
votes
1 answer

DataStage: I have a field with a value such as 59.0900 and want to convert it to 05909

I have a field which has a value of, say, 59.0900. I want to convert it to 05909. Can somebody help me with that? I'm working on IBM InfoSphere DataStage.
ramee
  • 47
  • 7
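
The question does not state the conversion rule, so the following is only a sketch under one plausible reading: shift the decimal point two places to the right (59.0900 becomes 5909) and left-pad the result with zeros to five digits. The function name `to_padded_code` and its `shift` and `width` parameters are hypothetical, and the code is plain Python that illustrates the arithmetic, not a DataStage Transformer derivation.

```python
from decimal import Decimal


def to_padded_code(value: str, shift: int = 2, width: int = 5) -> str:
    """Move the decimal point `shift` places right and zero-pad to `width` digits.

    Assumed rule (not stated in the question): 59.0900 -> 5909 -> "05909".
    """
    # Decimal avoids binary floating-point rounding; 59.0900 becomes 5909.00.
    scaled = Decimal(value).scaleb(shift)
    if scaled != scaled.to_integral_value():
        # Digits would be lost here; the question does not say how to handle this.
        raise ValueError(f"{value} has non-zero digits beyond {shift} decimal places")
    return str(int(scaled)).zfill(width)


print(to_padded_code("59.0900"))
```

With the sample value from the question this prints `05909`. The same two steps (scale, then pad to a fixed width) would have to be expressed with whatever string and numeric functions the Transformer stage offers.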
-2
votes
1 answer

issue with reading data from csv file to a dataset

I need your help. I have a simple DataStage job. The source is a CSV file and the target is a dataset. I'm using a Transformer in between to do some transformations. The first column in my CSV contains numbers defined as strings. I have, for example, 3…
diwa
  • 43
  • 8
-2
votes
1 answer

How to hardcode a hexadecimal value in a Transformer?

I have a requirement where I need to hardcode the hexadecimal value x'FF' in a DataStage Transformer where the column length is "1". Could anyone help me achieve this? Thanks, Siva
-2
votes
2 answers

Move files to sFTP

I need to move a couple of files from our DataStage server directories to an outside SFTP server. The connection has been set up and I have the HTTP URL, SFTP URL, user, and password. Any idea how I can achieve that in DataStage?
AGDiaz
  • 53
  • 1
  • 6
-2
votes
1 answer

DataStage v9.1 - run a user-defined SQL query file using the ODBC connector

I want to execute multiple lines of DDL and DML commands from a file in DataStage. I have used the ODBC connector with the write mode set to user-defined SQL, and the SQL statements are available in the file. But the connector stage is…
-2
votes
1 answer

How to use the istool command in the Perl system() function

I am trying to put the command below in the Perl system() function, but I am getting many compilation (syntax) errors. ./istool export -domain serviceshost:9080 -u dsadm -p password -ar test.isx -pre -ds '-base="ENGINEHOST/Dev_Project" Jobs/Batch/\*.*' I was…
Venkatesh k
  • 19
  • 1
  • 5
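
The excerpt is truncated, so the exact errors are unknown. A common source of trouble with commands like this one is nested quoting. One way around it is to pass the command as a list of arguments instead of a single string, which Perl's `system` also accepts. The sketch below shows the idea in Python rather than Perl: it splits the command from the question into the argument list a POSIX shell would produce. It does not run `istool`, and the `subprocess.run` call in the comment assumes `istool` is present in the current directory.

```python
import shlex

# The command from the question, kept verbatim in a raw string so that
# Python itself does not interpret the backslash or the quotes.
COMMAND = r"""./istool export -domain serviceshost:9080 -u dsadm -p password -ar test.isx -pre -ds '-base="ENGINEHOST/Dev_Project" Jobs/Batch/\*.*'"""

# Split the string the way a POSIX shell would, giving one list item per argument.
argv = shlex.split(COMMAND)

for index, arg in enumerate(argv):
    print(index, arg)

# With an argument list no shell is involved, so no further quoting is needed:
#   import subprocess
#   subprocess.run(argv, check=True)
```

The whole single-quoted `-base=... Jobs/Batch/\*.*` text ends up as one list item, with its inner double quotes and backslash intact.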
-3
votes
1 answer

Parallel Shared Container in a DataStage job

I created a DataStage parallel job with a Parallel Shared Container. The job was working fine, and I did not make any changes to it. Since yesterday, the job has suddenly started failing with the error below: main_program: Failed to create collation sequence…
-3
votes
1 answer

ETL Programming Methodology

On every project I have taken part in, DataStage development followed the principle of Keep It Simple. That means every job contains as few sources/targets as possible, so that future maintenance is easy and simple. I noticed that in…
Sara
  • 19
  • 5