Questions tagged [pentaho-data-integration]

Tag to be used for Pentaho Data Integration (all versions). Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

It provides intuitive drag-and-drop data integration coupled with data agnostic connectivity spanning from flat files and RDBMS to Hadoop and beyond.

Features:

  • Graphical extract-transform-load (ETL) designer to simplify the creation of data pipelines
  • Rich library of pre-built components to access, prepare, and blend data from relational sources, big data stores, enterprise applications, and more
  • Powerful orchestration capabilities to coordinate and combine transformations, including notifications and alerts
  • Agile views for modeling and visualizing data on the fly during the data preparation process
  • Integrated enterprise scheduler for coordinating workflows and debugger for testing and tuning job execution
825 questions
0
votes
1 answer

"Get XML Data" step of pentaho is not able to read same xml file sometimes

I am using pentaho kettle tool for ETL job. In the job, one of the step(Get XML Data) is not able to read/parse xml file sometime. Sometime same XML file didn't throw any exception and sometime it threw. The list of errors are as given below - 1)…
0
votes
0 answers

Pentaho Data Integration: Data Warehouse dimensions: Transformations in one job

I'm creating a Data Warehouse and I'm using Pentaho Data Integration for the ETL. I have 2 options: Each dimension ETL in its own transformation (I have to open all the transformation files to run my dimensions), I thought this might be practical…
0
votes
1 answer

Number format in Pentaho Data Integration

I'm using Pentaho Data Integration Version: 8.0.0.0-28. Steps used in transformation: Table input (Connection: MySQL database) Select values Insert/ Update (Connection: MySQL database with the same data types as the input) The input table has a…
metk
  • 69
  • 1
  • 10
0
votes
1 answer

Pentaho Server error: A serious error occurred during job execution: Could not initialize class java.net.NetworkInterface

I recently did a yum update all on one of my aws linux instances and rebooted my pentaho server. It was 200+ days since a reboot. I then tried to run my existing pentaho jobs and got this error in the carte.log file for any and all of my jobs. log…
DJ_007
  • 1
  • 1
0
votes
1 answer

PDI 8.0 S3 CSV Input

I'm trying to connect to a S3 bucket using S3 CSV Input and got an error. I tried on 7.1 PDI version and worked fine, but on 8.0 PDI I couldn't even select a bucket.
M.Vitti
  • 13
  • 4
0
votes
0 answers

Pentaho Job stays at start and does not continue

So, I start running my Job with just one transformation, it starts and stays that way. It doesn´t throw any errors. The transformation alone runs perfect and fast, but the Job doesn´t. What can the problem be? I´ve been searching without…
0
votes
1 answer

Compare one filed in different csv using Pentaho

I have 2 csv files . 1) CSV that contains 10 records and one filed named city . This field has value in short forms like "M" for "Melbourne", etc . 2) Second CSV has two fields like , "cityshortname" and "city" . What i want to do is compare the…
0
votes
1 answer

How to validate one csv data compare with another csv file using Pentaho?

I have two csv file . In one file i have 10 rows and in another list of data . What i want to do is , check the data of one filed of first csv and compare it with another csv file . So how can i achieve this ? Any help would be great .
0
votes
2 answers

PDI - Multiple file input based on date in filename

I'm working with a project using Kettle (PDI). I have to input multiple file of .csv or .xls and insert it into DB. The file name are AAMMDDBBBB, where AA is code for city and BBBB is code for shop. MMDD is date format like MM-DD. For example…
Rio Odestila
  • 125
  • 2
  • 19
0
votes
1 answer

(stderr) =256m""=="" was unexpected at this time in Pentaho Shell script

This question is almost similar to pentaho: error (stderr) =256m""=="" was unexpected at this time. while calling kitchen command (dos command using shell script job entry) from job But It differs in clarity as I am providing exact details and also…
0
votes
1 answer

Kettle does not fetch data into dashboard table

I'm trying to use kettle query in pentaho cde, I uploaded ktr file, then browse it from kettle query and determined the step of transformation but when I preview dashboard, table that use from kettle datasource is empty. What's the problem?
stella
  • 117
  • 1
  • 2
  • 8
0
votes
1 answer

Pentaho Data Integration - Star schema (PostgreSQL)

I have a CSV file with data and the database I need for the star schema. However, the CSV file doesn't have the ID's of the dimensions tables (the primary keys), which means I only get those ID's after inserting the data into the dimension's tables…
0
votes
0 answers

Can we integrate Pentaho Data Integration tool with .NET

I have Dot Net application, Now I want to integrate and call Pentaho Data Integration jobs with existing application asp.net c# is there any way to do this. How I could integrate Pentaho with .NET looking for some tutorials or articles but here is…
0
votes
0 answers

Why would New Repository Connection window not load?

I've freshly installed windows-10 Pro N(64-bit)(not activated & fully updated to date) and downloaded pentaho from this source. After installing java (64-bit) and creating an environment variable PENTAHO_JAVA_HOME with the path to its JRE folder I…
Nae
  • 14,209
  • 7
  • 52
  • 79