Questions tagged [pentaho-data-integration]

Tag to be used for Pentaho Data Integration (all versions). Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

It provides intuitive drag-and-drop data integration coupled with data agnostic connectivity spanning from flat files and RDBMS to Hadoop and beyond.

Features:

  • Graphical extract-transform-load (ETL) designer to simplify the creation of data pipelines
  • Rich library of pre-built components to access, prepare, and blend data from relational sources, big data stores, enterprise applications, and more
  • Powerful orchestration capabilities to coordinate and combine transformations, including notifications and alerts
  • Agile views for modeling and visualizing data on the fly during the data preparation process
  • Integrated enterprise scheduler for coordinating workflows and debugger for testing and tuning job execution
825 questions
0
votes
1 answer

How to copy latest files from source folder to dest folder in same sftp using pentaho

I have to copy the latest file from source folder to destination folder in same sftp location using pentaho. i have tried with "copy step"step from kettle jobs by providing sftp source and destination details in file/folder source and file/folder…
user3625039
  • 1
  • 1
  • 3
0
votes
2 answers

How to lowercase field name in pdi (pentaho)?

I'm actually new to PDI and i need to do some extract from csv however sometimes field name are in lowercase or uppercase. I know how to modify it for rows but don't know how to do it for fields names. Does exist a step to do it? I tried…
0
votes
1 answer

pentaho, spoon: how to validate text file content and move file after processing

I am doing a work for a master with pentaho and I would like to know hot to di something. I receive a text file that I have to validate the content(type, lenght,..), if the content is correct the file is proccesed but if there is any bad field I…
JJoe
  • 1
  • 1
0
votes
1 answer

Pentaho: how to retrieve data stored in two files: one for mapping and other for rows

I need read these data and then load them into oracle tables using (if possible) a single spoon transformation. The data are stored in pairs of two files. Each pair has one file (data-type_yyyymmdd.txt) with each row placed in each line ended with…
gwarah
  • 241
  • 2
  • 10
0
votes
0 answers

Pentaho Spoon chrashes when rightclicking in the Explorer

Every time I right-click in the Explorer on the left side (e.g. on Database Connections in the View-tab), Spoon crashes and I have to terminate it via task manager. The right click works on any other part of Spoon, except in the Explorer. I tried…
MConan
  • 171
  • 1
  • 2
  • 8
0
votes
0 answers

Metadata injection on JSON input

I'm creating a job that takes the inputs of a json input from a table and i'm trying to do a etl metadata injection, i tried with v5.4,v6.0 but non is working. Is there any work around for my scenario?
0
votes
1 answer

Kettle or PDI: share same DB connection between different steps

I've multiple DB connections in a Kettle (Pentaho Data Integration) trasformation. There are some lookup steps and a table output steps. They must use the same DB connection. I'm searching a way to change the DB connection in a step and…
Daniele Licitra
  • 1,520
  • 21
  • 45
0
votes
1 answer

Left Outer join drops rows in Pentaho DI

I've got two data sets (csv files) which I bring into a Pentaho DI transformation using a "CSV file input" step. File A has a column joincol whose values appear in file B in column joincol. I want to left join the data in file A with the data in…
Rookatu
  • 1,487
  • 3
  • 21
  • 50
0
votes
0 answers

how to combine the column from 2 different xml file in one xml file in pentaho

Friends i have 2 get xml data function to get data from 2 different xml containing same number of rows. Now, i want to combine the column from file1 to file 2 or vice versa or even combining all column in a different file. I have tried many ways but…
SuKu
  • 41
  • 6
0
votes
1 answer

Iterate the input step over different number of files in Pentaho

I have a get file names step with a Regular expression that gets 4 csv files. After that I have a text file input step which sets the fields of the csv, and read these files. Once this step is completed a Table output step is executed. The problem…
mrc
  • 2,845
  • 8
  • 39
  • 73
0
votes
1 answer

Pentaho-carte rest api?

I just started with Pentaho data-integration and was able to create transformations/jobs from UI and run them from UI as well as Pentaho DI java SDK. Also, I successfully setup carte server, submitted transformation, and monitored it's status on…
Nitin Bansal
  • 2,986
  • 3
  • 23
  • 30
0
votes
1 answer

How to imitate SQL update on in memory table variables in pentaho

I am new to pentaho. Basically i have a requirement in hierarchy structured data where i select a sub set of hierarchy and i need to do count / sum on lower level nodes then go up level by level doing count + previous sum of count etc. This is all i…
0
votes
2 answers

Problems with text file output in PDI Pentaho

I need help with this trouble. In a transformation I create a text file delimited by semicolons from a table input, but when I look at the file, this shows me empty lines. this is the transform I made: options in text file output And this is the…
0
votes
1 answer

Dynamically generate DDL from text file input fields for table output

Consider the following three files: 1.csv [contains 3 fields: a, b, c] 2.csv [contains 4 fields: d, e, f, g] 3.csv [contains 2 fields: h, i] My assignment is to load all three files to their respective table output. So File "*.csv" ->loads-> Table…
Alan
  • 51
  • 8
0
votes
1 answer

Pentaho Zimbra Web Service (zimbra.xsd not found)

I am trying to implement a database integration on a system that triggers, after a user creation, a account creation on Zimbra service through ZimbraAdminService. The server version is 8.6 On Pentaho Web Service Lookup step, when I fill the URL…
Alexandre Ribeiro
  • 1,384
  • 1
  • 13
  • 19