Questions tagged [pentaho-data-integration]

Tag to be used for Pentaho Data Integration (all versions). Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

Pentaho Data Integration prepares and blends data to create a complete picture of your business that drives actionable insights.

It provides intuitive drag-and-drop data integration coupled with data agnostic connectivity spanning from flat files and RDBMS to Hadoop and beyond.

Features:

  • Graphical extract-transform-load (ETL) designer to simplify the creation of data pipelines
  • Rich library of pre-built components to access, prepare, and blend data from relational sources, big data stores, enterprise applications, and more
  • Powerful orchestration capabilities to coordinate and combine transformations, including notifications and alerts
  • Agile views for modeling and visualizing data on the fly during the data preparation process
  • Integrated enterprise scheduler for coordinating workflows and debugger for testing and tuning job execution
825 questions
0
votes
0 answers

How to extract data from the XML file includes the XSL content?

I have an XML file. In that file some dates are coming from the XSL file. So how can I get those dates. So basically I am using Pentaho and that read XML file. My XML file contains dates which is in another XSL file. So how can I get the dates from…
0
votes
1 answer

How to Skip a Transformation in Pentaho Data Intigration job

I have single Job to run multiple Transformations. I want to parameterize this single job by some parameter that decides if all the transformation has to run or a single transformation has to run based on the name passed. E.g Start --> …
Arpan Saini
  • 4,623
  • 1
  • 42
  • 50
0
votes
1 answer

How can i extract data from the XML using Pentaho when xml tag is repeating?

I am extracting data from the XML . In that i have two duplicate tag with different values . So how can i get this data in different columns ? …
0
votes
0 answers

Pentaho Data Integration Kettle - Why when one record is updated it appears as if two records are updated?

I am using Pentaho Data Integration Kettle and SQL Server and have already made a Data Warehouse, but there is something I don't understand. When I update a record on a dimension it appears as if two records were updated, although it only updated…
KronosL
  • 299
  • 3
  • 4
  • 11
0
votes
1 answer

Pentaho Dummy set not passing data

I am trying to use a Dummy set in latest version of Pentaho to UNION two streams. When I do that the next step always says "unable top find rowset" Am I forgetting something?
ed4becky
  • 1,488
  • 1
  • 17
  • 54
0
votes
1 answer

How to find which user triggered a Pentaho Job from Repository

I have a job running in my Pentaho ETL server but am unable to figure out which user (i.e. username) had triggered the job. The default logging I can see does not seem to give any details on the user that triggered the job. There must be an easy way…
0
votes
3 answers

Sybase connection is idle for a long time

I'm reading data from a table in Sybase using a Table Input step. The query is really simple: SELECT person_ref, displayname FROM person That table has about 2 million rows. I'm connecting to Sybase ASE 12. My user has read-only rights. PDI is…
0
votes
1 answer

Pentaho MapReduce Job throwing error in Hortonworks Enviroment

I am stuck with a strange problem. Pentaho Data integration provides sample Job "word Count Job" in order to understand MapReduce Jobs. I am learning MapReduce and I am really lost with one strange error. Error is : "Caused by: java.io.IOException:…
0
votes
2 answers

Execute row SQL script can't read .sql file. I am getting this error

2018/02/01 11:20:00 - Execute row SQL script.0 - We can not find field [C:\Users\abc\Documents\xyz\query\record.sql] in input stream! My steps involve - get variables - input Execute row SQL script - where I am trying to read my query file text…
P.C
  • 24
  • 6
0
votes
1 answer

Unable to open JAR file, probably deleted: error in opening zip file while running pentaho job through linux

Help me to resolve the error in pentaho kettle, please. This is error log: ERROR: Bundle pentaho-webjars-deployer [10] Unable to get module class path. (java.lang.RuntimeException: Unable to open JAR file, probably deleted: error in opening zip…
0
votes
1 answer

can we use "prompt" function of javascript in pentaho?

can we use "prompt" function of javascript in pentaho? I am getting this error JavaScript - ERROR (version 7.0.0.0-25, build 1 from 2016-11-05 15.35.36 by buildguy) : Couldn't compile javascript: org.mozilla.javascript.EcmaError:…
P.C
  • 24
  • 6
0
votes
1 answer

How to schedule a pentaho transformation/job using crontab

Can anyone please provide me a step by step procedure for scheduling a pentaho job/transformation using crontab.
sree
  • 3
  • 7
0
votes
2 answers

Pentaho spoon error: "Connection refused. Check that the hostname and port are correct and that the postmaster is accepting TCP/IP connections."

I'm new in Pentaho environment. I installed pentaho CE recently on another computer and couldn't preview a simple mongoDB input step result which fetches data from mongodb. It is giving me the following error message: Blockquote …
0
votes
0 answers

Loading fact table with dimension key that are not in stream

I want to load fact table and one of my dimension is not in the stream and I want to store that dimension key to my fact table when my transformation runs. Issue is that dimension key is not present in my stream then how can I compare my key to…
0
votes
1 answer

I want to create transformation in which I want to add multiple inputs table column value into one table in the same column in pentaho

I want to create transformation in which I want to add multiple inputs table column value into one table in the same column in pentaho e.g Two input products table having id and name and one output table with id and column wants to add all values of…
A B
A1 B2