Questions tagged [marklogic-dhf]

Use for questions about the MarkLogic Data Hub Framework. More people will see your question if you also tag it with "marklogic".

The MarkLogic Data Hub Framework is a data integration framework and tool-set to quickly and efficiently integrate data from many sources into a single MarkLogic database, and expose that data.

The MarkLogic Data Hub Framework lives at https://github.com/marklogic/marklogic-data-hub

190 questions
0
votes
1 answer

MarkLogic Smart Mastering not returning matches

I am using smart mastering framework based on Data Hub framework. I am running into an issue when executing smart mastering. I do get a match result back, but it does not contain the documents which were matched. When I run: import module namespace…
X2015jpp
  • 143
  • 2
  • 14
0
votes
0 answers

Marklogic DataHub Harmonization flow failure

I have a java application thats using the marklogic datahub API. I am running a flow on two separate entities. During the harmonization process, one entity passes fine while the other fails. This just seemed to start happening out of the blue as it…
X2015jpp
  • 143
  • 2
  • 14
0
votes
1 answer

Marklogic server version downgrade?

I currently have Marklogic server version 9.0-12 installed and running. A project I am working on requires me to have version 9.0-8 installed, as the data hub version is incompatible with the current version I have. Is there a way I can downgrade…
X2015jpp
  • 143
  • 2
  • 14
0
votes
1 answer

Is XML or JSON in MarkLogic Data Hub preferred?

We are starting a new data hub project and can freely choose between XML and JSON. What would you prefer? We are ingesting documents into the data hub from SQL databases and CSV files using Nifi. We use the Quickstart application. We have XML and…
Andreas H
  • 77
  • 7
0
votes
2 answers

Installing DHF on MarkLogic Docker

Now I am trying to install latest DHF on top of MarkLogic Docker instance (on Docker Mac). I could successfully run MarkLogic 10 docker instance based on the image provided on Docker Hub. The problem occur when I try to install the latest DHF. I…
user8420733
0
votes
1 answer

MarkLogic DataHub V5 How do I ristrict the number of rows coming back in custom flow

I have a collection (ABR) with 1.5 million documents I want to create a custom flow to process these documents; however, in the 1st instance, I only want the flow to process 10 documents so I can test and assess my custom code. In the past (prior…
0
votes
1 answer

MarkLogic - Design suggestion for efficient Batch processing

MarkLogic version 9.0-6.1 We have implemented two patterns for batch ingestion. Pattern 1 : MLCP Pattern 2: Informatica(or NiFi) reading an NDJSON file and making MarkLogic REST API PUT calls for each JSON in the NDJSON file Our production box is a…
Bhanu
  • 427
  • 2
  • 8
0
votes
2 answers

NiFi - Salesforce JDBC connection approach

I am trying to connect a Nifi QueryDatabaseTable processor to Salesforce in order to retrieve data to load into a Marklogic data hub. I found a couple of shareware JDBC drivers for Salesforce but have not been able to successfully connect to the SF…
Paulwms
  • 95
  • 8
0
votes
1 answer

Data hub framework version upgrade to latest supported version from 2.0.6 to 4.3.1

I am upgrading to Data hub framework 4.3.1 from 2.0.6 on MarkLogic server 9.0-6.2 (localhost). Build got successful using mlDeploy but at the same time i am encountering an issue: Error occurred while loading modules; host: localhost; port: 8010;…
Manish
  • 59
  • 3
0
votes
0 answers

Marklogic Data Hub Framework 2.x Deploy Failure

mlDeploy failure while trying to install data hub framework using gradle I'm trying to install DHF 2.0.6 over MarkLogic 9.0-5 server (community edition) on Windows 10. I'm able to perform "gradlew -PenvironmentName=local hubInit -i" but "gradle…
haritrichy
  • 107
  • 9
0
votes
2 answers

About Schedule taks

As am new to the ML Datahubframework,exploring the functionalities and services provided by the datahubframework. Has seen the gradle command to create service extensions and transforms. Haven't seen any gradle command to create the schedule tasks…
0
votes
1 answer

Facing an error while harmonizing the data

I am new to MLDataHubFramework,started to create a new entity flow by following the marklogic documentation. 1. Created Customer entity and ingested to staging database. 2. While trying to perform the harmonization am getting the below issue. …
0
votes
1 answer

where to customize default mapping code in Marklogic Data Hub 5?

I am using Data Hub version 5. In my case a single record is distributed into 5 entities like one entity has the complete address info, one has the telephone info, one has the qualification info etc.. Each entity has a unique ID which will be used…
Dixit Singla
  • 2,540
  • 3
  • 24
  • 39
0
votes
1 answer

How to use obfuscated passwords with Marklogic Datahub Framework

I'm trying to avoid having plain-text passwords in the server's configuration files. Being so, I want to use the gradle-credentials (https://github.com/etiennestuder/gradle-credentials-plugin) plug-in in order to obfuscate their value. I already…
capl
  • 1
  • 2
0
votes
1 answer

MarkLogic - Transaction size in harmonize flow

MarkLogic version : 9.0-6.2 We use -batch_size and -transaction_size in mlcp. We are able to use -Pbatchsize in harmonize flow, but could not find an equivalent parameter for transaction size. When would harmonize flow apply commit? For every…
Bhanu
  • 427
  • 2
  • 8