
We are working with MarkLogic 9.0.8.2.

We are setting up a MarkLogic cluster (3 VMs) on Azure and, as per our failover design, want to have 3 forests (one per node) in Azure Blob storage.

I have finished the setup, and when I started ingestion I found that the documents are distributed across the 3 forests rather than all being stored in each forest.

For example, I ingested 30000 records and each forest contains 10000 records.

What I need is for every forest to contain all 30000 records.

Is there any configuration (at the database or forest level) that I need in order to achieve this?

Manish Joisar

1 Answer


MarkLogic failover does not work the same way as failover in some other NoSQL document databases, which may keep a copy of every document on each host.

The clustered nature of MarkLogic distributes the documents across the hosts to provide a balance of availability and resource consumption. For failover protection, you must create additional forests on each host and attach them to your existing forests as replicas. This ensures availability should any 1 of the 3 hosts fail.

Here is a sample forest layout:

Host 1:    primary_forest_01     replica_forest_03
Host 2:    primary_forest_02     replica_forest_01
Host 3:    primary_forest_03     replica_forest_02

The replica forest must be on a different host than the primary forest, and if there are multiple forests per host, they should be striped across hosts to best balance out resource consumption when failed over.
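To make the striping rule concrete, here is a minimal planning sketch in Python. It does not call any MarkLogic API; it only works out which host each replica should live on. The function name `plan_failover_layout` and the `forests_per_host` parameter are hypothetical, and the forest names simply follow the naming used in the sample layout above.

```python
def plan_failover_layout(hosts, forests_per_host=1):
    """Plan primary/replica forest placement (illustrative helper, not a MarkLogic API).

    Returns {host: {"primary": [...], "replica": [...]}} such that every
    replica sits on a different host than its primary, and the replicas of
    one host's forests are spread over the remaining hosts.
    """
    n = len(hosts)
    if n < 2:
        raise ValueError("failover needs at least two hosts")

    layout = {host: {"primary": [], "replica": []} for host in hosts}
    forest_no = 0
    for host_idx, host in enumerate(hosts):
        for k in range(forests_per_host):
            forest_no += 1
            # The offset is always in 1..n-1, so the replica never lands
            # on the primary's own host; varying it with k stripes the
            # replicas of one host across the other hosts.
            offset = 1 + (k % (n - 1))
            replica_host = hosts[(host_idx + offset) % n]
            layout[host]["primary"].append(f"primary_forest_{forest_no:02d}")
            layout[replica_host]["replica"].append(f"replica_forest_{forest_no:02d}")
    return layout


if __name__ == "__main__":
    for host, forests in plan_failover_layout(["Host 1", "Host 2", "Host 3"]).items():
        print(host, forests["primary"], forests["replica"])
```

With three hosts and one forest per host, this reproduces the sample layout above. The forests themselves still have to be created and attached as replicas in MarkLogic, for example through the Admin Interface.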

It's also important to note that for HA, you need replicas configured for the system databases as well.

So there is no database setting that puts all the documents on every host, because that is not the way MarkLogic is designed to work. The Scalability, Availability and Failover Guide is very informative, and in this case the High Availability of Data Nodes with Failover section is particularly relevant. I also highly recommend checking out the free training that MarkLogic offers.

grtjn
Mike Gardner
  • Hello Michael, can we use the above approach on Azure with Blob storage configured as suggested on page 28 of this guide? https://docs.marklogic.com/guide/azure.pdf – Manish Joisar May 20 '19 at 14:53
  • Thanks Michael, I have created a separate post, https://stackoverflow.com/questions/56227457/marklogic-failover-cluster-on-azure-forest-configuration-on-azure-blob, can you please have a look? – Manish Joisar May 20 '19 at 20:04
  • Hello Michael, can you please have a look? – Manish Joisar May 21 '19 at 20:32