Cassandra : isolated workloads

Question

I have three workloads.

DATACENTER1 sharing data by rest services - streaming ingest
DATACENTER2 load bulk - analysis
DATACENTER3 research

I want to isolated workloads, i am going to create one datacenter foreach workloads. The objective of the operation is to prevent a heavy process from consuming all the resources and gurantee hight availablity data.

Is anyone already trying this ?

During a loadbulk on datacenter2, is data availability good on datacenter1 ?

Will your datacenters sync the data? – Imran Saeed Mar 28 '17 at 15:38 — Imran Saeed, Mar 28 '17 at 15:38
YES, i will sync data all datacenter. – V.HL Mar 28 '17 at 15:39 — V.HL, Mar 28 '17 at 15:39

Imran Saeed · Accepted Answer · 2017-03-30T07:10:53.740

Short answer is that workload won't cause disruption of load across datacenter. How it works is as follows:

Conceptually when you create a Keyspace, Cassandra creates a Virtual Data Center (VDC). Nodes with similar workloads must be assigned to same VDC. Segregating workload will ensure that only (exactly) one workload is ever executed at a VDC. As long as you follow this pattern, it works.

Data sync needs to be monitored under load on busy nodes but thats a normal concern on any Cassandra deployment.

Datastax Enterprise also support this model as can be seen from: https://docs.datastax.com/en/datastax_enterprise/4.6/datastax_enterprise/deploy/deployWkLdSep.html#deployWkLdSep__srchWkLdSegreg

Cassandra : isolated workloads

1 Answers1