Questions tagged [datastax]

DataStax Enterprise is a big data platform built on open source tools such as Apache Cassandra, Solr, and Hadoop (and several others).

DataStax Astra simplifies cloud-native application development and reduces the time to deploy and scale Apache Cassandra from days to minutes. DataStax Astra eliminates the operational overhead of using Apache Cassandra, the open-source NoSQL database behind the largest applications in the world. Learn more at datastax.com/dev about the new REST, schemaless document-style JSON, and GraphQL APIs for Cassandra, available with the Astra cloud service and, for standalone Cassandra, through stargate.io.

For those who prefer self-managed technology, DataStax Enterprise (DSE) is a big data platform that provides a version of Cassandra enhanced with enterprise security features and that adds support for more workload types such as graph, search, and analytics. DataStax Enterprise enables companies to build transformational data architectures for applications, microservices and experiences that require data sovereignty, availability, scale, agility, and accessibility by any user.

2409 questions
7
votes
1 answer

How to enable streaming from Cassandra to Spark?

I have the following spark job: from __future__ import print_function import os import sys import time from random import random from operator import add from pyspark.streaming import StreamingContext from pyspark import SparkContext,SparkConf from…
HackCode
  • 1,837
  • 6
  • 35
  • 66
7
votes
1 answer

Cannot connect java client to cassandra with password authentication enabled

I have a default install of Datastax enterprise on my macbook. I was able to create my keyspace and setup all my applications including using solr. I am trying to develop a set of steps to turn on password authentication for our dev cluster. Thus…
Greg L.
  • 286
  • 2
  • 4
  • 9
7
votes
1 answer

What does "nodetool compact" do for DateTieredCompactionStrategy?

The documentation for nodetool compact says: This command starts the compaction process on tables that use the SizeTieredCompactionStrategy and DateTieredCompactionStrategy. You can specify a keyspace for compaction. But what it does for…
piotrwest
  • 2,098
  • 23
  • 35
7
votes
1 answer

Is it possible to submit a CQL script to a cassandra cluster via the datastax driver?

Simply put, I have a cql script I would like to submit to a cluster via the cassandra Session, without having to go through cqlsh. I've looked through their API and I don't see anything to take in a CQL script, or a way to parse a script into…
adapt-dev
  • 1,608
  • 1
  • 19
  • 30
7
votes
1 answer

Are Cassandra user defined data types recommended in view of performance?

I have a Cassandra Customers table which is going to keep a list of customers. Every customer has an address which is a list of standard fields: { CustomerName: "", etc..., Address: { street: "", city: "", …
Milen Kovachev
  • 5,111
  • 6
  • 41
  • 58
7
votes
3 answers

Cassandra LOCAL_QUORUM

I'm having trouble understanding / finding information about how various quorums are calculated in cassandra. Let's say I have a 16 node cluster using Network Topology Strategy across 2 data centers. The replication factor is 2 in each datacenter…
JDesuv
  • 1,034
  • 2
  • 9
  • 19
7
votes
1 answer

Cassandra: Long Par New GC Pauses when Bootstrapping new nodes to cluster

I've seen an issue that happens fairly often when bootstrapping new nodes to a Datastax Enterprise Cassandra cluster (ver: 2.0.10.71) When starting the new node to be bootstrapped, the bootstrap process starts to stream data from other nodes in the…
petecheslock
  • 161
  • 6
7
votes
2 answers

Error in accessing cassandra from spark in java: Unable to import CassandraJavaUtil

I am using the below blog in configuring to access Cassandra from apache…
Anand Sai Krishna
  • 319
  • 1
  • 2
  • 11
7
votes
2 answers

Cannot achieve consistency level ONE: info={ 'required_replicas': 1, 'alive_replicas': 0, 'consistency': 1}

It looks like keyspace replication is not happening correctly in my Cassandra setup, I need some ideas in troubleshooting this. I have configured multi datacenter cluster but to begin with I have set the keyspace to use SimpleStrategy with RF 3. …
vrtx54234
  • 2,196
  • 3
  • 30
  • 53
7
votes
1 answer

Cassandra read timeout

I am pulling big amount of data from cassandra 2.0, but unfortunately getting timeout exception. My table: CREATE KEYSPACE StatisticsKeyspace WITH REPLICATION = { 'class' : 'SimpleStrategy', 'replication_factor' : 3 }; CREATE TABLE…
6
votes
0 answers

Datastax transport exception error writing

I have a java API application which connects to cassandra instance. My cassandra instance is hosted on google cloud platform. When I run the program in my IDE, it works fine and is able to connect to the cassandra instance. However when I deploy the…
zakSyed
  • 1,364
  • 1
  • 13
  • 30
6
votes
3 answers

Cassandra python driver: Client request timeout

I setup a simple script to insert a new record into a Cassandra database. It works fine on my local machine, but I am getting timeout errors from the client when I moved the database to a remote machine. How do I properly set the timeout for this…
Mark Jones
  • 147
  • 2
  • 13
6
votes
1 answer

Cassandra java driver set global consistency level

So in the datastax doc, it states that ConsistencyLevel can be set globally through QueryOptions: QueryOptions qo = new QueryOptions().setConsistencyLevel(ConsistencyLevel.ALL); I noticed that this is not a static method and returns an instance of…
Quan Ding
  • 727
  • 1
  • 10
  • 21
6
votes
1 answer

com.datastax.driver.core.exceptions.BusyPoolException

Whenever I insert data in table in Cassandra, more than 1000 and fetching the data by id, it throws the following exception: com.datastax.driver.core.exceptions.NoHostAvailableException: All host(s) tried for query failed (tried:…
Mahesh Chand
  • 3,158
  • 19
  • 37
6
votes
2 answers

Achieving zero downtime Cassandra/DataStax migrations

I've got a Cassandra cluster (3 nodes, all nodes deployed to AWS) that I am trying to migrate over to a DataStax cluster. It's simply time to stop managing these nodes myself. I have multiple producers and consumers all reading/writing data, all day…
smeeb
  • 27,777
  • 57
  • 250
  • 447