Questions tagged [datastax]

DataStax Enterprise is big data platform consisting of open source tools such as Apache, Cassandra, Solr and Hadoop (and several others).

DataStax Astra simplifies cloud-native application development, and reduces time to deploy Apache Cassandra and scale from days to minutes. DataStax Astra eliminates the operational overhead of using Apache Cassandra, the open-source NoSQL database behind the largest applications in the world. Learn more on datastax.com/dev about new APIs for REST, schemaless document-style JSON, and GraphQL APIs for Cassandra - available with the Astra cloud service and stargate.io for standalone Cassandra.

For those who prefer self-managed technology, DataStax Enterprise (DSE) is a big data platform that provides a version of Cassandra enhanced with enterprise security features and that adds support for more workload types such as graph, search, and analytics. DataStax Enterprise enables companies to build transformational data architectures for applications, microservices and experiences that require data sovereignty, availability, scale, agility, and accessibility by any user.

2409 questions
11
votes
3 answers

Datastax cassandra-driver (python) failed import

When importing datastax cassandra-driver(python) get the following error Error File "cassandra.py", line 1, in from cassandra.cluster import Cluster File "/home/vagrant/cassandra.py", line 1, in from cassandra.cluster…
jereques
  • 113
  • 1
  • 1
  • 5
10
votes
2 answers

Cassandra batch query vs single insert performance

I use Cassandra java driver. I receive 150k requests per second, which I insert to 8 tables having different partition keys. My question is which is a better way: batch inserting to these tables inserting one by one. I am asking this question…
Prakash P
  • 3,582
  • 4
  • 38
  • 66
10
votes
2 answers

Is it necessary to submit spark application jar?

As stated in title I'm wondering if is it necessary to spark-submit *.jar? I'm using Datastax Enterprise Cassandra for a while, but now I need to use Spark too. I watched almost all videos from DS320: DataStax Enterprise Analytics with Apache Spark…
Marcin Lagowski
  • 628
  • 2
  • 12
  • 26
10
votes
3 answers

Get a BigInteger attribute from Cassandra ResultSet

I'm trying to get the number of key value pairs in a Cassandra column family. Following is the code I used. PreparedStatement statement = client.session .prepare("select count(*) from corpus.word_usage"); ResultSet results =…
Chamila Wijayarathna
  • 1,815
  • 5
  • 30
  • 54
10
votes
2 answers

Why swap needs to be turned off in Datastax Cassandra?

I am new to Datastax cassandra. While going through the installation procedure of cassandra. It is recommended that the swap area of OS should be turned off. Does anyone provide the reason for that? Will it affect any OS level operations ?
Sathishkumar
  • 109
  • 2
  • 10
9
votes
2 answers

java.lang.NoClassDefFoundError: ch/qos/logback/core/joran/spi/JoranException while connecting Cassandra DB

I was trying out a simple connection to my Cassandra instance through Java. I made a 'demo' keyspace to cqlsh and created a table in the java program. The code is below: Jars Used: slf4j.api-1.6.1 cassandra-all-2.1.2 public class…
Anand Kumar
  • 389
  • 1
  • 7
  • 21
9
votes
1 answer

Cassandra Error message: Not marking nodes down due to local pause. Why?

I have 6 nodes, 1 Solr, 5 Spark nodes, using datastax. My cluster is on a similar server to Amazon EC2, with EBS volume. Each node has 3 EBS volumes, which compose a logical data disk using LVM. In my OPS center the same node frequently becomes…
peter
  • 674
  • 1
  • 12
  • 33
9
votes
1 answer

Pandas and Cassandra: numpy array format incompatibility

I'm using the Python cassandra driver to connect and query our Cassandra cluster. I want to manipulate my data via Pandas, there is an area in the documentation for the cassandra driver that mentions this…
joao
  • 292
  • 2
  • 7
9
votes
2 answers

How to configure access permissions for Cassandra on Linux Ubuntu

Thank-you for reading this. I am stuck at step three on this tutorial pertaining to installing Cassandra: http://wiki.apache.org/cassandra/GettingStarted#Step_3:_Start_Cassandra I can only run this software as root. (shouting this over fictional…
user3808269
  • 1,321
  • 3
  • 21
  • 40
9
votes
2 answers

Cassandra timeout during read query at consistency ONE (1 responses were required but only 0 replica responded)

I am doing read and update queries on a table having 500000 rows and some times getting below error after processing around 300000 rows, even when no node is down. Cassandra timeout during read query at consistency ONE (1 responses were required…
9
votes
2 answers

How to prevent Cassandra commit logs filling up disk space

I'm running a two node Datastax AMI cluster on AWS. Yesterday, Cassandra started refusing connections from everything. The system logs showed nothing. After a lot of tinkering, I discovered that the commit logs had filled up all the disk space on…
plamb
  • 5,636
  • 1
  • 18
  • 31
9
votes
1 answer

How to get pass "requires authentication" while connecting to remote Cassandra cluster using SparkConf?

I am trying to use apache spark and cassandra for the data analysis. So i wrote a java code to access the cassandra which is running on a remote machine. I used the following java code. public class JavaDemo implements Serializable { private…
Justin
  • 735
  • 1
  • 15
  • 32
9
votes
2 answers

Cassandra Datastax Enterprise using Amazon Elastic IP

I would like to run Cassandra Datastax Enterprise on Amazon EC2 instances using not the node private IP address but Elastic Public IPs My current configuration looks as follows: /etc/dse/cassandra/cassandra.yaml seed_provider: - class_name:…
8
votes
2 answers

In cassandra when to use decimal Vs float/double?

I am using apache cassandra 3.x version. I am bit confused regarding when should I use decimal vs float types? Is there any specific use-cases/differences when should go for float or avoid decimal and vice-versa? I have gone through some quick…
BdEngineer
  • 2,929
  • 4
  • 49
  • 85
8
votes
1 answer

What would be the defined behavior of this cassandra query with nonaggregate columns?

Suppose I had this table. CREATE TABLE keyspace.user_event ( user_name varchar, user_email varchar, event_type int, event_time timestamp, a varchar, b varchar, c varchar PRIMARY KEY ((user_name, user_email), event_type,…
onesiumus
  • 279
  • 6
  • 26