Questions tagged [sharding]

Sharding is a technique of partitioning database tables by row ("horizontally"); typically this technique requires a key to be selected that determines how the rows are to be partitioned.

Sharding is a concept in database design; it refers to the technique of physically partitioning a table or collection by row (also known as horizontal partitioning). To partition the data, a key (or set of keys) must be defined that tells the database engine which partition each record belongs to.
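As a rough illustration of how a key can decide which partition a record belongs to, here is a minimal Python sketch of hash-based routing. It is only one possible strategy (range-based and directory-based schemes also exist), and the names `shard_for`, `partition`, and the `uid` field are hypothetical, chosen for this example rather than taken from any particular database engine.

```python
import hashlib


def shard_for(key: str, num_shards: int) -> int:
    """Map a shard key to a shard index in [0, num_shards).

    A stable hash (MD5 here, used only for distribution, not security)
    is used instead of Python's built-in hash(), which is randomized
    per process for strings.
    """
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


def partition(rows, key_field, num_shards):
    """Split rows (dicts) into num_shards lists based on row[key_field]."""
    shards = [[] for _ in range(num_shards)]
    for row in rows:
        index = shard_for(str(row[key_field]), num_shards)
        shards[index].append(row)
    return shards


# Hypothetical sample data: "uid" plays the role of the shard key.
rows = [{"uid": f"user{i}", "score": i} for i in range(20)]
shards = partition(rows, "uid", 4)
```

Every row lands in exactly one shard, and the same key always maps to the same shard. Note that plain modulo hashing forces most keys to move when the number of shards changes, which is one reason real systems often use chunk ranges or consistent hashing instead.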

1666 questions
0
votes
1 answer

Using a sharded mongo db vs using multiple databases and switch_db function in mongoengine

For my new application, I am planning to implement customized distribution over multiple mongo databases. Will that scale well with mongoengine using switch_db? I am using tornado server. Some more information. (My userids (uid) are generated so…
user303730
  • 384
  • 1
  • 5
  • 15
0
votes
1 answer

MongoDB replicate a sharded setup

I want to get sharding up together with replication. Don't get me wrong, I managed to install a sharded cluster that accesses replication sets as shards. My problem is to get the sharded cluster processes replicated just like this: [ Replication Set…
bash0r
  • 774
  • 6
  • 17
0
votes
1 answer

Replicated MongoDB server slower than simple shards

I have asked this question on serverfault, but I am actually looking for answers from MongoDB configuration perspective. I tried to compare the performance of a sharded database server against a sharded and replicated database server. The sharded…
displayName
  • 13,888
  • 8
  • 60
  • 75
0
votes
1 answer

errmsg" : "exception: getMore: cursor didn't exist on server, possible restart or time out?"

I am trying to run an aggregation pipeline using node.js and mongodb native driver on a sharded mongodb cluster with 2 shards. The mongodb ver. is 2.6.1. The operation runs for about 50 minutes and throws the error 'errmsg" : "exception: getMore:…
0
votes
1 answer

MongoDB with 1B documents, what is the optimal filter to return recently updated documents

I have a production mongo database of over 1B documents in a single collection sharded on _id across multiple servers. I'm trying to replicate recently updated records from this collection into Redshift. Shard…
KevinC
  • 191
  • 1
  • 4
0
votes
4 answers

How to understand "The shards are replica sets."

When I put shard and Replica Set together, I am confused. Why does the reference say that the shards are replica sets? Do replica sets contain shards? Can someone give me a conceptual explanation?
louis
  • 3
  • 2
0
votes
1 answer

Redundant chunk in mongo shard

I am testing mongodb sharding on my local machine. Everything seems to run fine, except that my first chunk is redundant. I initially had a mongod instance running, which had 100,000 entries like this, { "_id" :…
user3848191
  • 103
  • 1
  • 6
0
votes
1 answer

Solr cloud - creating collection using implicit routing

I cannot decide which of the two ways of creating a collection in Solr Cloud I should go for. I want to be able to add/create a shard in an existing collection on the fly so that I can scale up the cluster as and…
Vijay Tiwary
  • 151
  • 10
0
votes
1 answer

Cannot get a covered query for sharded collection in MongoDB

The collection is a sharded collection over the hashed field. The following query should definitely be indexOnly but explain shows otherwise. db.collection.ensureIndex({field : "hashed"}) db.collection.ensureIndex({field : 1, "field2" : 1, "field3" :…
daniel
  • 47
  • 1
  • 6
0
votes
1 answer

Does the number of shards and replicas have any effect on disk size?

My ES nodes are using the default settings. 5 primary shards, replicas = 1. Does changing the settings from 5 to 3 shards and 1 replica have any effect on disk space used or is disk size solely affected by indices and documents? My nodes keep…
AfDev
  • 1,240
  • 19
  • 21
0
votes
0 answers

Error connecting to mongos when trying to create replicated sharded cluster

I'm trying to create a replicated sharded cluster in mongodb. Initially I've created two shards and there is a replica set with three members in each shard. And all the shards and replica sets run on a single machine. I followed…
0
votes
0 answers

Elasticsearch condensed shards?

I am running a single node (15GB RAM, half allocated on the heap) with ES (1.2.1) and had allocated two indexes with 16 shards apiece with no replicas. Health always reported my "cluster" in the yellow due to a lack of replicas, but aside from…
Famous Food Finder
  • 273
  • 1
  • 3
  • 8
0
votes
0 answers

SQLAlchemy ORM for 12 duplicate tables on each of 6 shard databases

The situation: I have a set of 12 tables of the same structure (representing data by month) that exist on each of 6 shards across different databases. I need to get a sample set of data across any of these databases for any given set of months. The…
hello-klol
  • 735
  • 10
  • 20
0
votes
1 answer

Does NuoDB ensure that all read operations get the latest/updated data?

I want to install NuoDB in a cloud (multiple Amazon EC2 instances). Before that I need to know one thing: does NuoDB ensure data consistency? I am not a DB expert so you can treat this part as a separate point of this question or explanation of…
user3011768
  • 191
  • 2
  • 3
  • 11
0
votes
0 answers

Query Result Mismatch in MongoDB (Sharded Environment)

What I am doing is: 1) Created a new sharded database. 2) Created two new sharded collections (source and target) and dumped data into them. 3) Then I run two map-reduce jobs on the two sharded collections, one after the other, and put the result of both…
Bhavesh Shah
  • 3,299
  • 11
  • 49
  • 73