Questions tagged [large-data-volumes]

302 questions
2
votes
1 answer

How to efficiently filter objects out of an (initially) large list of objects

I need to filter a large list of complex (20+ properties) objects into multiple sub lists. To create the sub-lists, I have a list of filter specifications. Requirements are: a) An item is not allowed to be part of two sub lists and b) it must be…
Björn Boxstart
  • 1,098
  • 1
  • 12
  • 25
2
votes
3 answers

How to split data over MySQL tables

I have a website with members who message each other. There are getting to be a few members and they like to send messages - I'm sure you can see where this is going. Currently I have said messages stored in a nicely relational table cunningly…
da5id
  • 9,100
  • 9
  • 39
  • 53
2
votes
3 answers

Getting started with massive data

I'm a mathematician and occasionally do some statistics/machine learning analysis consulting projects on the side. The data I have access to are usually on the smaller side, at most a couple hundred of megabytes (and almost always far less), but I…
Max
  • 21
  • 1
2
votes
3 answers

Performing Aggregate Functions on Multi-Million Row Tables

I'm having some serious performance issues with a multi-million row table that I feel I should be able to get results from fairly quick. Here's a run down of what I have, how I'm querying it, and how long it's taking: I'm running SQL Server 2008…
Dan Short
  • 9,598
  • 2
  • 28
  • 53
2
votes
4 answers

Graphing large amounts of data

In a product I work on, there is an iteration loop which can have anywhere between a few hundred to a few million iterations. Each iteration computes a set of statistic variables (double precision), and the number of variables can be up to 1000…
Walter Williams
2
votes
1 answer

How to efficiently store a large but EXTREEMLY sparse amount of data in a 3D representation?

I ran into this problem in the code I an working on: I need to be able to store a large amount of data in RAM, and be able to access it using three integer i,j,k (similar to x,y,z coordinates). The problem is that if I try to use a 3D array, then…
Vladimir
  • 225
  • 1
  • 11
2
votes
4 answers

Avoid an "out of memory error" in Java(eclipse), when using large data structure?

OK, so I am writing a program that unfortunately needs to use a huge data structure to complete its work, but it is failing with a "out of memory error" during its initialization. While I understand entirely what that means and why it is a problem,…
gnomed
  • 5,483
  • 2
  • 26
  • 28
2
votes
1 answer

Displaying large sorted lists in WPF QUICKLY?

I am developing a program that should be able to display long (up to 500 items) lists of data that need to be resorted when their contents change. Essentially, I have a viewmodel with an observable collection that contains classes with observable…
bluebit
  • 2,987
  • 7
  • 34
  • 42
2
votes
4 answers

Efficient File I/O and Conversion of Strings to Floats

I have some gigantic (several gigabyte) ASCII text files that I need to read in line-by-line, convert certain columns to floating point, and do a few simple operations on these numbers. It's pretty straightforward stuff, except that I'm thinking…
dsimcha
  • 67,514
  • 53
  • 213
  • 334
2
votes
6 answers

Read from one large file and write to many (tens, hundreds, or thousands) files in Java?

I have a large-ish file (4-5 GB compressed) of small messages that I wish to parse into approximately 6,000 files by message type. Messages are small; anywhere from 5 to 50 bytes depending on the type. Each message starts with a fixed-size type…
Rudiger
  • 6,634
  • 9
  • 40
  • 57
2
votes
1 answer

MySQL - Handling 200M+ records per day

I have a MySQL table that will need to store about 200 million records on a daily basis. The data comes in flat files that I will be bulk uploading using "LOAD DATA INFILE". Most of the data elements are IDs, so not much text. One flat file for a…
user333746
  • 2,605
  • 3
  • 18
  • 11
2
votes
4 answers

Select Count(*) over large amount of data

I want to do this for a Report but i have 20,000,000 of records in my table and it causes an TimeOut in my application. SELECT T.transactionStatusID, TS.shortName AS TransactionStatusDefShortName, count(*) AS qtyTransactions …
jmpena
  • 1,399
  • 6
  • 19
  • 26
2
votes
5 answers

how to handle large lists of data

We have a part of an application where, say, 20% of the time it needs to read in a huge amount of data that exceeds memory limits. While we can increase memory limits, we hesitate to do so to since it requires having a high allocation when most…
Edward Q. Bridges
  • 16,712
  • 8
  • 35
  • 42
2
votes
1 answer

serialize list of huge composite graphs using protobuf-net causing out-of-memory-exception

I am trying to serialize an object containing a list of very large composite object graphs (~200000 nodes or more) using Protobuf-net. Basically what I want to achieve is to save the complete object into a single file as fast and as compact as…
mda
  • 53
  • 5
2
votes
5 answers

Is there a way to maintain a 200MB immutable data structure in memory and access it from a script?

I have a list of 9 million IPs and, with a set of hash tables, I can make a constant-time function that returns if a particular IP is in that list. Can I do it in PHP? If so, how?
konr
  • 1,173
  • 12
  • 26