c++ sparse vector

Question

Question 1)

I have a large sparse vector of doubles in c++, I need to efficiently parse out the indices of the non zero elements from the vector. I can obviously loop over the length and do it, any better way to do it?

I mention it right, its trivial to loop over it and solve. is there a better way ? — ganesh reddy, Feb 17 '13 at 17:52
There's no standard sparse vector in C++. It is unclear what you mean. Do you have some data structure (which, and what is meant by "parsing out")? A data file (in which format, and what kind of data structure you want to create?) Something else? — n. m. could be an AI, Feb 17 '13 at 18:02
sorry sparse means that the vector say is 200 long, but mostly is 0's — ganesh reddy, Feb 17 '13 at 18:04

Benjamin Lindley · Answer 1 · 2013-02-17T18:10:29.850

3

Unless you have some special knowledge of the makeup of the vector of doubles, (for example, it's sorted), a loop over its entirety is the most efficient you're gonna get.

Of course, a change in structure as suggested by eladidan is probably something you should consider.

edited Feb 17 '13 at 18:10

answered Feb 17 '13 at 17:56

Benjamin Lindley

101,917
9
204
274

eladidan · Answer 2 · 2013-02-17T18:16:51.007

I have a large sparse vector of doubles in c++, I need to efficiently parse out the indices of the non zero elements from the vector. I can obviously loop over the length and do it, any better way to do it?

If the vector is truly sparse (n = o(N) where n is the number of non-zero elements and N is the size of the vector), then representing it in an std::map<int,double> or std::unordered_map<int,double> is probably best. With std::mapway you get to find an element in O(log(n)) . With std::unordered_map a find operation takes amortized time of O(1). In both cases, the number of non-zero elements is simply the size of the container. Both approaches also take O(n) space instead of O(N).

score 0 · Answer 3 · edited May 23 '17 at 11:56

0

If you cannot change the representation of your data, you have to examine each element to filter out those that are almost equal to zero. However, this task is embarrassingly parallel, so maybe you can partition the workload to a bunch of threads to at least improve the runtime (although not the complexity).

edited May 23 '17 at 11:56

Community

1
1

answered Feb 17 '13 at 18:21

bitmask

32,434
14
99
159

c++ sparse vector

3 Answers3