9

I am looking for a Dataflow / Concurrent Programming API for Java.
I know there's DataRush, but it's not free. What I'm interested in specifically is multicore data processing, and not distributed, which rules out MapReduce or Hadoop.
Any thoughts?
Thanks, Rollo

Yishai
  • 90,445
  • 31
  • 189
  • 263
Rollo Tomazzi
  • 3,120
  • 3
  • 28
  • 21

4 Answers4

7

You might try gpars; it apparently has implementations of data flow variables and streams in Java even though it is geared towards providing concurrent programming goodies for Groovy.

sirolf2009
  • 819
  • 8
  • 15
Cagatay
  • 1,372
  • 1
  • 12
  • 16
1

Might try the upcoming fork/join library which will (hopefully) be in Java 7 as part of the JSR 166y update.

Main project page: - http://gee.cs.oswego.edu/dl/concurrency-interest/index.html

Pointers to lots of links about what it is: - http://tech.puredanger.com/java7#jsr166

Alex Miller
  • 69,183
  • 25
  • 122
  • 167
0

Does the built in Java concurrent package meet your needs? It's a very nice package, built in ThreadPools, CopyOnWriteCollections, Executors, Future. We use it to process large volumns of data in thread pools.

Steve K
  • 19,408
  • 6
  • 52
  • 50
0

https://github.com/rfqu/df4j is simple but powerful dataflow library. If it lacks some desired features, they can be added easly. It can exploit java.concurrent.ExecutorService.

Alexei
  • 11
  • 1