Questions tagged [cascading]

Cascading is a Query API, Query Planner, and Process Scheduler used for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster.

Cascading is a thin Java library that sits on top of Hadoop's MapReduce layer and is executed from the command line like any other Hadoop application. It is not a new text-based query syntax (like Pig) or another complex system that must be installed on a cluster and maintained (like Hive), though Cascading is both complementary to and a valid alternative to either application.

Cascading lets the developer quickly assemble complex distributed data-processing applications without having to "think" in MapReduce, and efficiently schedule them based on their dependencies. Simple data-processing applications are supported as well, since complex applications tend to start simple.

Cascading is Open Source and dual licensed under the GPL and OEM/Commercial Licenses. OEM/Commercial Licenses and Developer Support can be obtained through Concurrent, Inc.

Cascading has a strong community of users and contributors; see the Cascading modules page for related projects and extensions.

Cascading, its extensions, and related libraries are also hosted in the Conjars Maven repository maintained by Concurrent, Inc. The repository is open to the public.

[Figure: Cascading application-stack overview]

364 questions
0
votes
1 answer

How to read first line of files in a directory using java cascading?

I'm working on a project to learn cascading, and I'm stumped on this problem. Cascading doesn't seem to have anything to read the first line of each individual file in a directory, which I need to do in order to discover the content type from text…
CalebJ
  • 159
  • 1
  • 10
0
votes
1 answer

jQuery Chosen with cascading selects not working

I have 3 select boxes that are being populated from a database. I then use this javascript to make them cascade (dependent on selection in parent select box): function cascadeSelect(parent, child){ var childOptions =…
user2559175
0
votes
1 answer

Kendo UI Cascading DropDownList in MVVM does not bind data with auto selection

I'm going mad with this problem. The cascading dropdown does not bind data if the parent is selected and the children are auto-selected after that. Here is my jsbin link. Can anyone please help? http://jsbin.com/ujorer/253/edit
0
votes
1 answer

cascading-parquet throwing ClassNotFoundException

Trying to run this code Main.java: public static void main(String[] args) { ... .. Properties properties = new Properties(); AppProps.setApplicationJarClass(properties, Main.class); HadoopFlowConnector flowConnector = new…
hdmi3killer
  • 89
  • 1
  • 1
  • 12
0
votes
2 answers

Javafx Cascading dropdown based on selection

I am migrating from Swing to JavaFX. Can anyone help with a link or code snippet on how to cascade combo boxes based on parent-child selection in JavaFX, e.g. country-state, branch-department-unit?
Abiodun Osinaike
  • 155
  • 1
  • 4
  • 14
0
votes
1 answer

Unable to get this Hadoop / Cascading code to run without exception? New to Both

Can someone explain why this code is throwing an exception when I attempt to run the job on local file system? I am new to Hadoop/Cascading and am trying to get multiple columns per line and count the frequency of the values in those columns…
Chris Manning
  • 499
  • 5
  • 14
0
votes
1 answer

Has anyone performed joins using 3 tables in cascading

Cascading is one of the best frameworks for dealing with Hadoop jobs using simple data flows. Has anyone tried performing joins using 3 tables in Cascading? If so, can you please share your ideas regarding joins in Cascading and how to perform join on…
Nagendra kumar
  • 215
  • 1
  • 2
  • 14
0
votes
2 answers

Cascading join two files very slow

I am using Cascading to do a HashJoin of two 300MB files. I use the following Cascading workflow: // select the field which I need from the first file Fields f1 = new Fields("id_1"); docPipe1 = new Each( docPipe1, scrubArguments, new ScrubFunction( f1…
user3019299
  • 189
  • 1
  • 5
  • 19
0
votes
0 answers

Insert UTF-8 value in mysql using Cascading

I am using Cascading 1.2. I am trying to insert data into MySQL using JDBCTap. Some of the values are in UTF-8 and others are not. The UTF-8 values are being inserted into MySQL as "???". I have…
user2664210
  • 145
  • 2
  • 13
0
votes
2 answers

How Cascading TextDelimited the log file

I am following the Cascading guide on its website. I have the following TSV format input: doc_id text doc01 A rain shadow is a dry area on the lee back side of a mountainous area. doc02 This sinking, dry air produces a rain shadow, or area…
user2597504
  • 1,503
  • 3
  • 23
  • 32
0
votes
2 answers

Cascading tutorial word count example error

I am learning Cascading. I am now looking at the second tutorial on its official website, which is the Word Count example. I copied the code from it and tried to run it, but it always gives me the following errors: Exception in thread "main"…
user2597504
  • 1,503
  • 3
  • 23
  • 32
0
votes
3 answers

Cascading 2.1.6 + hadoop 1.0.4 Error

Hi, I am new to Cascading and am following the book "Enterprise Data Workflows with Cascading". I checked with Hadoop 1.0.4 and downloaded Cascading 2.1.6. I set everything up in the NetBeans IDE with all the jar files. Code: package…
nadia
  • 43
  • 7
0
votes
1 answer

Good Tutorials on Hadoop Cascading

Could anybody point me to some simple examples, links, and books for learning Hadoop Cascading? I couldn't find any good resources except the Cascading site http://www.cascading.org/
maali
  • 134
  • 3
  • 14
0
votes
1 answer

Is there a way to detect the objects affected by a JPA cascade persist call?

Say you have an Entity called ContainerType1 and a child called Thing and you have a one to many relationship between ContainerType1 and Thing. The ContainerType1 has a set of Thing. And say it's not a 2 way relationship, so Thing is not aware of…
0
votes
1 answer

How to handle exception in MapReduceFlow of Cascading

I have written a cascade flow which executes a MapReduce flow containing both a Mapper and a Reducer. The reduce() method throws an IllegalArgumentException. How do I handle this exception? I have written a catch block in the class where I created the JobConf for the…
user2664210
  • 145
  • 2
  • 13