Questions tagged [kite-dataset]

Used for dataset questions related to the Kite Software Development Kit.

4 questions
2
votes
1 answer

kite-dataset option for specifying the Hive database

I have to create a movies table in the movielens database, instead of the default Hive database, using kite-dataset. I have already created the movielens database in Hive manually. Please advise which option I need to use with the command below. $…
SriniDK
  • 69
  • 5
1
vote
1 answer

How to avoid a MapReduce OutOfMemory Java heap space error while using kite-dataset to import data?

On my Hortonworks HDP 2.6 cluster, I'm using the kite-dataset tool to import data: ./kite-dataset -v csv-import ml-100k/u.data ratings I'm getting this error: java.lang.Exception: java.lang.OutOfMemoryError: Java heap space at…
Eugene Goldberg
  • 14,286
  • 20
  • 94
  • 167
1
vote
2 answers

How to avoid an IO error while using kite-dataset to import data?

I'm using the Hortonworks HDP distro (2.4) on Ubuntu 14. I downloaded kite-dataset and ran this command: ./kite-dataset -v csv-import --delimiter '|' ml-100k/u.item movies I'm getting this error: WARNING: Use "yarn jar" to launch YARN applications. SLF4J:…
Eugene Goldberg
  • 14,286
  • 20
  • 94
  • 167
1
vote
1 answer

How to properly import a CSV dataset using a kite-dataset partitioned schema?

I'm working with the publicly available CSV dataset from MovieLens. I have created a partitioned dataset for ratings.csv: kite-dataset create ratings --schema rating.avsc --partition-by year-month.json --format parquet Here is my…
Eugene Goldberg
  • 14,286
  • 20
  • 94
  • 167