SnappyData v.0-5
Goal: I want to create a persistent, replicated ROAD table and load it from a CSV file using the Snappy Shell. The ROAD table should have 'road_id' as a primary key to prevent duplicate IDs.
The commands I tried are:
SET SCHEMA A;
DROP TABLE IF EXISTS ROAD;
DROP TABLE IF EXISTS STAGING_ROAD;
CREATE TABLE STAGING_ROAD
(road_id string, name string)
USING com.databricks.spark.csv
OPTIONS(path 'roads.csv', header 'true');
CREATE TABLE ROAD
(
road_id VARCHAR(64) NOT NULL,
name VARCHAR(64) NOT NULL,
CONSTRAINT road_PK PRIMARY KEY (road_id)
) USING row OPTIONS (BUCKETS '5', REPLICATE, PERSISTENT)
AS (select road_id, name from STAGING_ROAD);
This does not work. And, I have to dumb down my ROAD create to this to make it even get created. This does not have a PK. It does not have replication or persistence.
CREATE TABLE ROAD USING row OPTIONS ()
AS (select road_id, name from STAGING_ROAD);
How should I be scripting a SnappyData SQL file to accomplish the goal stated above?