Questions tagged [vertica]

Vertica is an MPP column-store analytics database that runs on clusters of Linux servers or in the cloud. The original configuration (Enterprise mode) distributes data and query execution among nodes (typically with duplication). A newer configuration (Eon mode) separates compute from storage and relies on S3 for storage.

Vertica is a columnar database designed for data warehousing. Vertica is based on C-Store, which was an academic project involving academic and entrepreneur Michael Stonebraker (a co-founder of Vertica, Ingres, VoltDB) and colleagues from Brandeis University, Brown University, MIT, and University of Massachusetts Boston.

As part of the C-store architecture table data is stored on disk in columns rather than in rows. Columns are stored independently of other columns and allow for increased compression on storage and decreased I/O during query operations.

Vertica has advanced analytical and machine learning functions built-in.

Vertica is designed to run on clusters of Linux nodes. A Vertica cluster runs in a MPP (Massive Parallel Processing) fashion, and it is designed to scale linearly by adding nodes to the cluster.

Vertica clusters can be deployed on premise (in Enterprise Mode) or in the cloud (in either Enterprise Mode or Eon Mode). Enterprise Mode consists of Vertica nodes that share compute and storage resources. Eon Mode is a cloud based mode in which there is a separation of compute nodes (AWS EC2 instances) and storage (S3). This separation of compute and storage allows for rapid scalability (adding or removing compute nodes) without the need to rebalance data across the nodes in the cluster.

Veritca was purchased by HP in 2011, and then acquired by Micro Focus as part of the HPE merger in 2017. As such, Vertica is sometimes referred to in old documentation and community posts as HP Vertica or HPE Vertica.

Vertica comes with a command line utility called vsql for connecting to the database.

1420 questions
4
votes
3 answers

Bulk insert into vertica using Python

I am using python to transfer data (~8 million rows) from oracle to vertica. I wrote a python script which transfers the data in 2 hours, but I am looking for ways to increase the transfer speed. Process I am using : Connect to Oracle Pull the…
Data Enthusiast
  • 521
  • 4
  • 12
  • 22
4
votes
1 answer

Select min/max from group defined by one column as subgroup of another - SQL, HPVertica

I'm trying to find the min and max date within a subgroup of another group. Here's example 'data' ID Type Date 1 A 7/1/2015 1 B 1/1/2015 1 A 8/5/2014 22 B 3/1/2015 22 B 9/1/2014 333 A 8/1/2015 333 B …
Megan
  • 43
  • 4
4
votes
3 answers

Vertica, describe table

I have a vertica database and I need to describe some tables. In MySQL you use describe table_name. And I know that in vertica it's \d table_name. Also I need to do it throw python, but when I make query: cur.execute("\d table_name") I get this…
Opperix
  • 123
  • 2
  • 2
  • 5
4
votes
0 answers

Python ODBC decimal error

I've look for solutions to this error but I'm still striking out. I'm using a python 2.7 venv with pyodbc 3.0.6 connection = pyodbc.connect(myconnstring) cursor = connection.cursor() cursor.execute("""SELECT varchar_column, date_column as epoch,…
rhealitycheck
  • 650
  • 3
  • 8
  • 19
4
votes
4 answers

How to free up vertica raw space after deleting records

We are using Vertica community edition which has raw data limit of 1TB. recently reached 1 TB raw data limit so we decided to delete some records from all tables. After deletion of old records Vertica still shows Utilization : 104% dbadmin=>…
roy
  • 6,344
  • 24
  • 92
  • 174
4
votes
1 answer

Is there a way to set AUTO_INCREMENT property on existing table column in Vertica?

Suppose I have a simple table: CREATE TABLE user( id INT NOT NULL PRIMARY KEY, name VARCHAR(32) NOT NULL, ) Is there a way to alter this table so id will become AUTO_INCREMENT field? I tried the following with no luck: ALTER TABLE (no such…
Michael Spector
  • 36,723
  • 6
  • 60
  • 88
4
votes
2 answers

How to speed up a slow multi-join query in a small Vertica database (~120K total rows, 10 minutes)

I'd love to get your help understanding why this join-heavy query takes about 10 minutes to run on a small database of seven tables totaling < 120K rows, and ideally get your suggestions on how to make it faster on our little cluster of four nodes.…
Matthew Cornell
  • 4,114
  • 3
  • 27
  • 40
4
votes
2 answers

SQL query date according to time zone

We are using a Vertica database with table columns of type timestamptz, all data is inserted according to the UTC timezone. We are using spring-jdbc's NamedParameterJdbcTemplate All queries are based on full calendar days, e.g. start date 2013/08/01…
4
votes
5 answers

Use Vertica Database for OLTP data?

Can Vertica Database be used for OLTP data? And if so what are the pros and cons on doing this? Looking for a Vertica vs Oracle fight :)Since Oracle license is so costly, would Vertica do it job for a better price ? thx all
Up_One
  • 5,213
  • 3
  • 33
  • 65
4
votes
2 answers

Convert the datetime value of one time zone to another time zone in Vertica

Is there anything in Vertica to get the same output as given by the following Sql query in Oracle? SELECT FROM_TZ(CAST(TO_DATE('1999-12-01 11:00:00', 'YYYY-MM-DD HH:MI:SS') AS TIMESTAMP), 'America/New_York') AT TIME ZONE…
Amit
  • 19,780
  • 6
  • 46
  • 54
3
votes
0 answers

Docker container ODBC connection to external Vertica database with Amazon lambda

I am trying to connect to Vertica DB from my Serverless ASP.NET Core API (with Amazon Lambda). To do this, I build a docker container to install the ODBC driver along with creating my lambda function. The docker image successfully is built and…
3
votes
1 answer

Why does SQL complain that a reference to a join key is ambiguous?

Why in the following code does Vertica return an error that zip3 ambiguous? Since it is used as the key for the left join, it must be equal for both tables. SELECT zip, zip3, city, zip3_name FROM zip_codes LEFT JOIN zip3_codes USING (zip3); This…
prototype
  • 7,249
  • 15
  • 60
  • 94
3
votes
2 answers

JOIN on range between current and previous row

could anyone help me please? I have two tables: Prices and Intervals: Prices: Intervals: Price Interval_bound Category 16 5 cheap 11 10 cheap 9 15 …
Polly
  • 1,057
  • 5
  • 14
  • 23
3
votes
2 answers

How to convert Hive array type to Vertica varchar on exporting data

I have a table in Hive db with array type column which I want to copy it with content to Vertica db. The Version of Vertica db is v9.0.1-0 and I cant create table with array type. To copy table I tried to use COPY vertica_schema.destination FROM…
Vadim
  • 753
  • 8
  • 22
3
votes
1 answer

How to connect python with vertica using ODBC?

import sqlalchemy as sa conn = "vertica+pyodbc://dbadmin:password@VMart" sa.create_engine(conn, pool_size=10, max_overflow=20) %load_ext sql %sql vertica+pyodbc://VMart error (pyodbc.InterfaceError) ('IM002', '[IM002] [Microsoft][ODBC Driver …