Questions tagged [vertica]

Vertica is an MPP column-store analytics database that runs on clusters of Linux servers or in the cloud. The original configuration (Enterprise mode) distributes data and query execution among nodes (typically with duplication). A newer configuration (Eon mode) separates compute from storage and relies on S3 for storage.

Vertica is a columnar database designed for data warehousing. Vertica is based on C-Store, which was an academic project involving academic and entrepreneur Michael Stonebraker (a co-founder of Vertica, Ingres, VoltDB) and colleagues from Brandeis University, Brown University, MIT, and University of Massachusetts Boston.

As part of the C-store architecture table data is stored on disk in columns rather than in rows. Columns are stored independently of other columns and allow for increased compression on storage and decreased I/O during query operations.

Vertica has advanced analytical and machine learning functions built-in.

Vertica is designed to run on clusters of Linux nodes. A Vertica cluster runs in a MPP (Massive Parallel Processing) fashion, and it is designed to scale linearly by adding nodes to the cluster.

Vertica clusters can be deployed on premise (in Enterprise Mode) or in the cloud (in either Enterprise Mode or Eon Mode). Enterprise Mode consists of Vertica nodes that share compute and storage resources. Eon Mode is a cloud based mode in which there is a separation of compute nodes (AWS EC2 instances) and storage (S3). This separation of compute and storage allows for rapid scalability (adding or removing compute nodes) without the need to rebalance data across the nodes in the cluster.

Veritca was purchased by HP in 2011, and then acquired by Micro Focus as part of the HPE merger in 2017. As such, Vertica is sometimes referred to in old documentation and community posts as HP Vertica or HPE Vertica.

Vertica comes with a command line utility called vsql for connecting to the database.

1420 questions
7
votes
1 answer

How “LIKE” by LONG VARCHAR?

I had DB Vertica 8.0. Create schema WAREHOUSE with table include field by type LONG VARCHAR. Now I tried to execude SELECT for example SELECT * FROM WAREHOUSE.ALL_EVENTS a WHERE a.original_data like '%d963%' returned error SQL Error [4286]…
Nikolay Baranenko
  • 1,582
  • 6
  • 35
  • 60
7
votes
1 answer

NotImplementedError: executemany is implemented for simple INSERT statements only

I try to append my vertica (SQL-type) table through pandas using sqlalchemy import pandas as pd import sqlalchemy as sa Create engine to vertica: def get_engine(base): engine = sa.create_engine("{sys}+{dri}://{user}:" + \ …
Ivan Savin
  • 151
  • 8
7
votes
2 answers

How to fetch the filename from CSV and insert it into one of columns of a table

Suppose mt table name is Table_1 Structure of the table is something like: EMP_id, Emp_Name, File_Name_Input_Date (columns) I am loading Table_1 from a CSV file. The name of this CSV file is daily_01122014.csv I am using vertical copy command to…
Hdk
  • 197
  • 1
  • 1
  • 8
7
votes
1 answer

how connect to vertica using pyodbc

I've read the iODBC documentation regarding the odbc.ini, and the Vertica documentation. I also saw a question with the same error, but can't get the connection string to work as suggested in the previous question's answer. What I currently…
Kermit
  • 33,827
  • 13
  • 85
  • 121
7
votes
2 answers

Connect R and Vertica using RODBC

This is my first time connecting to Vertica. I have already connected to a MySQL database sucessfully by using RODBC library. I have the database setup in vertica and I installed the windows 64-bit ODBC driver from…
Jana
  • 1,523
  • 3
  • 14
  • 17
7
votes
2 answers

Is there a way to query and find out the version of Vertica installed?

I searched the Vertica documentation for the keywords Version and Version Number. But I couldn't really get any mention of SQL query to select the version of Vertica installed. I thought there would be some system table that had some version…
7
votes
6 answers

Delete Duplicate rows in Vertica database

Vertica allows duplicates to be inserted into the tables. I can view those using the 'analyze_constraints' function. How to delete duplicate rows from Vertica tables?
Amit
  • 19,780
  • 6
  • 46
  • 54
7
votes
3 answers

Vertica: Data validation of duplicate/primary key

I'm trying to create a validation procedure during a load that checks to make sure data isn't duplicated. Vertica doesn't support this natively: Vertica checks for constraint violations when queries are run, not when data is loaded. To detect…
Peter
  • 1,065
  • 14
  • 29
7
votes
4 answers

PROJECTION in vertica database

Could someone explain me the concept of PROJECTION in vertica database with an example query?
Murali
  • 811
  • 2
  • 7
  • 12
6
votes
4 answers

Using FIRST_VALUE without including inner columns in group by

I am using a table that looks like this: userID, eventDate, eventName 1 2019-01-01 buySoup 2 2019-01-01 buyEggs 2 2019-01-03 buyMilk 2 2019-01-04 buyMilk 3 2019-01-02 buyBread 3 2019-01-03 buyBread My current query is: SELECT …
Cyborgcanoe
  • 61
  • 1
  • 4
6
votes
5 answers

SQL Query for events that happend in a specific order

I have the following table: +--------+-------+------+--+ | Object | Event | Time | | +--------+-------+------+--+ | Obj1 | A | 1 | | | Obj1 | B | 3 | | | Obj2 | A | 7 | | | Obj2 | B | 4 | …
valenzio
  • 773
  • 2
  • 9
  • 21
6
votes
3 answers

aggregate function to concatenate strings in Vertica

have a table in vertica: test like this: ID | name 1 | AA 2 | AB 2 | AC 3 | AD 3 | AE 3 | AF how could I use an aggregate function or how to write a query to get data like this (vertica…
yabchexu
  • 543
  • 7
  • 21
6
votes
6 answers

Convert comma separated string to a list

I want to pass a list of int's (comma separated) which is a field in my table ie. 1234, 2345, 3456, 4567 to my IN clause in WHERE. But the list is a string (VARCHAR), and I'm comparing to an int field. Is there a way for me to convert the list to…
Neha
  • 233
  • 1
  • 3
  • 11
6
votes
2 answers

Vertica-Tableau error Multiple commands cannot be active

We have dataset in Vertica and Tableau is querying the data (4 Billions record) from vertica for dashboard as shown below : All list and graphs are separate worksheets in tableau and using same connection to Vertica DB. Each list is a column in DB…
roy
  • 6,344
  • 24
  • 92
  • 174
6
votes
2 answers

Return Top N rows for each group (Vertica/vsql)

Familiar question, but with Vertica. I'd like to return the top 5 geo_country rows based on sum(imps) for each tag_id. This is the query I started: SELECT tag_id, geo_country, SUM(imps) AS imps, RANK() OVER (PARTITION BY tag_id…
ChrisArmstrong
  • 2,491
  • 8
  • 37
  • 60
1
2
3
94 95