Expanding the power of Vertica Library for AWS with Filter Functions

After publishing the “Loading data from AWS S3 bucket to Vertica made easy” post I had people asking me about the possibility of using the S3 function to load data in different formats. The most frequently asked question was about loading GZipped files from the S3 bucket. Vertica COPY command expects an uncompressed CSV input Read More …

Loading data from AWS S3 bucket to Vertica made easy

The COPY command is an obvious choice when you need to load a significant amount of data into a Vertica database. In the past a lot of people have run into difficulties when their data was stored in an AWS S3 bucket. At that time, the COPY command assumed that all the file(s) with data Read More …

Installing a Single-Node HPE Vertica 8 Cluster on Ubuntu 14.04

HPE Vertica 8 was released back in August. This post will cover basic installation of single-node Vertica 8.0.0 cluster on Ubuntu Server. My assumptions this is minimal install, which is not intended for production use I am not worried about redundancy, so I can do a single-node cluster I am not planning to do benchmarking Read More …

Tips and Tricks on Working With vertica.log

Vertica.log is the main log file of the Vertica database. After reading my previous post, HP Vertica Database Log Files and Their Locations, you know how to locate vertica.log on each node. In this post we will examine the content of this log file and will use it for different operational scenarios. Each node in the Read More …

HP Vertica Database Log Files and Their Locations

An HP Vertica database has 2 main log files. Their names are dbLog and vertica.log. In the majority of cases you will be working with vertica.log. The dbLog file is used by Vertica during database startup. If you have issues starting the database then check dbLog for any recent messages. If the database started successfully Read More …

Exporting Vast Amounts of Data Using Parallel Export for HPE Vertica

Sometimes you can find yourself in a position in which you need to export data out of Vertica. It is not a problem for small data sets, but could be challenging if you have millions of records. If you try to use vsql for this purpose you will be doing the export through a single Read More …

Using Database Designer to Optimize Specific Class of Queries in HP Vertica

A lot of people who are new to Vertica ask if you can create an index on a table in a Vertica database. The answer they get is NO. Vertica does not have a CREATE INDEX statement and does not have the concept of an index. Let’s see if we can come up with something Read More …

Working with IDENTITY columns in HP Vertica

HP Vertica has 2 different types of columns that incremented automatically by the database. They are AUTO_INCREMENT and IDENTITY. The key difference is in the level of control the user has. The AUTO_INCREMENT column always starts with 1 and increments by 1. Those values are hardcoded and can not be changed. IDENTITY provides more control. Read More …

Using Database Designer to Build Projections with Optimal Sort Order in HP Vertica

The performance level of your queries in Vertica depend on the design of your projections. In the projection definitions you specify how your data will be segmented and how your data will be sorted. If you are going to make the right choices your queries will perform really well. I covered projection segmentation in my Read More …