Custom Installation
As described in the Components chapter, there are a couple of ways to set up pgwatch. The two most common ones though are the central configuration database approach and the YAML file based approach, plus Grafana to visualize the gathered metrics.
Configuration Database based setup
Overview of installation steps
- Install Postgres or use any available existing instance - v11+ is required but the latest major version is recommended.
- Bootstrap the configuration database.
- Bootstrap the metrics measurements storage database aka sink (PostgreSQL here).
- Install pgwatch - either from pre-built packages or by compiling the Go code.
- Prepare the "to-be-monitored" databases for monitoring by creating a dedicated login role name as a minimum.
- Add some databases to the monitoring configuration via the Web UI, REST API or directly in the configuration database.
- Start the pgwatch metrics collection agent and monitor the logs for any problems.
- Install and configure Grafana and import the pgwatch sample dashboards to start analyzing the metrics.
- Make sure that auto-start services are in place for all components and optionally also set up backups.
Detailed steps for the configuration database approach with Postgres sink
Below are sample steps for a custom installation from scratch, using Postgres for the pgwatch configuration database, the measurements database and the Grafana configuration database.
All examples here assume Ubuntu as the OS, but things are basically the same for the RedHat family of operating systems as well, minus package installation syntax differences.
Install Postgres
Basically, follow the standard Postgres install procedure. Use the latest major version available; v11 is the minimum required.
To get the latest Postgres versions, the official Postgres PGDG repos are to be preferred over default distro repos. Follow the instructions from:
- https://www.postgresql.org/download/linux/debian/ - for Debian / Ubuntu based systems
- https://www.postgresql.org/download/linux/redhat/ - for CentOS / RedHat based systems
- https://www.postgresql.org/download/windows/ - for Windows
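For Debian / Ubuntu based systems, the PGDG repository setup currently looks roughly like the commands below - the linked page is authoritative and the major version number is just an example:

sudo apt install -y postgresql-common
sudo /usr/share/postgresql-common/pgdg/apt.postgresql.org.sh
sudo apt install -y postgresql-17   # or just "postgresql" for the repo default version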
Install pgwatch either from pre-built packages or by compiling the Go code
Using pre-built packages which are available on the GitHub releases page:
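A hypothetical example for a Debian / Ubuntu system - the exact package file name and version on the releases page will differ:

# download the .deb package for your platform from the releases page first
sudo dpkg -i pgwatch_X.Y.Z_amd64.deb   # file name is a placeholder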
Compiling the Go code yourself
This method is of course not needed, unless dealing with maximum security environments or some slight code changes are required.
Install Go by following the official instructions
Get the pgwatch project's code and compile the gatherer daemon
git clone https://github.com/cybertec-postgresql/pgwatch.git
cd pgwatch/internal/webui
yarn install --network-timeout 100000 && yarn build
cd ../..
go build ./cmd/pgwatch/
After fetching all the Go library dependencies (can take minutes) an executable named "pgwatch" should be generated. Additionally, it's a good idea to copy it to /usr/bin/pgwatch.
Configure a SystemD auto-start service (optional). Here is a sample unit file:
[Unit]
Description=pgwatch
After=network-online.target
# After=<postgresql@17-main.service>

[Service]
User=pgwatch
Type=notify
ExecStart=/usr/bin/pgwatch --sources=postgresql://pgwatch@localhost:5432/pgwatch --sink=postgresql://pgwatch@localhost:5432/pgwatch_metrics
Restart=on-failure
TimeoutStartSec=0

[Install]
WantedBy=multi-user.target
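Assuming the unit file is saved as /etc/systemd/system/pgwatch.service (the path is an assumption, adjust to your setup), enable it with:

sudo systemctl daemon-reload
sudo systemctl enable pgwatch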
Bootstrap the configuration database
Note
The detailed steps are described in the Bootstrapping the Configuration Database chapter
Create a user to "own" the pgwatch database. Typically called pgwatch, but it can be anything really, if the schema creation file is adjusted accordingly.
Roll out the pgwatch config schema (optional)
pgwatch will automatically create the necessary tables and indexes in the database when it starts. But in case you want to create the schema as a separate step, you can use the config init command-line command:
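A minimal sketch of both sub-steps, assuming a local cluster, the default pgwatch role name and the flag syntax shown later in this chapter (adjust passwords and connection details to your environment):

sudo -u postgres createuser --pwprompt pgwatch
sudo -u postgres createdb --owner=pgwatch pgwatch
pgwatch --sources=postgresql://pgwatch@localhost:5432/pgwatch config init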
Bootstrap the measurements storage database (sink)
Note
The detailed steps are described in the Bootstrapping the Metrics Measurements Database (Sink) chapter
Create a dedicated database for storing metrics and a user to "own" the measurements schema. Here again the default scripts expect a role named pgwatch, but it can be anything if you adjust the scripts accordingly:
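For example, reusing the pgwatch role created in the previous step and the sink database name used throughout this guide:

sudo -u postgres createdb --owner=pgwatch pgwatch_metrics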
Prepare the "to-be-monitored" databases for metrics collection
As a minimum we need a plain unprivileged login user. Better though is to also grant the user the pg_monitor system role, available on v10+. Superuser privileges should normally be avoided for obvious reasons, but for initial testing in safe environments they can make the initial preparation (automatic helper rollouts) a bit easier, given that superuser privileges are later stripped.
To get the most out of your metrics, some SECURITY DEFINER wrapper functions called "helpers" are recommended on the DBs under monitoring. See the detailed chapter on the "preparation" topic for more details.
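A minimal sketch, to be run on the instance under monitoring (role name and password are placeholders):

psql -c "CREATE ROLE pgwatch WITH LOGIN PASSWORD 'secret'"
psql -c "GRANT pg_monitor TO pgwatch"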
Configure sources and metrics with intervals to be monitored
- from the Web UI "Sources" page
- via direct inserts into the Config DB pgwatch.source table
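An illustrative sketch of the direct-insert route - the column names below are assumptions based on the YAML source fields shown at the end of this chapter, so check the actual table definition first:

psql -d pgwatch -c "\d pgwatch.source"   # inspect the real column names first
# hypothetical columns, adjust to the actual schema
psql -d pgwatch -c "INSERT INTO pgwatch.source (name, conn_str, preset_metrics) VALUES ('test1', 'postgresql://pgwatch:secret@somehost/mydb', 'exhaustive')"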
Start the pgwatch metrics collection agent
The gatherer has quite a few parameters (use the --help flag to show them all), but the simplest form would be:

pgwatch \
  --sources=postgresql://pgwatch@localhost:5432/pgwatch \
  --sink=postgresql://pgwatch@localhost:5432/pgwatch_metrics \
  --log-level=debug
Or via SystemD, if set up in the previous steps:
sudo useradd -m -s /bin/bash pgwatch   # default SystemD templates run under the pgwatch user
sudo systemctl start pgwatch
sudo systemctl status pgwatch
After initial verification that all works, it's usually a good idea to set verbosity back to default by removing the --log-level=debug flag.
Monitor the console or log output for any problems
If you see metrics trickling into the "pgwatch_metrics" database (metric names are mapped to table names and tables are auto-created), then congratulations - the deployment is working! When using a more aggressive preset metrics config there are usually still some errors though, due to the fact that some extensions or privileges are missing on the monitored database side. See the according chapter here.
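A quick way to check, assuming the pgwatch_metrics sink database from above (the exact table names depend on which metrics are enabled):

psql -d pgwatch_metrics -c "\dt+"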
Install Grafana
Create a Postgres database to hold the Grafana internal config, like dashboards etc.
Theoretically it's not absolutely required to use Postgres for storing Grafana internal settings, but doing so has two advantages: you can easily roll out all pgwatch built-in dashboards, and you can also do remote backups of the Grafana configuration easily.
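For example, matching the grafana.ini snippet further below (role and database names are just the ones used in this guide, the password is a placeholder):

sudo -u postgres createuser --pwprompt pgwatch_grafana
sudo -u postgres createdb --owner=pgwatch_grafana pgwatch_grafana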
Follow the instructions from the Grafana documentation, basically something like:
wget -q -O - https://packages.grafana.com/gpg.key | sudo apt-key add -
echo "deb https://packages.grafana.com/oss/deb stable main" | sudo tee -a /etc/apt/sources.list.d/grafana.list
sudo apt-get update && sudo apt-get install grafana
# review / change config settings and security, etc
sudo vi /etc/grafana/grafana.ini
# start and enable auto-start on boot
sudo systemctl daemon-reload
sudo systemctl enable grafana-server
sudo systemctl start grafana-server
sudo systemctl status grafana-server
Default Grafana port: 3000
Configure Grafana to use our pgwatch_grafana DB
Place something like the below in the [database] section of /etc/grafana/grafana.ini:
[database]
type = postgres
host = my-postgres-db:5432
name = pgwatch_grafana
user = pgwatch_grafana
password = xyz
Taking a look at the [server], [security] and [auth*] sections is also recommended.
Set up the pgwatch metrics database as the default datasource
We need to tell Grafana where our metrics data is located. Add a datasource via the Grafana UI (Admin -> Data sources) or use pgwatch/grafana/postgres_datasource.yml for provisioning.
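If going the provisioning route, something along these lines should work, assuming the standard Grafana provisioning path of a package install (edit the connection details inside the file first):

sudo cp pgwatch/grafana/postgres_datasource.yml /etc/grafana/provisioning/datasources/
sudo systemctl restart grafana-server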
Add pgwatch predefined dashboards to Grafana
This could be done by importing the pgwatch dashboard definition JSONs manually, one by one, from the pgwatch/grafana folder ("Import Dashboard" from the Grafana top menu) or by dashboard provisioning.
Optionally also install Grafana plugins
Currently, one pre-configured dashboard (Biggest relations treemap) uses an extra plugin - if planning to use that dashboard, then run the following:
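A sketch of the general plugin installation command - the plugin ID is a placeholder, look up the panel type required by the dashboard JSON in pgwatch/grafana first:

sudo grafana-cli plugins install <treemap-panel-plugin-id>
sudo systemctl restart grafana-server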
Start discovering the preset dashboards
If the previous step of launching pgwatch succeeded, and it was more than a few minutes ago, one should already see some graphs on dashboards like "DB overview" or "DB overview Unprivileged / Developer mode", for example.
YAML Configuration based setup
The process is mostly identical to the above, but instead of setting up a Postgres database for the configuration, one uses a YAML file. For details on the individual steps like installing pgwatch, see the paragraphs above.
- Install Postgres or use any available existing instance - v11+ is required but the latest major version is recommended.
- Edit the YAML file to include the sources to be monitored.
- Bootstrap the metrics measurements storage database aka sink (PostgreSQL here).
- Install pgwatch - either from pre-built packages or by compiling the Go code.
- Prepare the "to-be-monitored" databases for monitoring by creating a dedicated login role name as a minimum.
- Add some databases to the monitoring configuration via the Web UI, REST API or directly in the sources YAML file.
- Start the pgwatch metrics collection agent and monitor the logs for any problems.
- Install and configure Grafana and import the pgwatch sample dashboards to start analyzing the metrics.
- Make sure that auto-start services are in place for all components and optionally also set up backups.
YAML Configuration file
The content of the file is an array of source definitions, like this:
- name: test1                # An arbitrary unique name for the monitored source
  kind: postgres             # One of:
                             # - postgres
                             # - postgres-continuous-discovery
                             # - pgbouncer
                             # - pgpool
                             # - patroni
                             # - patroni-continuous-discovery
                             # - patroni-namespace-discover
                             # Defaults to postgres if not specified
  conn_str: postgresql://pgwatch:pgwatchadmin@somehost/mydb
  preset_metrics: exhaustive # from the list of presets defined in "metrics/preset-configs.yaml"
  custom_metrics:            # if both preset and custom are specified, custom wins
  preset_metrics_standby:    # optional metrics configuration for standby / replica state, v1.8.1+
  custom_metrics_standby:
  include_pattern:           # regex to filter databases to actually monitor for the "continuous" modes
  exclude_pattern:
  is_enabled: true
  group: default             # just for logical grouping of DB hosts or for "sharding",
                             # i.e. splitting the workload between many gatherer daemons
  custom_tags:               # option to add arbitrary tags for every stored data row,
    aws_instance_id: i-0af01c0123456789a # for example to fetch data from some other source onto the same Grafana graph
...
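A sketch of starting the gatherer against such a sources file, reusing the sink database from the walkthrough above (the file path is just an example):

pgwatch \
  --sources=/etc/pgwatch/sources.yaml \
  --sink=postgresql://pgwatch@localhost:5432/pgwatch_metrics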