Postgres WAL Replication: Easy Step-By-Step Guide

Q: What is the difference between wal_receiver_timeout and wal_sender_timeout?

wal_receiver_timeout : Defines the maximum time (in milliseconds) that a WAL receiver waits for data from the WAL sender before timing out and disconnecting. wal_sender_timeout : Specifies the maximum time (in milliseconds) that a WAL sender waits for an acknowledgment from the WAL receiver before timing out and closing the connection.

Q: What is wal_level in PostgreSQL?

The wal_level setting in PostgreSQL determines the amount of information written to the WAL logs. It can be set to different levels (e.g., minimal, replica, logical), depending on whether the WAL is used for basic backups, replication, or logical decoding for use cases like logical replication.

Data replication ensures the duplication of data on an ongoing basis so that the replicate is in a consistently updated state and is synchronized with the source. This ensures high data availability and data safety against undesired events such as crashes, system errors, etc. Postgres WAL replication meets this requirement through a feature called streaming replication. This feature is achieved through a master-slave configuration.

This blog aims to explain how to perform Postgres WAL replication (Write-Ahead logging).

Note: For those new to database administration, understanding the intricacies of Postgresql WAL replication is essential for maintaining a resilient and efficient data environment.

Table of Contents

Postgres WAL Replication

Postgres WAL streaming is a preferred replication strategy for large-scale databases due to its efficiency in handling transaction logs. WAL helps to replicate data between the database servers in two ways:

File-Based Log Shipping
Streaming WAL Records

Hevo is the only real-time ELT No-code Data Pipeline platform that cost-effectively automates data pipelines that are flexible to your needs. With integration with 150+ Data Sources (40+ free sources), we help you not only export data from sources & load data to the destinations but also transform & enrich your data, & make it analysis-ready.

Start for free now!

Get Started with Hevo for Free

File-Based Log Shipping

To always keep the data in sync, the master directly sends the WAL logs to the slave. The master can either copy the logs to the slave server’s storage or simply share storage with them.

All WAL logs have a maximum storage capacity of 16MBs and are shipped only after they reach the threshold value. This can cause a delay in the replication process and increase the chances of a data loss due to a possible master crash.

Streaming WAL Records

The database servers stream WAL records in chunks to ensure that the data is always in sync. The slave servers receive the WAL chunks by connecting with the master server. The advantage of streaming WAL records is that they don’t wait for full capacity; these are streamed immediately. This helps in keeping the standby server up-to-date.

Also, the Postgres wal_level replica configuration is the default setting for PostgreSQL, balancing the need for data durability with performance.

Streaming replication is asynchronous by default however, it supports synchronous replication mode as well.

Note: When setting up a new replication environment, conducting a Postgres check replication WAL is critical to verify that everything is configured properly.

Setting Up Streaming Replication In PostgreSQL

Postgres WAL replication or streaming replication can be set up quite easily upon installing the PostgreSQL on all servers. Here you need to configure both the master node & the standby node.

Configuring Master Node
Configuring Standby Node

Configuring Master Node

The master node can be configured using the following steps:

Step 1: Database Initialization
Step 2: Creating A New User
Step 3: Configuring Streaming Properties
Step 4: Replication Entry In pg_hba.conf File

Step 1: Database Initialization

To initialize the database, make use of the initidb utility command.

Step 2: Creating A New User

Create a new user with replication privileges using the following command:

CREATE USER <user_name> REPLICATION LOGIN ENCRYPTED PASSWORD ’<password>’;

The user must specify a username and password for the given query. The replication keyword is used to provide the user with the necessary privileges.

Example query:

CREATE USER sample_user REPLICATION LOGIN ENCRYPTED PASSWORD 'sample_pass'

Step 3: Configuring Streaming Properties

You can configure the streaming properties using the PostgreSQL configuration file (postgresql.conf). Make the following changes in the file:

wal_level = replica
wal_log_hints = on
max_wal_senders = 3
wal_keep_segments = 8
hot_standby = on

The parameters used are:

wal_level: This is used to enable Postgres WAL replication/ streaming replication. The possible values here are replica, minimal, and logical.
wal_log_hints: It is required for the pg_rewind capability, this helps when the standby server is out of sync with the master server.
max_wal_senders: It is used to specify the maximum numbers of concurrent connections that can be established with the standby servers.
wal_keep_segments: It is used to specify a minimum number of WAL logs segments so that they are not deleted before standby consumes the entire capacity.
hot_standby: This parameter enables a read-only connection with the slave when it is set to ON.

Step 4: Replication Entry In pg_hba.conf File

This allows the servers to establish a connection between each other for replication. The file is usually found in the data directory of PostgreSQL.

Use the following lines of code:

host replication rep_user IPaddress md5

This line configures PostgreSQL access control for replication purposes.
host replication rep_user IPaddress md5 allows the user rep_user to perform replication tasks.
IPaddress specifies the IP address from which the user is allowed to connect.
The md5 indicates that the authentication method is MD5 password hashing.
This setup is typically added to PostgreSQL’s pg_hba.conf file to control access for database replication.

The master server now allows a user named rep_user to connect & act as the slave server using the specified IP for replication.

Example query:

host replication rep_user 192.168.0.22/32 md5

This is how a master node can be set up for Postgres WAL replication.

Configuring Standby Node

Once the master node is configured, the next step for Postgres WAL replication is to configure the standby node. You can configure the standby node in two simple steps:

Step 1: Create A Backup Of The Master Node
Step 2: Creating The Replication Configuration File

Step 1: Create A Backup Of The Master Node

Use the pg_basebackup utility to create a backup of the master node. This will be used as the starting point for the slave mode.

The utility must be used with the following syntax:

pg_basebackp -D <data_directory> -h <master_host> -X stream -c fast -U rep_user -W

The parameters used are:

-D: This indicates the directory you are working on.
-h: This is used to specify the master host.
-X: This is used to include the necessary transactional log files.
-C: This is used to set the checkpoints.
-W: This is used to prompt the user for a password before connecting to the database.

Step 2: Creating The Replication Configuration File

Check if the replication configuration file exists. If it doesn’t, create the replication configuration file as recovery.conf. The file should be created in the data directory of the PostgreSQL installation.

You can create the file automatically using the -R option in pg_basebackup utility.

The recovery.conf file should have the following commands:

standby_mode = ‘on’
primary_conninfo  = ‘host=<master_host> port=<postgres_port> user=<replication_user> password=<password> application_name=”host_name”’
recovery_target_timeline = ‘latest’

The host=<master_host> specifies the primary server’s hostname or IP address, and port=<postgres_port> is the port used for PostgreSQL connections.
This configuration is typically used for setting up PostgreSQL streaming replication between a primary and standby server.

The parameters used are:

standby_mode: When this parameter is set to ON, it causes the server to start as the slave.
primary_conninfo: This parameter is used to establish a connection between the slave & the master server with the help of a connection string.
recovery_target_timeline: It is used to set the recovery time.

You need to give the IP address, username & password of the master server as values for the primary_conninfo parameter to establish a connection.

Example query:

primary_conninfo = 'host=192.168.0.26 port=5432 user=rep_user password=rep_password'

Now restart the slave server to complete the process.

This is how the slave server is configured for Postgres WAL replication.

Learn about PostgreSQL Multi-master Replication: 6 Easy Setup Steps

Integrate PostgreSQL to PostgreSQL

Get a Demo Try it

Integrate Heroku for PostgreSQL to Snowflake

Get a Demo Try it

Integrate Heroku for PostgreSQL to Databricks

Get a Demo Try it

Making Streaming Replication Synchronous

Postgres WAL replication is asynchronous by default however, it is possible to make streaming replication synchronous by using the following command:

synchronous_standby_names = ‘num_sync ( standby_name [, ...] )’

The parameters used are:

num_sync: It denotes the number of synchronous slave servers from where the transactions should wait for a reply.
standby_name: This parameter should have the same value as that of the application_name parameter in recovery.conf file.

If you want all slave servers to be considered synchronous, set the value of standby_name to “*”, otherwise you can mention specific names by separating them with a comma.

Comparative Analysis of Synchronous and Asynchronous Replication

The fundamental distinction between synchronous and asynchronous replication lies in the standby server’s response timing. This decision is critical as it influences SQL processing response times and the system’s high availability.

Synchronous Replication:

The primary server postpones the completion of a process until it receives confirmation from the standby server. Consequently, the total response time encompasses the duration of log shipping.
This method ensures immediate data synchronization with the standby server, enhancing data integrity and reliability.
It is optimal for operations requiring failover and read-only load balancing.

Asynchronous Replication (Default Setting):

The primary server finalizes a process without awaiting the standby server’s acknowledgment. As a result, the total response time is comparable to scenarios without streaming replication.
Data synchronization and updates on the standby server occur asynchronously, which may lead to a lag in reflecting the primary server’s latest data.
There is a risk of data loss in the event of a failover.
This approach is preferred for geographically distant replication, particularly for disaster recovery.

Download the Ultimate Guide on Database Replication

Learn the 3 ways to replicate databases & which one you should prefer.

Before wrapping up, let us look at the basics of Replication and WAL.

Introduction To Replication

Replication refers to copying modifications in data from the publisher database to the subscriber database. In layman’s terms, it creates a replica of one database in another. These databases are usually located on different physical servers and help distribute various database queries. This helps to form a load-balancing framework.

Replication is implemented in PostgreSQL using a master-slave configuration. The master acts as the primary instance and handles the primary database and its operations. The slave acts as the secondary instance and implements all modifications made to the primary database on itself, thus making itself an identical copy. The master is the read/write server, whereas the slave is a read-only server.

For further information on replication in PostgreSQL, you can check the documentation manual.

Introduction To WAL

WAL stands for Write-Ahead Logging. It is the standard protocol to ensure that all the changes made to the database are being logged properly in their order of occurrence. It helps maintain data integrity by facilitating a recovery process in the case of a database crash.

WAL is enabled in PostgreSQL by default and has a default size of 16MB. These files are found in pg_xlog or pg_wal (PostgreSQL 10) directory. The log files are written sequentially and hence the cost of scanning through these files is quite less. These files have a unique name in the format of :

“00000001 00000000 00000000”

One major advantage of WAL is that it supports online-backup and point-in-time recovery. It also reduces the number of disk writes as only the log file needs to be flushed to the disk to guarantee that a transaction has occurred.

For further information on WAL, you can check the documentation manual.

Learn More About:

Understanding Write Ahead Logging

Conclusion

This article teaches you how to perform Postgres WAL replication easily and answers all your queries. It provides a brief introduction of various concepts related to it & helps the users understand them better and use them to perform data replication & recovery in the most efficient way possible. This helps maintain data integrity & availability. It also introduces an alternative method: Hevo Data, a No-code Data Pipeline to help users replicate data effortlessly and highly securely without having to write any code.

FAQ

What is WAL replication?

WAL (Write-Ahead Logging) replication in PostgreSQL is a mechanism where changes to the database are first written to a log (WAL) before being applied to the actual database. This allows the changes to be replicated to standby servers, ensuring data redundancy and availability.

What is the difference between `wal_receiver_timeout` and `wal_sender_timeout`?

wal_receiver_timeout: Defines the maximum time (in milliseconds) that a WAL receiver waits for data from the WAL sender before timing out and disconnecting.
wal_sender_timeout: Specifies the maximum time (in milliseconds) that a WAL sender waits for an acknowledgment from the WAL receiver before timing out and closing the connection.

What is `wal_level` in PostgreSQL?

The wal_level setting in PostgreSQL determines the amount of information written to the WAL logs. It can be set to different levels (e.g., minimal, replica, logical), depending on whether the WAL is used for basic backups, replication, or logical decoding for use cases like logical replication.

Nicholas Samuel Technical Content Writer, Hevo Data

Nicholas Samuel is a technical writing specialist with a passion for data, having more than 14+ years of experience in the field. With his skills in data analysis, data visualization, and business intelligence, he has delivered over 200 blogs. In his early years as a systems software developer at Airtel Kenya, he developed applications, using Java, Android platform, and web applications with PHP. He also performed Oracle database backups, recovery operations, and performance tuning. Nicholas was also involved in projects that demanded in-depth knowledge of Unix system administration, specifically with HP-UX servers. Through his writing, he intends to share the hands-on experience he gained to make the lives of data practitioners better.

Postgres WAL Replication: Easy Step-By-Step Guide

Postgres WAL Replication

File-Based Log Shipping

Streaming WAL Records

Setting Up Streaming Replication In PostgreSQL

Configuring Master Node

Step 1: Database Initialization

Step 2: Creating A New User

Step 3: Configuring Streaming Properties

Step 4: Replication Entry In pg_hba.conf File

Configuring Standby Node

Step 1: Create A Backup Of The Master Node

Step 2: Creating The Replication Configuration File

Making Streaming Replication Synchronous

Comparative Analysis of Synchronous and Asynchronous Replication

Introduction To Replication

Introduction To WAL

Conclusion

FAQ

What is WAL replication?

What is the difference between wal_receiver_timeout and wal_sender_timeout?

What is wal_level in PostgreSQL?

Optimize your data integration with Hevo!

What is the difference between `wal_receiver_timeout` and `wal_sender_timeout`?

What is `wal_level` in PostgreSQL?