Regarding the archive_command in "continuois archiving" chapter.

public inbox for pgsql-docs@postgresql.org  
help / color / mirror / Atom feed

Regarding the archive_command in "continuois archiving" chapter.
2+ messages / 2 participants
[nested] [flat]

* Regarding the archive_command in "continuois archiving" chapter.
@ 2026-05-05 14:37 PG Doc comments form <noreply@postgresql.org>
  2026-05-05 18:33 ` Re: Regarding the archive_command in "continuois archiving" chapter. David G. Johnston <david.g.johnston@gmail.com>
  0 siblings, 1 reply; 2+ messages in thread

From: PG Doc comments form @ 2026-05-05 14:37 UTC (permalink / raw)
  To: pgsql-docs@lists.postgresql.org; +Cc: dario_d_s@unitech.com.ar

The following documentation comment has been logged on the website:

Page: https://www.postgresql.org/docs/17/continuous-archiving.html
Description:

The example provided for archive_command for
https://www.postgresql.org/docs/18/continuous-archiving.html#BACKUP-ARCHIVING-WAL
should include commands to ensure that archiving has successfully written
the disk, even during an unexpected shutdown. Currently, PostgreSQL does not
force a write to the file system for archived WAL files as it does for data
files.

During periods of high write activity, PostgreSQL might lose files that were
confirmed as written but remained in the operating system cache. This loss
affects Point-in-Time Recovery (PITR) and the use of pg_rewind. While many
servers utilize power protection such as UPS or battery-backed RAID
controllers, the OS cache remains unprotected during a forced shutdown
without a sync call. This issue does not affect crash recovery or streaming
replication, but it impacts PITR and pg_rewind.

I recommend improving the documentation examples or including a visible
warning. The cp command in Linux should be treated as a stub; it should not
be used without an explicit sync call or by ensuring the destination
directory is mounted in an specific file system with the sync option in
/etc/fstab.

It could also include references to usage of other tools (rsync --fsync,
remote archiving or barman) or the options needed to use it with a nfs
mounted directory, but I might be too open.

This what would look like (I've used ai to help me that this gets written in
proper english)

-----
25.3.1. Setting Up WAL Archiving
In an abstract sense, a running PostgreSQL system produces an indefinitely
long sequence of WAL records. The system physically divides this sequence
into WAL segment files, which are normally 16MB apiece (although the segment
size can be altered during initdb). The segment files are given numeric
names that reflect their position in the abstract WAL sequence. When not
using WAL archiving, the system normally creates just a few segment files
and then “recycles” them by renaming no-longer-needed segment files to
higher segment numbers. It is assumed that segment files whose contents
precede the last checkpoint are no longer of interest and can be recycled.

When archiving WAL data, we need to capture the contents of each segment
file once it is filled and save that data somewhere before the segment file
is recycled for reuse. Depending on the application and the available
hardware, there could be many different ways of “saving the data somewhere”:
we could copy the segment files to an NFS-mounted directory on another
machine, write them onto a tape drive, or batch them together and burn them
onto CDs. To provide the database administrator with flexibility, PostgreSQL
does not make assumptions about how the archiving will be done. Instead,
PostgreSQL lets the administrator specify a shell command or an archive
library to be executed to copy a completed segment file to its destination.
This could be a shell command using cp, or it could invoke a complex C
function.

To enable WAL archiving, set the wal_level configuration parameter to
replica or higher, archive_mode to on, and specify the shell command to use
in the archive_command configuration parameter.

In archive_command, %p is replaced by the path name of the file to archive,
while %f is replaced by only the file name. The simplest useful command is
something like:

Plaintext

archive_command = 'test ! -f /mnt/server/archivedir/%f && cp %p
/mnt/server/archivedir/%f'  # Unix
archive_command = 'copy "%p" "C:\\server\\archivedir\\%f"'  # Windows

IMPORTANT: The simple cp or copy commands return success as soon as the data
is written to the operating system cache. However, PostgreSQL does not
automatically force these archived files to be flushed to physical storage.
In the event of an unexpected system shutdown or power failure, files
confirmed as archived might be lost from the OS cache, leading to data loss
in Point-in-Time Recovery (PITR) or issues with pg_rewind.

To ensure data persistence, the command should include a step to synchronize
the file to disk. On Unix/Linux systems, a more robust command is:

Plaintext

archive_command = 'test ! -f /mnt/server/archivedir/%f && cp %p
/mnt/server/archivedir/%f && sync /mnt/server/archivedir/%f'

The sync command must operate at the file level, and sync version should be
validated to not write the entire cache. Alternatively, the destination file
system can be mounted with the sync option in the system configuration
(e.g., /etc/fstab), though this may impact overall performance.
Administrators must validate the durability and behavior of the chosen
archival method on their specific operating system and storage architecture,
as file system synchronization guarantees vary significantly across
platforms.

After the %p and %f parameters have been replaced, the actual command
executed might look like this:

test ! -f /mnt/server/archivedir/00000001000000A900000065 && cp
pg_wal/00000001000000A900000065
/mnt/server/archivedir/00000001000000A900000065 && sync
/mnt/server/archivedir/00000001000000A900000065

A similar command will be generated for each new file to be archived.

^ permalink  raw  reply  [nested|flat] 2+ messages in thread

* Re: Regarding the archive_command in "continuois archiving" chapter.
  2026-05-05 14:37 Regarding the archive_command in "continuois archiving" chapter. PG Doc comments form <noreply@postgresql.org>
@ 2026-05-05 18:33 ` David G. Johnston <david.g.johnston@gmail.com>
  0 siblings, 0 replies; 2+ messages in thread

From: David G. Johnston @ 2026-05-05 18:33 UTC (permalink / raw)
  To: dario_d_s@unitech.com.ar; pgsql-docs@lists.postgresql.org

On Tue, May 5, 2026 at 11:19 AM PG Doc comments form <noreply@postgresql.org>
wrote:

> The cp command in Linux should be treated as a stub;

The documentation says as much:

 (This is an example, not a recommendation, and might not work on all
platforms.)

> This what would look like (I've used ai to help me that this gets written
> in
> proper english)
>

Getting it to update the sgml and produce a real diff file would be
considerably more helpful.

I do think the point here is to be so simple as to be obviously incomplete
and thus encourage the reader to find a proven backup system to plug into
here instead of writing their own script.

PostgreSQL consistently tries to avoid being inherently too platform and
process dependent and this is one of the key places where the community,
interested in their platforms and operating systems, are expected to
largely step in.

That all said, patching this part of the documentation isn't off the table.
(Though given prior comments like this exist and it hasn't been updated
suggests a low priority if nothing else - especially without a concrete
sgml patch.)

David J.

^ permalink  raw  reply  [nested|flat] 2+ messages in thread

end of thread, other threads:[~2026-05-05 18:33 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed)
-- links below jump to the message on this page --
2026-05-05 14:37 Regarding the archive_command in "continuois archiving" chapter. PG Doc comments form <noreply@postgresql.org>
2026-05-05 18:33 ` David G. Johnston <david.g.johnston@gmail.com>

This inbox is served by agora; see mirroring instructions
for how to clone and mirror all data and code used for this inbox