Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1wK0BV-000U6p-2p for pgsql-general@arkaria.postgresql.org; Mon, 04 May 2026 20:46:34 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.96) (envelope-from ) id 1wK0BU-008uSB-2l for pgsql-general@arkaria.postgresql.org; Mon, 04 May 2026 20:46:32 +0000 Received: from magus.postgresql.org ([2a02:c0:301:0:ffff::29]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1wK0AL-008r7A-0r for pgsql-general@lists.postgresql.org; Mon, 04 May 2026 20:45:21 +0000 Received: from mail-wr1-x434.google.com ([2a00:1450:4864:20::434]) by magus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.98.2) (envelope-from ) id 1wK0AJ-00000000Vg8-1Com for pgsql-general@lists.postgresql.org; Mon, 04 May 2026 20:45:21 +0000 Received: by mail-wr1-x434.google.com with SMTP id ffacd0b85a97d-44da2de25f3so990833f8f.1 for ; Mon, 04 May 2026 13:45:19 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cybertec.at; s=google; t=1777927518; x=1778532318; darn=lists.postgresql.org; h=mime-version:user-agent:content-transfer-encoding:references :in-reply-to:date:cc:to:from:subject:message-id:from:to:cc:subject :date:message-id:reply-to; bh=7GroRxO3NeR0GlrZJ6JqQJ9IBWj5GKjQbqgPhq7dvkE=; b=dQSR+u2dBQsiQpiNyp+sEjuSKw5wKYmXQ4Go5PqZ4t9IbuPC59UsZKT9tbrcmLV4uF AyWZ1NK3Oiul7s33407RYRtkZbfRbt3OOk28X/snvVRf5uXTqiTtmtZl0hscCzF5YScj IK+sRPJgvzfnAsEr6yC5oHHATO0AWdjHs+YJkpFzbXoerm3vFcCIf4WHLqGnqTCHsRiU kDpj5YF5RpgQ01Bs++fGQ9nQpWoKJlP4nXuZ/mPHUCthxsO1PxBTAV9Qe+7i5s5ZJC8j k/Zq/EpBPCVl9952HUCRo8Sw7a2HQc4aF8TLoDneMAugqCgfM+5VOZDfZ+4Ss7zbMK3Y zLiQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20251104; t=1777927518; x=1778532318; h=mime-version:user-agent:content-transfer-encoding:references :in-reply-to:date:cc:to:from:subject:message-id:x-gm-gg :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=7GroRxO3NeR0GlrZJ6JqQJ9IBWj5GKjQbqgPhq7dvkE=; b=ju3n3EkTSC79YIimzl7QioTnmAPVA8UoLB7bVah/2pYNmjxN52d2psnf7JqC3gVuR7 w3jphVsOi6Q2Slg2owMN8KQij0PyZuRawyByNp8xKcXlGiYo+LTttBAmrwCy/YDiCGQd DHy6yzbdUI0w8KfGIPpXSSbB6V5r5fsUNZMPgpGHdKHGnzeHxfqq20wAiyajFxGf/cXy dceigbGIZRlAi/6i1DSv3M2td4ZdK8T1HDNUY+Kb3F49gJZehbF7xR/9fp7v9v0VBDsX T+V+mzBX9y0CxXeGUiIHiZisqqTXsVRLvYYSCzQ0WkXnj6rKXaM5atLdqeHCurG1sg/T DY4A== X-Forwarded-Encrypted: i=1; AFNElJ8rbXUWRH2nosaVnWKPRCOd4lo/EFwb2F8TgT1BOOI4AtyWut16CjL81n8mEuvhHjUXfkIMIh4rSjph8E/f@lists.postgresql.org X-Gm-Message-State: AOJu0Yx4QrQWiKv5dCSvBYz9LxzWT8RqgxsOAN6gxUab9WKoncTaGNT/ zevPKcR7PMl5mVMYDU43bU+EgDZ5AROcgnUFlmQrcqJH/wpcWpiTFi2D6ENDJ4R+00w= X-Gm-Gg: AeBDiev/CG6AdlaTiicio2dDzMqCFo+o1VGKDnROjenybJXqJY5BuXadXUyZ/tAaL19 qxeFHFkwsetk1+KBDcB45wpCEOaZ5P1GXsY6Klj9igruKm4mcCuALHOxzagPI2APzt6zN9M8I+n tkKrEXaELAxi7YA5NBlkORVE7LJCqkvc0Pj6B8jVASRN3RxUKfGMMwW7hUhJtDXDIyNgGvHFSu6 OPljxHJ/3FNspYSVy46R3irIWZiW7VHaWCo6b+WKAroGz9QkN2l7EhTc93ttuW8IsMC7LZasMN2 qrJ1YB9/ODfY+mqGCRjil7iu/yxhcshoVu5gnk/MWrQVyuBzyQx6WuLwnD5UU/Pe0vRS3LppmjI /f0d2Xp1nZke0c8vsUGD8ERcy1k1xhLg9I33W7w2kNUPZX/jVL9e7miPotl/ltDjrkhxgNBY7Vo xH9bq4OyQkzYND7lb92NYQ0gDdLI//iA4aKGM+2uA/6JZm82RTyqdfIx6Q6aob5/0Ku+JCObFRC Ls= X-Received: by 2002:a05:6000:430a:b0:43d:613:4036 with SMTP id ffacd0b85a97d-44bb62fbc2dmr18170987f8f.37.1777927518063; Mon, 04 May 2026 13:45:18 -0700 (PDT) Received: from laurenz.albe-K4N0CV00F97414D ([2001:871:260:1fd7:eaff:6acc:1732:cc8e]) by smtp.gmail.com with ESMTPSA id ffacd0b85a97d-4501035b3b7sm460263f8f.34.2026.05.04.13.45.17 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 04 May 2026 13:45:17 -0700 (PDT) Message-ID: Subject: Re: Choosing default collation/ctype From: Laurenz Albe To: Daniel Verite Cc: Ron Johnson , pgsql-general@lists.postgresql.org Date: Mon, 04 May 2026 22:45:16 +0200 In-Reply-To: <627add7e-94df-49ca-aa12-ae3900b7945f@manitou-mail.org> References: <627add7e-94df-49ca-aa12-ae3900b7945f@manitou-mail.org> Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable User-Agent: Evolution 3.58.3 (3.58.3-1.fc43) MIME-Version: 1.0 List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk On Mon, 2026-05-04 at 21:34 +0200, Daniel Verite wrote: > Laurenz Albe wrote: >=20 > > > Then choose UTF8. > >=20 > > Right!=C2=A0 And I recommend "C" for the collation. >=20 > Yet the "C" collation is unsuitable for handling character types > beyond ASCII. > For instance, it considers that accented letters are not letters, > so upper('=C3=A9t=C3=A9') is '=C3=A9T=C3=A9' instead of '=C3=89T=C3=89', = and '=C3=A9' ~ '\w' is false. >=20 > C.UTF-8 solves that, and since Postgres 17, it's available for all operat= ing > systems with the builtin provider. > So if you target Postgres 17+, C.UTF-8 from the builtin provider is > a better choice for UTF-8 databases than "C" . Yes, "builtin" and the "C" collation is the best default value. Yours, Laurenz Albe