Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1wCtGJ-002OzZ-1E for pgsql-bugs@arkaria.postgresql.org; Wed, 15 Apr 2026 05:58:07 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.96) (envelope-from ) id 1wCtGH-00ErNk-1p for pgsql-bugs@arkaria.postgresql.org; Wed, 15 Apr 2026 05:58:06 +0000 Received: from magus.postgresql.org ([2a02:c0:301:0:ffff::29]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1wCtGH-00ErNb-0s for pgsql-bugs@lists.postgresql.org; Wed, 15 Apr 2026 05:58:06 +0000 Received: from mail-pl1-x629.google.com ([2607:f8b0:4864:20::629]) by magus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.98.2) (envelope-from ) id 1wCtGG-00000001A9O-0X3N for pgsql-bugs@lists.postgresql.org; Wed, 15 Apr 2026 05:58:05 +0000 Received: by mail-pl1-x629.google.com with SMTP id d9443c01a7336-2b24fdac394so57663615ad.3 for ; Tue, 14 Apr 2026 22:58:03 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1776232682; cv=none; d=google.com; s=arc-20240605; b=Ja8ktPYF3JrbYqjq8adoDW3vps4NnBbz0Vz2Eb6/4xfkPxNmKjI6iqc5tUVTVG7tKe OcIxF6cbo69qp9c+9QEQajJwZMZ2gLM4GKLlVAzTGP1RKlzOJYFRazNWezZMlY3EGG2u c8kp2OLN53juzCOIKuQJIAom7/W4+k+eU9wF1VVSsX0eQKhtZnIqGuMrS0rZPiIALSRG va+mqpoKYg0CKXD45JCZBVqHWmGMoAstSVO/u3R4tRpDY5a9jSifxB46moRfdx9WFPoY 7HTMzHUn+4T0agYm19kUYYZidshi04pS1oVQEXbewHrau+B5hztazljkrpKeCgf9XMJB g2lQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20240605; h=cc:to:subject:message-id:date:from:reply-to:in-reply-to:references :mime-version:dkim-signature; bh=honIBuJ0ISm0wd7xIQ5sCAN79+V/oED7H6H+Hhaee0E=; fh=dZDhK1EsXBsHcNAO8mn0AplJL6RzibeFy5o0WrIPM3s=; b=Ex4Q6jE6t3rbWka+itgHNHmPM/NPYK1vUaiVPmbaNwpVtxjAw+XIfNzDwn542jZow6 F5Wd2DErNMch+ASllYGQaRo8xG7k7oOk9hmFovyVSNZbBaV2pth8gN7k/pGaM2MjltN/ /FTLaUc0pgEwyq/cbkzLWiPxM2f55jCIYSuvWXS5Q63+hacnN8emGSu8GbDfQU/I5jIC yvcMIQxoQIEzGDWcPue7o5f+sSQV+uMyzEbqtaqQf4OqVNRziW2v6Mx3K38CoepmDYkl 7ZYOVY18+RJE4RajPmewzeZRmW7YE9iMksOgLq7n/3hJqtQmQrnqiYiv0lJwyWS4s8k2 6peg==; darn=lists.postgresql.org ARC-Authentication-Results: i=1; mx.google.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20251104; t=1776232682; x=1776837482; darn=lists.postgresql.org; h=cc:to:subject:message-id:date:from:reply-to:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=honIBuJ0ISm0wd7xIQ5sCAN79+V/oED7H6H+Hhaee0E=; b=B4NoJs3cz7MdlOTAt8oqlruQgLOaR0gFVNwY921bIS6dFrSiWD1je+5ua0Oh7FWe// N6m0oUoUHx2LaQekkk9esEG7ny3Lmxf1QeM8Zv+bTlKTvFm4Hz2u0dSJU4Y2nWL+1g3W g7YuNYxxC1/vdLQgQSBGBBgVDQcZkl2P8fZzNx0gFaeFYyH4mkMoEwnMsojuCuivWnrr QdLMUQ81uvKHQpBwzYiUehDdauhUyBUaGJqXlHJwa5jryeN3BYN7LDr5jwpRms/B36tG aeku+0GU99ULzRH1wQvSTkNY3XyWLVmzq2Oyn/MFic33gekxqxS7Mn+VbYvLrM6M+wTb dJeg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20251104; t=1776232682; x=1776837482; h=cc:to:subject:message-id:date:from:reply-to:in-reply-to:references :mime-version:x-gm-gg:x-gm-message-state:from:to:cc:subject:date :message-id:reply-to; bh=honIBuJ0ISm0wd7xIQ5sCAN79+V/oED7H6H+Hhaee0E=; b=jRf8obpigY0OyNAnddCUTnBcSYxqQcGwfdli5lA4Tjxazd9edkjN0fcOkg5w5rWdyI dFnLCUOGu8uXgs5ZlqTpySQ0qRAKKdUVt6oFGyxMA6QTZ/6GxyrBpcz83r9p4MSBMoQ/ 5/TQI2tGTCeXM87oZjs0FDmBkIXMEESYTjNy7+vRslq0kCcH5/2ycYol29HbrZdnsOWt 9FTnKxh+Neojva7yR4wU+qiGyItE70M9VECILei8fY+pZawwRRCV9uWECBbx/qNDD+6m gFRT2XabcKDlUmSoE2yA9vbZnAJV7hZios+d8n570/0i6bKu0Sxfclutu6kseab0MzZP TaBg== X-Forwarded-Encrypted: i=1; AFNElJ/wp2f6nwJJ0NyAQDxAGvYwridRZrktEJ1HHEh32KpKNaaSZNeRF14HMz8LQdD+k4BTM+Vw3Iur8fau@lists.postgresql.org X-Gm-Message-State: AOJu0Yxyy8Sp9qr+BjMrohy+xkHqmX+Chtc6Z6+e+WXAn97v2xLxDQks sFHpMRnN6m5LRVxJFTL7FfKLOeRNnw1os7gzLsemWDXzku4bBw3FgyefBlfZuL9KkzbP61jNIVl c4yUVdc2h3pcePRJXHz8+H3Eyt1ZhHZ8= X-Gm-Gg: AeBDieu3Epi5zT+kwJE9tnXSgCmPRZ8Jpr2eRQZPHxryZ2aofbfhC7AN37yJnwwjnEq Y3HSl1N/EeUvQjHEg6QxF2lpEhr3INJTXGZUz+k2ePuQ1YjgFBjReTU7mmXrvzh9BLno5u466yf 108zg9iiATpQIOhWKLw929LGghQXPrMqASan1cWltbvwACYiL+RLIUiYNHZh2TJFHJSxcPy3GBV cY42KM/Va8CyN3mMgfaN4pdRjGez2sIxJcDTA/G+o88fC4L7KI0ZrXcwtHa9m75zSEZMw5OOVXl 4f+TDgc00WTTnQB4WnsX41k/BUXNDqD5gMRRV14XuDkFx9tR8Q== X-Received: by 2002:a17:902:ebd1:b0:2b2:ac6f:f87 with SMTP id d9443c01a7336-2b2d5aa648bmr225882775ad.44.1776232682111; Tue, 14 Apr 2026 22:58:02 -0700 (PDT) MIME-Version: 1.0 References: <19354-eefe6d8b3e84f9f2@postgresql.org> <2292889.1765846569@sss.pgh.pa.us> <2393116.1765899706@sss.pgh.pa.us> <6a8122ac-123d-4e93-9269-0b3be1e4a5a4@iki.fi> In-Reply-To: Reply-To: assam258@gmail.com From: Henson Choi Date: Wed, 15 Apr 2026 14:57:50 +0900 X-Gm-Features: AQROBzCBwr5ic_GXDPQnB3jUHDvHlnpYTv4qv1hh8k5pNtNvngYYfhnbjoD1hSU Message-ID: Subject: Re: BUG #19354: JOHAB rejects valid byte sequences To: Thomas Munro Cc: Heikki Linnakangas , Robert Haas , Tom Lane , Jeroen Vermeulen , VASUKI M , pgsql-bugs@lists.postgresql.org Content-Type: multipart/mixed; boundary="0000000000000e63c9064f79667a" List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk --0000000000000e63c9064f79667a Content-Type: multipart/alternative; boundary="0000000000000e63c8064f796678" --0000000000000e63c8064f796678 Content-Type: text/plain; charset="UTF-8" Subject: Fix and expand comments for Korean encodings in encnames.c Hi hackers, While reading through the encoding alias table in src/common/encnames.c, I noticed a few long-standing inaccuracies and omissions in the per-entry comments for the three Korean encodings. The most visible issue is the JOHAB entry, whose comment describes it as "Extended Unix Code for simplified Chinese" -- apparently a copy/paste slip from a neighboring EUC entry. JOHAB is in fact the Korean combining-style encoding defined in KS X 1001 annex 3. The attached 0002 patch makes comment-only adjustments to the three Korean encodings: * JOHAB: replace the incorrect "simplified Chinese" description with a correct one that identifies it as the Korean combining (Johab) encoding standardized in KS X 1001 annex 3. * EUC_KR: drop a stray space before the comma in the existing comment, and note that the encoding covers the KS X 1001 precomposed (Wansung) form. * UHC: spell out "Unified Hangul Code", clarify that it is Microsoft Windows CodePage 949, and describe its relationship to EUC-KR (superset covering all 11,172 precomposed Hangul syllables). No behavior change, no catalog change, no pg_wchar.h change -- this touches comments in src/common/encnames.c only. pgindent is clean. Thanks, Henson Choi --0000000000000e63c8064f796678 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Subject: Fix and expand comments for Korean encodings in e= ncnames.c

Hi hackers,

While reading through the encoding alia= s table in src/common/encnames.c,
I noticed a few long-standing inaccura= cies and omissions in the per-entry
comments for the three Korean encodi= ngs.

The most visible issue is the JOHAB entry, whose comment descri= bes it as
"Extended Unix Code for simplified Chinese" -- appar= ently a copy/paste
slip from a neighboring EUC entry.=C2=A0 JOHAB is in = fact the Korean
combining-style encoding defined in KS X 1001 annex 3.
The attached 0002 patch makes comment-only adjustments to the threeKorean encodings:

=C2=A0 * JOHAB: replace the incorrect "simp= lified Chinese" description with
=C2=A0 =C2=A0 a correct one that i= dentifies it as the Korean combining (Johab)
=C2=A0 =C2=A0 encoding stan= dardized in KS X 1001 annex 3.

=C2=A0 * EUC_KR: drop a stray space b= efore the comma in the existing
=C2=A0 =C2=A0 comment, and note that the= encoding covers the KS X 1001
=C2=A0 =C2=A0 precomposed (Wansung) form.=

=C2=A0 * UHC: spell out "Unified Hangul Code", clarify th= at it is
=C2=A0 =C2=A0 Microsoft Windows CodePage 949, and describe its = relationship to
=C2=A0 =C2=A0 EUC-KR (superset covering all 11,172 preco= mposed Hangul syllables).

No behavior change, no catalog change, no = pg_wchar.h change -- this
touches comments in src/common/encnames.c only= . =C2=A0pgindent is clean.

Thanks,
Henson Choi
--0000000000000e63c8064f796678-- --0000000000000e63c9064f79667a Content-Type: text/plain; charset="US-ASCII"; name="0002-Fix-and-expand-comments-for-Korean-encodings.txt" Content-Disposition: attachment; filename="0002-Fix-and-expand-comments-for-Korean-encodings.txt" Content-Transfer-Encoding: base64 Content-ID: X-Attachment-Id: f_mnzn1x1e0 RnJvbSBjN2E3MzM1ZDJjZjVhMjg4MWIyNWQ5MDkxZmQwMjBhMmQ2MmY3NjYxIE1vbiBTZXAgMTcg MDA6MDA6MDAgMjAwMQpGcm9tOiBIZW5zb24gQ2hvaSA8YXNzYW0yNThAZ21haWwuY29tPgpEYXRl OiBXZWQsIDE1IEFwciAyMDI2IDE0OjUyOjM1ICswOTAwClN1YmplY3Q6IFtQQVRDSCB2MV0gRml4 IGFuZCBleHBhbmQgY29tbWVudHMgZm9yIEtvcmVhbiBlbmNvZGluZ3MgaW4gZW5jbmFtZXMuYwoK LS0tCiBzcmMvY29tbW9uL2VuY25hbWVzLmMgfCAxMyArKysrKysrKy0tLS0tCiAxIGZpbGUgY2hh bmdlZCwgOCBpbnNlcnRpb25zKCspLCA1IGRlbGV0aW9ucygtKQoKZGlmZiAtLWdpdCBhL3NyYy9j b21tb24vZW5jbmFtZXMuYyBiL3NyYy9jb21tb24vZW5jbmFtZXMuYwppbmRleCA5MDg1ZGJlY2Nl MS4uOTU5Yjk5MWRkZTQgMTAwNjQ0Ci0tLSBhL3NyYy9jb21tb24vZW5jbmFtZXMuYworKysgYi9z cmMvY29tbW9uL2VuY25hbWVzLmMKQEAgLTYxLDggKzYxLDkgQEAgc3RhdGljIGNvbnN0IHBnX2Vu Y25hbWUgcGdfZW5jbmFtZV90YmxbXSA9CiAJCQkJCQkJCSAqIEphcGFuZXNlLCBzdGFuZGFyZCBP U0YgKi8KIAl7CiAJCSJldWNrciIsIFBHX0VVQ19LUgotCX0sCQkJCQkJCS8qIEVVQy1LUjsgRXh0 ZW5kZWQgVW5peCBDb2RlIGZvciBLb3JlYW4gLCBLUwotCQkJCQkJCQkgKiBYIDEwMDEgc3RhbmRh cmQgKi8KKwl9LAkJCQkJCQkvKiBFVUMtS1I7IEV4dGVuZGVkIFVuaXggQ29kZSBmb3IgS29yZWFu CisJCQkJCQkJCSAqIHByZWNvbXBvc2VkIChXYW5zdW5nKSBlbmNvZGluZywgc3RhbmRhcmQgS1MK KwkJCQkJCQkJICogWCAxMDAxICovCiAJewogCQkiZXVjdHciLCBQR19FVUNfVFcKIAl9LAkJCQkJ CQkvKiBFVUMtVFc7IEV4dGVuZGVkIFVuaXggQ29kZSBmb3IKQEAgLTExOSw4ICsxMjAsOCBAQCBz dGF0aWMgY29uc3QgcGdfZW5jbmFtZSBwZ19lbmNuYW1lX3RibFtdID0KIAl9LAkJCQkJCQkvKiBJ U08tODg1OS05OyBSRkMxMzQ1LEtYUzIgKi8KIAl7CiAJCSJqb2hhYiIsIFBHX0pPSEFCCi0JfSwJ CQkJCQkJLyogSk9IQUI7IEV4dGVuZGVkIFVuaXggQ29kZSBmb3Igc2ltcGxpZmllZAotCQkJCQkJ CQkgKiBDaGluZXNlICovCisJfSwJCQkJCQkJLyogSk9IQUI7IEtvcmVhbiBjb21iaW5pbmcgKEpv aGFiKSBlbmNvZGluZywKKwkJCQkJCQkJICogc3RhbmRhcmQgS1MgWCAxMDAxIGFubmV4IDMgKi8K IAl7CiAJCSJrb2k4IiwgUEdfS09JOFIKIAl9LAkJCQkJCQkvKiBfZGlydHlfIGFsaWFzIGZvciBL T0k4LVIgKGJhY2t3YXJkCkBAIC0xODYsNyArMTg3LDkgQEAgc3RhdGljIGNvbnN0IHBnX2VuY25h bWUgcGdfZW5jbmFtZV90YmxbXSA9CiAJfSwJCQkJCQkJLyogYWxpYXMgZm9yIFdJTjEyNTggKi8K IAl7CiAJCSJ1aGMiLCBQR19VSEMKLQl9LAkJCQkJCQkvKiBVSEM7IEtvcmVhbiBXaW5kb3dzIENv ZGVQYWdlIDk0OSAqLworCX0sCQkJCQkJCS8qIFVIQzsgVW5pZmllZCBIYW5ndWwgQ29kZSwgTWlj cm9zb2Z0IFdpbmRvd3MKKwkJCQkJCQkJICogQ29kZVBhZ2UgOTQ5OyBzdXBlcnNldCBvZiBFVUMt S1IgY292ZXJpbmcKKwkJCQkJCQkJICogYWxsIDExLDE3MiBwcmVjb21wb3NlZCBIYW5ndWwgc3ls bGFibGVzICovCiAJewogCQkidW5pY29kZSIsIFBHX1VURjgKIAl9LAkJCQkJCQkvKiBhbGlhcyBm b3IgVVRGOCAqLwotLSAKMi41MC4xIChBcHBsZSBHaXQtMTU1KQoK --0000000000000e63c9064f79667a--