• ksh93u+m locale issue with select statement

    From Janis Papanagnou@3:633/10 to All on Sunday, January 11, 2026 05:51:26
    Kornshell doesn't seem to handle umlauts or other non-ASCII Unicode
    characters correctly with the 'select' statement; the display shows
    (for example)

    1) abcdefghijklmnopqrstuvwxyz 15) abcdefghijklmnopqrstuvwxyz
    2) abcdefghijklmnopqrstuvwxyz 16) abcdefghijklmnopqrstuvwxyz
    3) „bcdefghijklmnopqrstuvwxyz 17) ABCDEFGHIJKLMNOPQRSTUVWXYZ
    4) „bcdefghijklmn”pqrstuvwxyz 18) ABCDEFGHIJKLMNOPQRSTUVWXYZ
    5) „bcdefghijklmn”pqrstvwxyz 19) ŽBCDEFGHIJKLMNOPQRSTUVWXYZ
    6) abcd?fghijklĉnopqrstuvwxyz 20) ŽBCDEFGHIJKLMN™PQRSTUVWXYZ
    7) abcd?fghijklmnopqrstuvwxyz 21) ŽBCDEFGHIJKLMN™PQRSTšVWXYZ
    8) „bcdefghijklmn”pqrátvwxyz 22) ABCDEFGHIJKLMNOPQRSTUVWXYZ
    9) abcdefghijklmnopqrstuvwxyz 23) ABCDEFGHIJKLMNOPQRSTUVWXYZ
    10) abcdefghijklmnopqrstuvwxyz 24) ŽBCDEFGHIJKLMN™PQR?TšVWXYZ
    11) abcdefghijklmnopqrstuvwxyz 25) ABCDEFGHIJKLMNOPQRSTUVWXYZ
    12) abcdefghijklmnopqrstuvwxyz 26) ABCDEFGHIJKLMNOPQRSTUVWXYZ
    13) abcdefghijklmnopqrstuvwxyz 27) ABCDEFGHIJKLMNOPQRSTUVWXYZ
    14) abcdefghijklmnopqrstuvwxyz

    Somehow it obviously gets confused with the count of the number of
    characters and the count of the octets in the encoding, thus the
    formatting gets corrupted.

    (Playing with other locales doesn't change that effect.)

    Observed in ksh version AJM 93u+m/1.0.8 2024-01-01.

    (Bash handles that correctly.)

    Janis


    --- PyGate Linux v1.5.2
    * Origin: Dragon's Lair, PyGate NNTP<>Fido Gate (3:633/10)