
Bug#872778: xterm -lc (with UTF-8 locale) cannot properly copy some utf-8 unicode chars



On Tue, Sep 18, 2018 at 04:37:00AM +0800, 積丹尼 Dan Jacobson wrote:
> All I know is
> U+7522 CJK UNIFIED IDEOGRAPH-7522
> U+9109 CJK UNIFIED IDEOGRAPH-9109
> show up as empty squares.

Agreed, but that wasn't the point of this particular bug report.

I just created a test script that prints the 5 codepoints mentioned to a
text file, and used xterm to copy and paste the result.  As I pointed
out, "select/paste should work".  It worked for me, so there may be some
locale dependency or resource setting that is causing the problem
you reported.

(script attached)

-- 
Thomas E. Dickey <dickey@invisible-island.net>
https://invisible-island.net
ftp://ftp.invisible-island.net
#!/usr/bin/env perl
# Print each test codepoint three times, with its U+XXXX label and name,
# so the output can be selected and pasted in xterm.

use strict;
use warnings;

# Encode wide characters as UTF-8 on output.
binmode( STDOUT, ":encoding(UTF-8)" );

# show(CODEPOINT, NAME): print one line for the given codepoint.
sub show {
	my ( $code, $name ) = @_;
	my $show = sprintf( "U+%04X", $code );
	printf "%8s \"%c%c%c\" %s\n", $show, $code, $code, $code, $name;
}

show( 0x1f618, "FACE THROWING A KISS" );
show( 0x7522,  "CJK UNIFIED IDEOGRAPH-7522" );
show( 0x9109,  "CJK UNIFIED IDEOGRAPH-9109" );
show( 0x0192,  "LATIN SMALL LETTER F WITH HOOK" );
show( 0x266E,  "MUSIC NATURAL SIGN" );


