[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Encoding problems with check_trans.pl



On Tue,16.Dec.08, 10:15:09, Bas Zoetekouw wrote:
> Hi Andrei!
> 
> You wrote:
> 
> > > Yes, the -g option is broken (or rather Perl is, by not properly
> > > detecting your locale).  The mails that are sent out should be fine
> > > though.  Could you please check those?
> > 
> > I'm checking with version 1.88 and still the same. Any more info you 
> > need?
> 
> Curious;  I checked it here and it works fine (both over smtp, and with
> -g, which I fixed yesterday).  Could you send me a (gzipped) version of
> the mail you've received?
> 
> Is anyone else seeing these kinds of problems?

Following a hunch I got for a while [the output with -g is the same as 
running 'iconv -f iso-8859-1 -t utf-8', (note: iso-8859-1, not 
iso-8859-2)] I tried this small patch

,----[  ]
| Index: check_trans.pl
| ===================================================================
| RCS file: /cvs/webwml/webwml/check_trans.pl,v
| retrieving revision 1.88
| diff -u -u -r1.88 check_trans.pl
| --- check_trans.pl	15 Dec 2008 18:02:22 -0000	1.88
| +++ check_trans.pl	16 Dec 2008 19:49:29 -0000
| @@ -522,7 +522,7 @@
| 		# and attach the body to the mail
| 		my $part = MIME::Lite->new(
| 			'Type' => 'text/plain',
| -			'Data' => encode('utf-8',$mailbody),
| +			'Data' => $mailbody,
| 		);
| 		$part->attr( 'content-type.charset' => 'utf-8' );
| 		$msg->attach( $part );
`----

and whaddaya know, it works for me!

check_trans_term.txt.gz contains a copy-paste from the terminal after a 
-g run and check_trans_mail.txt.gz is a copy of the file from my 
Maildir. My MTA is postfix (if it matters) and below the output of 
locale:

,----[ locale ]
| LANG=ro_RO.UTF-8
| LC_CTYPE="ro_RO.UTF-8"
| LC_NUMERIC="ro_RO.UTF-8"
| LC_TIME="ro_RO.UTF-8"
| LC_COLLATE="ro_RO.UTF-8"
| LC_MONETARY="ro_RO.UTF-8"
| LC_MESSAGES="ro_RO.UTF-8"
| LC_PAPER="ro_RO.UTF-8"
| LC_NAME="ro_RO.UTF-8"
| LC_ADDRESS="ro_RO.UTF-8"
| LC_TELEPHONE="ro_RO.UTF-8"
| LC_MEASUREMENT="ro_RO.UTF-8"
| LC_IDENTIFICATION="ro_RO.UTF-8"
| LC_ALL=
`----

Regards,
Andrei
-- 
If you can't explain it simply, you don't understand it well enough.
(Albert Einstein)

Attachment: signature.asc
Description: Digital signature


Reply to: