
Bug#211889: marked as done (apache: Default Content-encoding of iso-8859-1 is a hindrance)



Your message dated Thu, 22 Apr 2004 19:48:42 +0200 (CEST)
with message-id <Pine.LNX.4.58.0404221928450.6402@trider-g7.ext.fabbione.net>
and subject line Default charset is configurable
has caused the attached Bug report to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what I am
talking about this indicates a serious mail system misconfiguration
somewhere.  Please contact me immediately.)

Debian bug tracking system administrator
(administrator, Debian Bugs database)

--------------------------------------
Received: (at submit) by bugs.debian.org; 21 Sep 2003 01:44:31 +0000
>From moof@moof.org.uk Sat Sep 20 20:44:29 2003
Return-path: <moof@moof.org.uk>
Received: from 33.red-80-33-53.pooles.rima-tde.net (perrin.pin.moof.org.uk) [80.33.53.33] 
	by master.debian.org with esmtp (Exim 3.35 1 (Debian))
	id 1A0sFq-0006Wp-00; Sat, 20 Sep 2003 19:39:10 -0500
Received: from moof by perrin.pin.moof.org.uk with local (Exim 3.36 #1 (Debian))
	id 1A0sFn-0001Zu-00; Sun, 21 Sep 2003 02:39:07 +0200
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset="ISO-8859-15"
From: Giles Antonio Radford <moof@moof.org.uk>
To: Debian Bug Tracking System <submit@bugs.debian.org>
Subject: apache: Default Content-encoding of iso-8859-1 is a hindrance
X-Mailer: reportbug 2.20
Date: Sun, 21 Sep 2003 02:38:57 +0200
Message-Id: <E1A0sFn-0001Zu-00@perrin.pin.moof.org.uk>
Delivered-To: submit@bugs.debian.org
X-Spam-Status: No, hits=-1.0 required=4.0
	tests=BAYES_60,HAS_PACKAGE,HTML_MESSAGE
	version=2.53-bugs.debian.org_2003_9_20
X-Spam-Level: 
X-Spam-Checker-Version: SpamAssassin 2.53-bugs.debian.org_2003_9_20 (1.174.2.15-2003-03-30-exp)

Package: apache
Version: 1.3.27.1-3
Severity: normal


Since 1.3.12, the Apache httpd.conf file has had the following lines in it:
(line 821 of my file, though these things change)
---------
# Default charset to iso-8859-1 (http://www.apache.org/info/css-security/).
    
AddDefaultCharset on
---------

This is very annoying. A lot of web browsers incorrectly assume the server
knows what it's talking about, which is not always the case, so they trust
the charset in the HTTP Content-Type header over the one in the document.
As a result my nice UTF-8 HTML documents (which all declare their encoding
in a <meta> tag) turn out mangled, because my web browsers (in this case
Mozilla 1.5, Firebird 0.6, and Internet Explorer 6) display them as
ISO-8859-1.
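
The mangling described above can be reproduced outside a browser. The
following is a minimal Python sketch, not taken from the report: the helper
names header_charset and effective_charset and the sample text are
hypothetical, and the "header wins over <meta>" rule is a simplified model
of the browser behaviour the report describes.

```python
from email.message import Message


def header_charset(content_type):
    """Return the charset parameter of a Content-Type value, or None."""
    msg = Message()
    msg["Content-Type"] = content_type
    return msg.get_content_charset()


def effective_charset(content_type, meta_charset):
    """Hypothetical model: a charset in the HTTP header beats the <meta> tag."""
    return header_charset(content_type) or meta_charset


# A UTF-8 document as stored on disk (sample text is made up).
page_bytes = "café".encode("utf-8")

# The server appends charset=iso-8859-1; the page itself declares utf-8.
charset = effective_charset("text/html; charset=iso-8859-1", "utf-8")
mangled = page_bytes.decode(charset)
print(mangled)  # cafÃ©

# With no charset in the header, the <meta> declaration is used instead.
intact = page_bytes.decode(effective_charset("text/html", "utf-8"))
print(intact)  # café
```

Each two-byte UTF-8 sequence is shown as two ISO-8859-1 characters, which
is the kind of damage reported here.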

I suppose the first thing to mention is that iso-8859-1 is obsolete,
replaced by iso-8859-15, which includes the Euro character instead of the
less often used generic currency sign at 0xA4. At the very least the default
character set should be changed to that, as it's more commonly in use these
days.

At the same time, that would still leave in the lurch users who do not use
ISO-8859-15, or who have several different encodings on the hard drive.
mod_cern_meta can take care of the latter lot fairly easily, but I'd like
to suggest the following instead:

Add a debconf option at installation time to choose which of the various
encodings should be the default, or to turn this behaviour off.

Alternatively, set it at installation time from the user's locale settings.
It may not be correct all of the time, but it'll probably be more correct
most of the time.

Thanks,
Moof - who spent four hours tonight trying to work out why his UTF-8
documents weren't displaying properly.
-- System Information:
Debian Release: testing/unstable
Architecture: i386
Kernel: Linux perrin 2.4.21-ac4 #1 SMP Sat Jul 26 01:18:02 CEST 2003 i686
Locale: LANG=en_IE@euro, LC_CTYPE=en_IE@euro

Versions of packages apache depends on:
ii  apache-common                 1.3.27.1-3 Support files for all Apache webse
ii  debconf                       1.3.4      Debian configuration management sy
ii  dpkg                          1.10.10    Package maintenance system for Deb
ii  libc6                         2.3.2-3    GNU C Library: Shared libraries an
ii  libdb4.1                      4.1.25-4   Berkeley v4.1 Database Libraries [
ii  libexpat1                     1.95.6-4   XML parsing C library - runtime li
ii  libmagic1                     4.02-4     File type determination library us
ii  logrotate                     3.6.5-2    Log rotation utility
ii  mime-support                  3.23-1     MIME files 'mime.types' & 'mailcap
ii  perl [perl5]                  5.8.0-18   Larry Wall's Practical Extraction 

-- debconf information:
  apache/enable-suexec: false


---------------------------------------
Received: (at 211889-done) by bugs.debian.org; 22 Apr 2004 17:48:45 +0000
>From fabbione@fabbione.net Thu Apr 22 10:48:45 2004
Return-path: <fabbione@fabbione.net>
Received: from port5.ds1-sby.adsl.cybercity.dk (trider-g7.fabbione.net) [212.242.169.198] 
	by spohr.debian.org with esmtp (Exim 3.35 1 (Debian))
	id 1BGiJZ-0006VC-00; Thu, 22 Apr 2004 10:48:45 -0700
Received: from trider-g7.ext.fabbione.net (port5.ds1-sby.adsl.cybercity.dk [212.242.169.198])
	by trider-g7.fabbione.net (Postfix) with ESMTP id 53F1516
	for <211889-done@bugs.debian.org>; Thu, 22 Apr 2004 19:48:42 +0200 (CEST)
Date: Thu, 22 Apr 2004 19:48:42 +0200 (CEST)
From: Fabio Massimo Di Nitto <fabbione@fabbione.net>
Sender: fabbione@fabbione.net
To: 211889-done@bugs.debian.org
Subject: Default charset is configurable
Message-ID: <Pine.LNX.4.58.0404221928450.6402@trider-g7.ext.fabbione.net>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Delivered-To: 211889-done@bugs.debian.org
X-Spam-Checker-Version: SpamAssassin 2.60-bugs.debian.org_2004_03_25 
	(1.212-2003-09-23-exp) on spohr.debian.org
X-Spam-Status: No, hits=-2.0 required=4.0 tests=BAYES_00 autolearn=no 
	version=2.60-bugs.debian.org_2004_03_25
X-Spam-Level: 
X-CrossAssassin-Score: 1


Hi,
	please check the documentation for AddDefaultCharset here:

http://httpd.apache.org/docs/mod/core.html#adddefaultcharset

The relevant part, which I did not notice myself when reviewing the
default configuration, is that it now supports not only on|off but also a
default charset. For example:

AddDefaultCharset utf-8

so you have to set that yourself.
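
The three forms of the directive (on, off, or a charset name) can be
pictured with a small Python model. This is a sketch under stated
assumptions, not Apache's code: apply_default_charset is a hypothetical
helper, and it assumes the directive only adds a charset when the
Content-Type carries none. The real server applies further conditions on
which responses qualify, so the linked documentation remains the
authority.

```python
def apply_default_charset(content_type, setting):
    """Hypothetical model of AddDefaultCharset applied to a Content-Type value.

    Assumption: a charset is only added when the header has none already.
    """
    setting = setting.strip()
    if setting.lower() == "off":
        # Directive disabled: the header is left untouched.
        return content_type
    if "charset=" in content_type.lower():
        # An explicit charset is already present; do not override it.
        return content_type
    # "on" selects the built-in default; anything else is taken as a name.
    charset = "iso-8859-1" if setting.lower() == "on" else setting
    return f"{content_type}; charset={charset}"


for setting in ("on", "utf-8", "off"):
    print(setting, "->", apply_default_charset("text/html", setting))
```

Under this model, a site whose documents are all UTF-8 would use the
charset-name form, while a site with mixed encodings would more likely
want off, so that each document's own declaration decides.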

Our policies have changed and we no longer mangle the configuration
files. The default charset for error pages is iso-8859-1, and according to:

http://www.apache.org/info/css-security/

there might be security implications, so the apache packages will not
change the default settings for a standard installation.

Users are free to change it according to their requirements at any time.

Thanks
Fabio

-- 
<user> fajita: step one
<fajita> Whatever the problem, step one is always to look in the error log.
<user> fajita: step two
<fajita> When in danger or in doubt, step two is to scream and shout.


