[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#602573: apt: Acquire::Languages ignored (in some locales)



On Sat, Nov 6, 2010 at 13:53, Jakub Wilk <jwilk@debian.org> wrote:
> * David Kalnischkies <kalnischkies+debian@gmail.com>, 2010-11-06, 13:00:
>>>
>>> LC_MESSAGES="POSIX"
>>
>> to clarify that a bit:
>> This setting is exposed to APT as LC_MESSAGES="C" and the
>> manpage apt.conf says: "If LC_MESSAGES is set to "C" only the
>> Translation-en file (if available) will be used."
>>
>>> # apt-config dump | grep Acquire::Languages
>>> Acquire::Languages "none";
>>
>> The next sentence is "To force apt to use no Translation file use
>> the setting Acquire::Languages=none."
>> So yes, your use case should work - its just not handled properly
>> as the special C-mode is handled before the configuration is
>> considered which is okay for all, but the "none" case.
>
> I'd say it's wrong (or at least counter intuive) for other cases as well.
> Consider this example:
>
> # rm -f /var/lib/apt/lists/*-[a-z][a-z]
>
> # LC_ALL=C apt-get update -o Acquire::Languages=pl | grep Translation
> Ign http://ftp.pl.debian.org/debian/ unstable/contrib Translation-en
> Ign http://ftp.pl.debian.org/debian/ unstable/main Translation-en
> Ign http://ftp.pl.debian.org/debian/ unstable/non-free Translation-en
> Ign http://ftp.pl.debian.org/debian/ experimental/contrib Translation-en
> Ign http://ftp.pl.debian.org/debian/ experimental/main Translation-en
> Ign http://ftp.pl.debian.org/debian/ experimental/non-free Translation-en

I personally interpret LC_ALL=C as an "exotic" calling-other-applications
setting and not a "the normal" case and Acquire::Languages=?? is
for me in this regard more a debugging thing, as this setting group is
mostly intended for multi-language systems which by definition need
more than one language - and set it up one time in the config file and
forget in future about it…

> # rm -f /var/lib/apt/lists/*-[a-z][a-z]
>
> # LC_ALL=de_DE.utf8 apt-get update -o Acquire::Languages=pl | grep
> Translation
> Ign http://ftp.pl.debian.org/debian/ unstable/contrib Translation-pl
> Hole:1 http://ftp.pl.debian.org/debian/ unstable/main Translation-pl [306
> kB]
> Ign http://ftp.pl.debian.org/debian/ unstable/non-free Translation-pl
> Ign http://ftp.pl.debian.org/debian/ experimental/contrib Translation-pl
> Ign http://ftp.pl.debian.org/debian/ experimental/main Translation-pl
> Ign http://ftp.pl.debian.org/debian/ experimental/non-free Translation-pl
>
> So the same command downloads English translation in POSIX locale, and
> Polish translation in German locale. Now, that doesn't make sense to me.


It makes more sense if you know that Translation-en will include in future
the long description which is now in the Packages file directly.
So we need to get the en file as e.g. "LANG=C apt-cache show apt" needs
pre and post this split out the same output.

But okay, lets move the complete Acquire::Language= section above
the LANG= exit as now that you say it, it could really have some
benefits for scripts…


>> Note through that you can see downloads of Translation files even if
>> you have LANG=C and none set correctly (and APT interpreting it
>> correctly): The reason is that APT will download Translation files for
>> languages it has previously downloaded
>
> Yeah, I noticed that while trying to understand what's going on here.  It's
> not something I'm happy about either...

As said, not optimal, but better than the alternatives…


Best regards

David Kalnischkies
=== modified file 'apt-pkg/aptconfiguration.cc'
--- apt-pkg/aptconfiguration.cc	2010-11-06 11:50:49 +0000
+++ apt-pkg/aptconfiguration.cc	2010-11-09 13:18:19 +0000
@@ -166,18 +166,6 @@
 	string const envShort = envLong.substr(0,lenShort);
 	bool envLongIncluded = true;
 
-	// first cornercase: LANG=C, so we use only "en" Translation
-	if (envLong == "C") {
-		if (_config->Find("Acquire::Languages","") != "none")
-			codes.push_back("en");
-		allCodes = codes;
-		allCodes.insert(allCodes.end(), builtin.begin(), builtin.end());
-		if (All == true)
-			return allCodes;
-		else
-			return codes;
-	}
-
 	// to save the servers from unneeded queries, we only try also long codes
 	// for languages it is realistic to have a long code translation file…
 	// TODO: Improve translation acquire system to drop them dynamic
@@ -218,37 +206,41 @@
 	// It is very likely we will need to environment codes later,
 	// so let us generate them now from LC_MESSAGES and LANGUAGE
 	std::vector<string> environment;
-	// take care of LC_MESSAGES
-	if (envLongIncluded == false)
-		environment.push_back(envLong);
-	environment.push_back(envShort);
-	// take care of LANGUAGE
-	const char *language_env = getenv("LANGUAGE") == 0 ? "" : getenv("LANGUAGE");
-	string envLang = Locale == 0 ? language_env : *(Locale+1);
-	if (envLang.empty() == false) {
-		std::vector<string> env = VectorizeString(envLang,':');
-		short addedLangs = 0; // add a maximum of 3 fallbacks from the environment
-		for (std::vector<string>::const_iterator e = env.begin();
-		     e != env.end() && addedLangs < 3; ++e) {
-			if (unlikely(e->empty() == true) || *e == "en")
-				continue;
-			if (*e == envLong || *e == envShort)
-				continue;
-			if (std::find(environment.begin(), environment.end(), *e) != environment.end())
-				continue;
-			if (e->find('_') != string::npos) {
-				// Drop LongCodes here - ShortCodes are also included
-				string const shorty = e->substr(0, e->find('_'));
-				char const **n = needLong;
-				for (; *n != NULL; ++n)
-					if (shorty == *n)
-						break;
-				if (*n == NULL)
-					continue;
+	if (envShort != "C") {
+		// take care of LC_MESSAGES
+		if (envLongIncluded == false)
+			environment.push_back(envLong);
+		environment.push_back(envShort);
+		// take care of LANGUAGE
+		const char *language_env = getenv("LANGUAGE") == 0 ? "" : getenv("LANGUAGE");
+		string envLang = Locale == 0 ? language_env : *(Locale+1);
+		if (envLang.empty() == false) {
+			std::vector<string> env = VectorizeString(envLang,':');
+			short addedLangs = 0; // add a maximum of 3 fallbacks from the environment
+			for (std::vector<string>::const_iterator e = env.begin();
+			     e != env.end() && addedLangs < 3; ++e) {
+				if (unlikely(e->empty() == true) || *e == "en")
+					continue;
+				if (*e == envLong || *e == envShort)
+					continue;
+				if (std::find(environment.begin(), environment.end(), *e) != environment.end())
+					continue;
+				if (e->find('_') != string::npos) {
+					// Drop LongCodes here - ShortCodes are also included
+					string const shorty = e->substr(0, e->find('_'));
+					char const **n = needLong;
+					for (; *n != NULL; ++n)
+						if (shorty == *n)
+							break;
+					if (*n == NULL)
+						continue;
+				}
+				++addedLangs;
+				environment.push_back(*e);
 			}
-			++addedLangs;
-			environment.push_back(*e);
 		}
+	} else {
+		environment.push_back("en");
 	}
 
 	// Support settings like Acquire::Translation=none on the command line to
@@ -270,6 +262,16 @@
 			return codes;
 	}
 
+	// cornercase: LANG=C, so we use only "en" Translation
+	if (envShort == "C") {
+		allCodes = codes = environment;
+		allCodes.insert(allCodes.end(), builtin.begin(), builtin.end());
+		if (All == true)
+			return allCodes;
+		else
+			return codes;
+	}
+
 	std::vector<string> const lang = _config->FindVector("Acquire::Languages");
 	// the default setting -> "environment, en"
 	if (lang.empty() == true) {

=== modified file 'debian/changelog'
--- debian/changelog	2010-11-06 14:02:27 +0000
+++ debian/changelog	2010-11-09 13:18:19 +0000
@@ -7,12 +7,12 @@
 
   [ David Kalnischkies ]
   * apt-pkg/aptconfiguration.cc:
-    - respect the none-force even in LANG=C (Closes: #602573)
+    - evaluate Acquire::Languages= before LANG= (Closes: #602573)
   * apt-pkg/orderlist.cc:
     - try fixing before removing even if the fix is hidden in
       a provides, hidden in the #590438 testcase
 
- -- David Kalnischkies <kalnischkies@gmail.com>  Sat, 06 Nov 2010 12:50:20 +0100
+ -- David Kalnischkies <kalnischkies@gmail.com>  Tue, 09 Nov 2010 14:16:41 +0100
 
 apt (0.8.8) unstable; urgency=low
 

=== modified file 'test/libapt/getlanguages_test.cc'
--- test/libapt/getlanguages_test.cc	2010-11-06 11:50:49 +0000
+++ test/libapt/getlanguages_test.cc	2010-11-09 13:18:19 +0000
@@ -89,6 +89,24 @@
 	env[0] = "C";
 	vec = APT::Configuration::getLanguages(false, false, env);
 	equals(vec.size(), 0);
+
+	_config->Set("Acquire::Languages", "environment");
+	env[0] = "C";
+	vec = APT::Configuration::getLanguages(false, false, env);
+	equals(vec.size(), 1);
+	equals(vec[0], "en");
+
+	_config->Set("Acquire::Languages", "de");
+	env[0] = "C";
+	vec = APT::Configuration::getLanguages(false, false, env);
+	equals(vec.size(), 1);
+	equals(vec[0], "de");
+
+	_config->Set("Acquire::Languages", "fr");
+	env[0] = "ast_DE.UTF-8";
+	vec = APT::Configuration::getLanguages(false, false, env);
+	equals(vec.size(), 1);
+	equals(vec[0], "fr");
 	_config->Set("Acquire::Languages", "");
 
 	_config->Set("Acquire::Languages::1", "environment");


Reply to: