[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#424746: lintian: please detect duplicate words in the description



tag 424746 patch
thanks

On Wednesday 16 May 2007, Justin Pryzby wrote:
> Package: lintian
> Version: 1.23.28
> Severity: wishlist
>
> apt-cache dumpavail |grep -wice 'the  *the'

Attached is a patch adding such check.

Cheers,
-- 
Atomo64 - Raphael

Please avoid sending me Word, PowerPoint or Excel attachments.
See http://www.gnu.org/philosophy/no-word-attachments.html
diff --git a/checks/description b/checks/description
index 9f22377..b06196c 100644
--- a/checks/description
+++ b/checks/description
@@ -116,6 +116,10 @@ while (<IN>) {
         tag "description-contains-homepage";
     }
 
+    if (m,((?:\W|^)(\w+)\s+(\2)(?:\W|$)),i) {
+        tag "description-contains-duplicated-word", "$1";
+    }
+
     if ($lines == 1) {
 	# checks for the first line of the extended description:
 	if (m/^\s/o) {
diff --git a/checks/description.desc b/checks/description.desc
index f517aaf..1b60a48 100644
--- a/checks/description.desc
+++ b/checks/description.desc
@@ -112,3 +112,7 @@ Info: Lintian found a spelling or capitalization error in the package
  for.  It does not have a dictionary like a spelling checker does.  It is
  particularly picky about spelling and capitalization in package
  descriptions since they're very visible to end users.
+
+Tag: description-contains-duplicated-word
+Type: warning
+Info: The description contains a duplicated word, usually by mistake.
diff --git a/testset/description/debian/control b/testset/description/debian/control
index f9e606b..bf24c8c 100644
--- a/testset/description/debian/control
+++ b/testset/description/debian/control
@@ -9,7 +9,7 @@ Package: description
 Architecture: all
 Description: description is a package which tests lintian's description checks.
  missing
- The synopsis should not start with the package's name. Moreover,
+ The the synopsis should not start with the package's name. Moreover,
  the long description
  	should
  	not
diff --git a/testset/tags.description b/testset/tags.description
index 9bae99d..87eae2e 100644
--- a/testset/tags.description
+++ b/testset/tags.description
@@ -21,4 +21,5 @@ W: description-foo: description-starts-with-leading-spaces
 W: description-foo: possible-unindented-list-in-extended-description
 W: description: changelog-not-compressed-with-max-compression changelog.Debian.gz
 W: description: debian-changelog-file-contains-obsolete-user-emacs-settings
+W: description: description-contains-duplicated-word The the
 W: description: description-synopsis-might-not-be-phrased-properly

Attachment: signature.asc
Description: This is a digitally signed message part.


Reply to: