[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Check whether a .po file is OK



Quoting vince (vinz65@gmail.com):
> Hi everyone
> 
> I wonder how I can check whether a .po file is OK or whether it has
> error.
> 
> Currently I
> - first try to compile it with gtranslator
> - then try to compile it from the command line with "msgfmt" (as I saw
> that sometimes gtranslator compiles successfully and does not warn over
> eventual errors)
> 
> Is that enough, or do I have to make other checks? And which ones?


I have a few more tests I run on PO files.

Please find attached my "po_test" script, along with check_var.pl it
uses. This is based on code kindly provided by Denis Barbier. I
unfortunately don't remember who is the author of check_var.pl

Undocumented shell and Perl script.

Please do not comment on my awful writing style for shell scripts. I
know about it.


Basically, it:

- syncs the file with a POT file in the same directory (and chokes if
there is not POT file....this should be enhanced)
- tests the charset
- runs "msgfmt -c"
- tests for variable matching (may bring false positives when a
translation inverts variables but that better than nothing



#!/bin/sh
#
# Sync a PO file with a POT file
# test its charset
# check its validity
#
# Copyright © 2006-2008 Christian Perrier <bubulle@debian.org>
# Copyright © 2006 Denis Barbier <barbier@debian.org>

temp=`tempfile`
trap "rm -f $temp $temp.new" 1 2 3 15

clean_and_die() {
    rm -f $temp $temp.new
    if [ "$1" = "2" ] ; then
	echo Invalid encoding. Please check the file.
    fi
    exit $1
}


if [ -z $1 ]
then
  echo Usage $0 fichier
  clean_and_die 1
fi
if [ ! -f $1 ]
then
  echo $1 not found.
  clean_and_die 1
fi

if [ `ls -1 *\.pot 2>/dev/null | wc -l` != "1" ] ; then 
  echo No POT file \(or more than one\) in current directory
  clean_and_die 1
fi

potfile=`ls -1 *\.pot`
msgmerge -U --previous $1 $potfile
if msgcat $1 >$temp; then
	mv $temp $1
fi

enc=$(grep -E "^\"Content-Type" $1 | cut -f2 -d= | sed 's/\\n\"//g')

echo -n "Charset is $enc. Stats: "


if test -n "$enc" ; then
  iconv -f $enc -t UTF-8 $1 >$temp >/dev/null 2>&1 || clean_and_die 2
  if [ -x $HOME/bin/check_var.pl ] ; then
    $HOME/bin/check_var.pl $1 || clean_and_die 3
  else
    echo $HOME/bin/check_var.pl not found
    clean_and_die 1
  fi
  LC_ALL=C msgfmt -c -v -o /dev/null --statistics $1
  chmod 644 $1
fi


clean_and_die 0
#! /usr/bin/perl

sub getVars
{
        my $text = shift;
        my $var = '';
        while ($text =~ m/\G.*?(\$\{[^{}]+\})/g) {
                $var .= $1;
        }
        return $var;
}

$/ = "\n\n";
open (PO, "< $ARGV[0]") or die "Unable to open $ARGV[0]: $!\n";
while (<PO>)
{
        s/"\n"//g;
        (my $msgid) = m/^msgid "(.*)"$/m;
        (my $msgstr) = m/^msgstr "(.*)"$/m;
        next if $msgstr eq '' || m/^#, .*fuzzy/m;
        my $var1 = getVars($msgid);
        my $var2 = getVars($msgstr);
        print if $var1 ne $var2;
}
close (PO);

Attachment: signature.asc
Description: Digital signature


Reply to: