Re: XLIFF tools

To: debian-i18n@lists.debian.org
Subject: Re: XLIFF tools
From: JC Helary <jch.helary@free.fr>
Date: Tue, 27 Dec 2005 11:25:17 +0900
Message-id: <[🔎] 3FE8278A-99B3-4186-96BD-3BFFDA5DD405@free.fr>
In-reply-to: <[🔎] 20051226155323.GA16999@nekral.homelinux.net>
References: <[🔎] 20051221141520.GA13261@javifsp.no-ip.org> <[🔎] 20051221172431.GB32161@random.kti.ae.poznan.pl> <[🔎] 20051221190239.GG5754@djedefre.onera> <[🔎] 20051222110437.GA6442@random.kti.ae.poznan.pl> <[🔎] 448DE84C-4B3A-4C81-B624-8AC36D4994DD@free.fr> <[🔎] 1135268170.6877.25.camel@localhost.localdomain> <[🔎] 83D05A76-FC01-414F-A636-EF7E5138F597@free.fr> <[🔎] 20051222204645.GA30667@nekral.homelinux.net> <[🔎] 29C56D2E-77A9-43FE-A979-E8C5A816AB26@free.fr> <[🔎] 20051223153453.GA6554@nekral.homelinux.net> <[🔎] 20051226155323.GA16999@nekral.homelinux.net>

 * Transolution
   http://transolution.python-hosting.com/
   an XLIFF editor with filters for SGML based formats (HTML, XML,
   Docbook, OpenOffice) and PO. It also provides a Translation Memory

server (which can use TMX files), and a tool to convert XLIFF toTMX.

My understanding is that their support fo XLIFF files is still notfull. But the project is promissing (previously known as evil-trans,name changed to make sure the project gets mainstreamacceptance... :) There are dependancies that don't work very wellwith OSX right now but that is not relevant for Debian.

 * OmegaT
   http://www.omegat.org/omegat/omegat.html
A translator GUI. It supports tag based formats (HTML,OpenOffice) andplain text. It can use and generate a TMX. It generate thetranslated
   documents and the XLIFF.
I'm not sure it can be packaged for Debian (requires Swing,requires an
   Apple's .jar)
   Also the dependency chain is quite important.

OmegaT is not a XLIFF editor. The next version (RC5 was release onthe 24th :) supports any level of TMX and passed the Lisa compliancetest. LISA maintains the TMX standard, OASIS the XLIFF standard.


Exported files are _not_ XLIFF.

OmegaT also supports Java properties bundles so that Java apps(including OmegaT itself) can be localised in OmegaT.

We (I am member of the dev team as tester/localiser/documentationmaintainer/user support/OSX bundle maintainer :) are working onbilingual source files support: po/xliff/etc as well as DocBook andgeneric xml source files (any helper is welcome btw.)

As for the Apple's jar thing I am not sure it is _required_. If itis, it is only for use on OSX and thus irrelevant to Debian. As forthe SWING thing, you are correct.

 * omegat+
   http://omegatplus.sourceforge.net/
   It seems this one don't have the same dependencies than OmegaT. It
   should be easier to package on Debian. I don't know if it is as
   complete as OmegaT (or maybe more complete).

Just for your info. The developer used to work in OmegaT as adocumentation maintainer, he broke all out previous translationmemories and eventually left the project to create his fork since hecould not stand being explained translation related things by peoplewho were not able to produce code (besides for the fact that theOmegaT lead developer is a translator and has done the Eclipseinterface to Russian using OmegaT...) It may be that in the futurethis fork will create good code, but the user/tester base isinexistant. Right now it has yet to produce anycode.

 * SUN's open-language-tools
   https://open-language-tools.dev.java.net/

It is licensed under CDDL. It can't entre main currently. Idon't know

   if it could be shiped in non-free in Debian.
   I've not tried it.

SUN had an in-house Java tool used for their own translation process.It is a pure XLIFF editor using their in-house generated XLIFF files(OOo has been localised with STE).The package has been released under the name OpenLanguageTools. Itworks well, I don't know the dependancies except for it being a Javaapp.Now a number of OOo Language Native groups are working with OmegaT ondocumentation translation for ease of use (and quality of usersupport :) Although GUI files are still worked on with XLIFF files.

With xlifftool, we can probably convert any PO to create an XLIFFfile.
However, I'm not sure we can do the reverse conversion with any XLIFF
file.


That's something to test.

Also, did you encountered any issue due to providing POs to
LocFactoryEditor? Would the translations be easier if we couldprovide you
XLIFF files instead of POs?

I use LocFactoryEditor (I am working on the French localisation rightnow). LFE's po/xliff support is equivalent. The advantage of xlifffiles is that, as you noticed yourself, there are plenty of solidoptions outside the Linux world while .po files are restricted toFink's kbabel/gtranslator or LFE. Knowing that kbabel/gtranslator areX11 apps (even on OSX) there are character input systems issues:everything need to be set up especially to get either of those 2 appsto work and it is a pain in the butt. The only valid option on OSX isLFE but it is not a free (either way) application even if the demomode is working without any annoying limitation (a bunch of functionlimitations though).

For contributions to Debian form outside the Debian world, xlifffiles would be appreciated.

If you want to tests this softwares, I've made Debian packages for

Transolution, xlifftool and XML-TMX: https://nekral.homelinux.net/pootle/

(online when I'm not sleeping)
Omegat doesn't need to be installed (java -jar omegat.jar should be
sufficient to test it).

You are right. OmegaT was first of all designed as an app that workscrossplatform because of the lack of translator's tools on Linux. Itis supposed to work perfectly well on Linux. The recent RC includespreference files set as each platform demands: in ~/.omegat on Linux.

We still need to work a lot for full Linux "acceptance" but there isa developer who ported OmegaT 1.4.5 to Gentoo if you are interested:

http://bugs.gentoo.org/show_bug.cgi?id=91559

1.4.5 will be obsoleted as soon as 1.6 is released though (sometimein January).

Generally speaking, my earlier comments on the .po format are limitedby my lack of familiarity with the format. I apologise for anymisunderstanding born from my mails.

I think though that it is clear that .po as well as .xliff are both_localisation_ oriented formats while .xliff also provides nativesupport for documentation translation. In short, it is _conceptually_easier to fully localise an application (GUI/Docs in various formats)by using an exclusively .xliff based process than by using anexclusively .po based one.

Tweaks being slowly avaliable to provide .po<->.xliff conversionsboth formats could be considered equivalent for a subset of functionsprovided by .xliff though. If that subset is enough for Debian itshould be a satisfying option.

As far as translation management is concerned though, it seems to methat translation variants are better handled by xliff and thisspecific item should greatly enhance the translation process inDebian if properly implemented:

In a <tu> it could be possible to have previous versions <tuv> thatdo not differ in meaning but only in structure (either spellcheck orsyntax, punctiation corrections) and those could be set to beequivalent to the target language <tuv> without any trouble, whatseems (?) to be difficult right now in a .po setting.

The fact that xliff interacts fully with other translation standards(tbx for glossaries, srx for segmentation, tmx for tm exchange)greatly enhances the translator's experience and allows a Debianlocaliser to easily leverage her/his work with external sources thatwould require quite a lot of fiddling right now, while keeping theoutput result consistent with the Debian po based localisationframework.

If gettext were not involve at all, a fully xliff based process wouldbe valid. Since gettext is at the core of Debian, inevitably .pocomes in and the localisation functions of .xliff are thusredundants. Is it valid to use .xliff for documentation only and .pofor GUI work only ? Is it possible to create TMs from the GUI workthat can be used with the Documentation work ? Is it possible to workwith multiple files and file formats on one localisation project tohave trasnparent access to the whole data set (like OmegaT does) ? Iam not sure all the above questions are relevant since I don't knowthe current process but it seems to be they are questions that occurin any multi-file format translation process within the free and nonfree world equally...


Jean-Christophe

Reply to:

Follow-Ups:
- Re: XLIFF tools
  - From: Clytie Siddall <clytie@riverland.net.au>
- Re: XLIFF tools
  - From: Keld J|rn Simonsen <keld@dkuug.dk>

References:
- Re: Work on a centralized infrastructure for i18n/l10n
  - From: Javier Fernández-Sanguino Peña <jfs@computer.org>
- Re: Work on a centralized infrastructure for i18n/l10n
  - From: Thomas Huriaux <thomas.huriaux@gmail.com>
- Re: Work on a centralized infrastructure for i18n/l10n
  - From: Christian Perrier <bubulle@debian.org>
- Re: Work on a centralized infrastructure for i18n/l10n
  - From: Thomas Huriaux <thomas.huriaux@gmail.com>
- Re: Work on a centralized infrastructure for i18n/l10n
  - From: JC Helary <jch.helary@free.fr>
- Re: Work on a centralized infrastructure for i18n/l10n
  - From: Stefano Canepa <sc@linux.it>
- Re: Work on a centralized infrastructure for i18n/l10n
  - From: JC Helary <jch.helary@free.fr>
- Re: Work on a centralized infrastructure for i18n/l10n
  - From: Nicolas François <nicolas.francois@centraliens.net>
- Re: Work on a centralized infrastructure for i18n/l10n
  - From: JC Helary <jch.helary@free.fr>
- XLIFF tools (was: Work on a centralized infrastructure for i18n/l10n)
  - From: Nicolas François <nicolas.francois@centraliens.net>
- Re: XLIFF tools
  - From: Nicolas François <nicolas.francois@centraliens.net>

Prev by Date: Re: Translation coordination robot, who?
Next by Date: Re: Committing D-I manual files
Previous by thread: Re: XLIFF tools
Next by thread: Re: XLIFF tools
Index(es):
- Date
- Thread