Bug#687563: RFS: opengrm-ngram/1.0.3-1 [ITP] -- opengrm n-gram, library



On 28/02/2013 00:36, Giulio Paci wrote:
> On 27/02/2013 13:27, Jakub Wilk wrote:
>> * Giulio Paci <giuliopaci@gmail.com>, 2013-01-13, 13:56:
>>>> Would it be possible to exclude binary files from being analysed by licensecheck?
>>> Done.
>>
>> Now debian/source/include-binaries can be dropped.

Dropped.

>> Do you know how the files in src/testdata/* were generated? I wonder if we have the full source for it.
> 
> No, I do not know. After manually inspecting the test scripts, I can say that I do not know how to generate:
> 1) testdata/earnest.cat, testdata/earnest.det, testdata/earnest.fst, testdata/earnest.min (unused)
> 2) testdata/earnest.txt (used as input of ngramsymbols_test.sh and I guess the source of all the earnest.* files)
> 3) testdata/earnest.det.far, testdata/earnest.fst.far, testdata/earnest.min.far (used as input of ngramcount_test.sh)
> 4) testdata/init.randcorpus.0.mod, testdata/init.randcorpus.1.mod, testdata/init.randcorpus.2.mod, testdata/init.randcorpus.3.mod (used as input of ngramrand_test.sh)

> I sent an email upstream to ask about the other files, but I guess we can suppose that we have all the reasonable sources for those files.

Upstream confirmed the hypothesis that all the earnest.* files were derived from earnest.txt. However, upstream does not remember the exact procedure used to obtain earnest.fst and
earnest.cat. The init.randcorpus.<number>.mod files were produced using a random number generator.

Best,
	Giulio.

