On 2017-10-25 15:12, Michael Crusoe wrote:
2017-10-25 16:04 GMT+03:00 Steffen Möller <steffen_moeller@gmx.de>:
On 25.10.17 13:47, Michael Crusoe wrote:
someone
2017-10-25 14:34 GMT+03:00 Steffen Möller <steffen_moeller@gmx.de
<mailto:steffen_moeller@gmx.de>>:
On 25.10.17 10:56, Michael Crusoe wrote:
Sorry, I missed the bit where we are deprecating RRID. Can
https://arxiv.org/ftp/arxiv/paexplain?
Because of
pers/1707/1707.03659.pdf [1]
<https://arxiv.org/ftp/arxiv/pandapers/1707/1707.03659.pdf [1]>
some web googling from which I gathered that the "ResearchResource
IDentifiers" are not only provided by SciCrunch. Admittedly, Ifail to
find that page now that I want to find it :o/<http://scicrunch.org> is the
There is no conflict here. scicrunch.org [2]
post-pilot phase of what is described in that paper.fight
Personally, I could not care less, let those catalog providers
that out among themselves. However, I find that the notion of"RRID" to
SciCrunch
clearly identifies the provenance of that information, while
me is more of a concept coined by (https://www.force11.org),not a
provider. And with several initiatives following the samepurpose, I
found that by using SciCrunch not RRID, we would be the mostto the
provider-neutral. And then again, it is only something local
Debian packaging, not publicly visible, so nobody should trulysci
care and
the use of SciCrunch imho serves us best on a technical level.
RRIDs share a single name space that allow for multiple providers,
crunch being the current main provider for software tools andfor an
databases and other registries responsible for the other types. By
referring to RRIDs generically then there is no conflict.
See https://www.ebi.ac.uk/miriam/main/datatypes/MIR:00000558 [3]
overviewof
Please rename this field to RRID, or better yet just have a list
URIs like we do in CWL so you don't have to care if it is a RRID,DOI
or whatever :-)[4]
http://www.commonwl.org/v1.0/CommandLineTool.html#SoftwarePa ckage
This is what we are doing. The field is called "Registry" (not RRID,
so
we can also refer to Wikis and other catalogs) and allows for an
arbitrary unordered number of (Name, Entry) tupels, in complete
analogy
to the CWL, I tend to think.
Well, no. In CWL we don't separate the provider from the identifier.
That's the whole point about COOL URIs.
I've CC'd Stian as he explains this better than I do.
The problem is that from the applied IDs, only Bio.Tools provide COOL URIs. Other providers should, but they don't, at least not yet. Thus a provider + ID pair is, unfortunately, necessary.
As a side-track: You mentioned DOIs, Michael.
Would it make sense if Debian (Med) adds DOIs as citable links to upstream releases, in addition to the upstream version and upstream repo information?
And in any case, DOIs for both the upstream project as a whole (i.e. all releases), and/or for the particular releases, can be added as citations for a src package, if package maintainer or upstream wish so.
Related to your comment (and very, very close to my heart) is the
question if we do everything sufficiently well to map the CWL
workflows
to Debian packages. We could for instance add references to
CWL-workflow-database-entries for the workflows that a particular
Debian
packages is used in, so we can test them when the package updates -
er,
before the package updates in the distribution.
We are good here; you can determine the packages used in any given CWL
description that includes a SoftwareRequirement that is mappable
directly or indirectly to a package.
For automated testing you would need a way to specify "normal" or
expected results; CWL v1.0.x doesn't have that concept. A
researchobject.org [16] RO that contains/references those results with
the corresponding CWL workflow would however fulfill this role.
And another side-track: In addition to CWL workflows and using them as test (requiring some input-output pairs and equality relation), would it make sense for Debian to link to some kind of "CWL wrappers" for the single tools?
That is again similar to the elsewhere-discussed proposal of generating (and/or linking to) software containers (Docker, Singularity, rkt?)...
Back to the topic: I agree with Steffen that if we mean the link pairs as Provider + ID (as opposed to ID_type + ID_value), then SciCrunch makes more sense than RRID.
Cheers,
Matus