[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: UDD schema for new queue



On 13/02/09 at 14:48 +0100, Andreas Tille wrote:
> Hi,
>
> I would like to include information about packages residing
> in the new queue into Blends tasks pages.  Because I decided
> to use UDD as main source of information for these pages I
> would like to inject information about new queue into UDD
> first.  I've got a hint to
>
>    http://ftp-master.debian.org/new.822
>
> which provides a good amount of the information I need.  Before
> I start I would like to hear your opinion about creating the
> following two tables and the means to harvest the data for the
> according fields.  Perhaps I'm just lacking some knowledge which
> might simplify things even further:
>
> -- Sources
> CREATE TABLE new_sources (
>        source text,
>        version text,
>        maintainer text,
>        maintainer_name text,
>        maintainer_email text,
>        bin text,                  -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Binary:"

call it binaries? ask ftpmasters to export it to deb822?

>        changed_by text,           -- Uploader?
>        architecture text,
>        homepage text,             -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Homepage:"
>        vcs_type text,             -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Vcs-*:"
>        vcs_url text,              -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Vcs-*:"
>        vcs_browser text,          -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Vcs-Browser:"
>        distribution text,
>        closes int,                -- WNPP bug #

I think that a given NEW upload can close several bugs. (Think of
packages in NEW because of new binary packages, not just new source
packages).

>        license text,              -- trying to parse http://ftp-master.debian.org/new/<bin1>_<version>.html#binary-<bin1>-copyright field "License:"
>        last_modified timestamp,
>        queue text
>     PRIMARY KEY (source, version, distribution)
> );
>
>
> -- Packages
> CREATE TABLE new_packages (
>        package text,
>        version text,
>        architecture text,
>        maintainer text,
>        description text,          -- by parsing http://ftp-master.debian.org/new/<bin>_<version>.html#control field "Description:"
>        source text,
>        source_version,
>        depends text,
>        recommends text,
>        suggests text,
>        enhances text,
>        pre_depends text,
>        breaks text,
>        replaces text,
>        provides text,
>        conflicts text,
>        installed_size integer,
>        homepage text,
>        section text,
>        long_description text,
>        license text,              -- trying to parse http://ftp-master.debian.org/new/<package>_<version>.html#binary-<package>-copyright field "License:"
>     PRIMARY KEY (package, version, architecture, distribution)
> );
>
>
> If there is any better method to obtain the fields above than
> parsing HTML pages I would be really happy if you could enlighten
> me.

ask ftpmasters to export the data you want, if it's already available
somewhere?

> Any more comments?

I'm not sure if all the fields are really that useful... But if they are
there, it's true that it's not that hard to import them as well.
-- 
| Lucas Nussbaum
| lucas@lucas-nussbaum.net   http://www.lucas-nussbaum.net/ |
| jabber: lucas@nussbaum.fr             GPG: 1024D/023B3F4F |


Reply to: