
UDD schema for new queue



Hi,

I would like to include information about packages residing
in the new queue in the Blends tasks pages.  Because I decided
to use UDD as the main source of information for these pages, I
would like to import the new queue data into UDD first.  I was
pointed to

   http://ftp-master.debian.org/new.822

which provides a good part of the information I need.  Before
I start I would like to hear your opinion about creating the
following two tables and about the means to harvest the data for
the corresponding fields.  Perhaps I'm just lacking some knowledge
that would simplify things even further:

-- Sources
CREATE TABLE new_sources (
       source text,
       version text,
       maintainer text,
       maintainer_name text,
       maintainer_email text,
       bin text,                  -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Binary:"
       changed_by text,           -- Uploader?
       architecture text,
       homepage text,             -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Homepage:"
       vcs_type text,             -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Vcs-*:"
       vcs_url text,              -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Vcs-*:"
       vcs_browser text,          -- by parsing http://ftp-master.debian.org/new/<src>_<version>.html#dsc field "Vcs-Browser:"
       distribution text,
       closes int,                -- WNPP bug #
       license text,              -- trying to parse http://ftp-master.debian.org/new/<bin1>_<version>.html#binary-<bin1>-copyright field "License:"
       last_modified timestamp,
       queue text,
    PRIMARY KEY (source, version, distribution)
);
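
A minimal sketch of how one stanza of new.822 could be mapped to the
columns of new_sources that do not need the HTML pages.  It assumes the
file consists of RFC 822 style stanzas separated by blank lines; the
field names and values in SAMPLE are illustrative assumptions and not
copied from the real file, and the helper names are hypothetical.

```python
import re
from email import message_from_string
from email.utils import parseaddr

# Hypothetical stanza: field names and values are assumptions for illustration.
SAMPLE = """\
Source: foo
Version: 1.0-1
Maintainer: Jane Doe <jane@example.org>
Changed-By: John Roe <john@example.org>
Architectures: source amd64
Distribution: unstable
Closes: 123456
Queue: new
"""


def split_stanzas(text):
    """Split a file of stanzas on blank lines, dropping empty chunks."""
    return [s.strip() for s in re.split(r"\n\s*\n", text) if s.strip()]


def stanza_to_source_row(stanza):
    """Map one stanza to a dict keyed by new_sources column names."""
    msg = message_from_string(stanza)
    # Split "Name <address>" into the two separate columns.
    name, address = parseaddr(msg.get("Maintainer", ""))
    closes = msg.get("Closes", "").split()
    return {
        "source": msg.get("Source"),
        "version": msg.get("Version"),
        "maintainer": msg.get("Maintainer"),
        "maintainer_name": name,
        "maintainer_email": address,
        "changed_by": msg.get("Changed-By"),
        "architecture": msg.get("Architectures"),
        "distribution": msg.get("Distribution"),
        # The column is a single int, so only the first bug number is kept.
        "closes": int(closes[0]) if closes else None,
        "queue": msg.get("Queue"),
    }


rows = [stanza_to_source_row(s) for s in split_stanzas(SAMPLE + "\n" + SAMPLE)]
```

Fields missing from a stanza simply end up as None (or as empty strings
for the maintainer split), so the remaining columns can be filled in
later from another source.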


-- Packages
CREATE TABLE new_packages (
       package text,
       version text,
       architecture text,
       maintainer text,
       description text,          -- by parsing http://ftp-master.debian.org/new/<bin>_<version>.html#control field "Description:"
       source text,
       source_version text,
       depends text,
       recommends text,
       suggests text,
       enhances text,
       pre_depends text,
       breaks text,
       replaces text,
       provides text,
       conflicts text,
       installed_size integer,
       homepage text,
       section text,
       long_description text,
       license text,              -- trying to parse http://ftp-master.debian.org/new/<package>_<version>.html#binary-<package>-copyright field "License:"
       distribution text,         -- needed because the primary key below refers to it
    PRIMARY KEY (package, version, architecture, distribution)
);
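
The description and long_description columns could both be filled from
the single "Description:" control field.  The sketch below assumes the
usual control file layout: the first line is the short description,
continuation lines start with one space, and a line holding only " ."
stands for an empty line.  The function name is hypothetical, and
whether the table should store the cleaned text or the raw continuation
lines is a separate decision.

```python
def split_description(value):
    """Split a Description: field value into (short, long) parts."""
    lines = value.split("\n")
    short = lines[0].strip()
    long_lines = []
    for line in lines[1:]:
        # Drop the single leading space that marks a continuation line.
        text = line[1:] if line.startswith(" ") else line
        # A lone "." encodes an empty line between paragraphs.
        long_lines.append("" if text.strip() == "." else text)
    return short, "\n".join(long_lines)


short, long_text = split_description(
    "example tool\n First paragraph.\n .\n Second paragraph."
)
```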


If there is any better method to obtain the fields above than
parsing HTML pages I would be really happy if you could enlighten
me.
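
In case parsing the HTML pages turns out to be the only way, here is a
rough sketch of the idea: strip the markup, collect "Field: value"
lines, and derive the vcs_* columns from the Vcs-* fields.  The markup
in SAMPLE_HTML is made up for illustration and is an assumption about
the page layout, not the real one; all function names are hypothetical.

```python
import re
from html.parser import HTMLParser

# Made-up markup: the real pages may be structured differently.
SAMPLE_HTML = """<pre id="dsc">
Source: foo
Binary: foo, libfoo1
Homepage: <a href="http://example.org/foo">http://example.org/foo</a>
Vcs-Git: git://example.org/foo.git
Vcs-Browser: http://example.org/foo.git/browse
</pre>"""

FIELD_RE = re.compile(r"^([A-Za-z][A-Za-z0-9-]*):[ \t]*(.*)$", re.MULTILINE)


class TextExtractor(HTMLParser):
    """Collect only the text content, discarding all tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def html_to_text(html):
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "".join(parser.chunks)


def extract_fields(text):
    """Return a dict of single-line 'Field: value' pairs found in text."""
    return {name: value.strip() for name, value in FIELD_RE.findall(text)}


def vcs_columns(fields):
    """Derive vcs_type, vcs_url and vcs_browser from the Vcs-* fields."""
    row = {"vcs_type": None, "vcs_url": None,
           "vcs_browser": fields.get("Vcs-Browser")}
    for name, value in fields.items():
        if name.startswith("Vcs-") and name != "Vcs-Browser":
            row["vcs_type"] = name[len("Vcs-"):]  # e.g. "Git" from "Vcs-Git"
            row["vcs_url"] = value
            break
    return row


fields = extract_fields(html_to_text(SAMPLE_HTML))
vcs = vcs_columns(fields)
```

This only handles single-line fields; multi-line fields such as
"Description:" would need the continuation lines collected as well.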

Any more comments?

Kind regards

       Andreas.

--
http://fam-tille.de

