[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Popcon time-series data points



(Adding popcon developers to Cc, and reordering paragraphs so that it's
easier to follow)

On 14/02/18 at 10:47 +0000, Nikolaos Alexopoulos wrote:
> On Tue, 2018-02-13 at 19:36 +0100, Lucas Nussbaum wrote:
> > On 12/02/18 at 13:23 +0000, Nikolaos Alexopoulos wrote:
> > > Hi everyone,
> > > 
> > > As part of our research here at TU Darmstadt, we would like to have
> > > access to the time points used to generate the Debian popcon
> > > graphs. Is
> > > there a person we could talk to in order to gain access to the
> > > data,
> > > while of course respecting the submitters' privacy?
> > 
> > I don't think that Debian has a policy in place that would allow to
> > share the raw data.
> > 
> > Maybe it would be easier if you explained what you are trying to do,
> > and
> > someone generates the required processed data for you?
> 
> In a nutshell, what we want to do is find correlations between the
> (changes in) popularity of Debian (source) packages and the
> bugs/vulnerabilities reported for them over time. Therefore, we do not
> need raw data, rather we just need snapshots of the popcon public data
> in monthly intervals for the past years. I would assume that sharing
> this data does not infringe on popcon's privacy statement.

OK, I looked into how the popcon graphs are implemented.

If nobody objects by 2018-02-21, I'll send you the output of

  select p.day, package, p.vote, p.old, p.recent, p.no_files,
  submissions
  from popcon_package pp
  join popcon p on (pp.id = p.package_id)
  join popcon_day pd on (pd.day = p.day)
  where in_debian order by p.day;

Which results in something like:

    day     | package |  vote  |  old  | recent | no_files | submissions 
------------+---------+--------+-------+--------+----------+-------------
 2004-01-24 | bash    |   1600 |    48 |     33 |        0 |        1684
 2004-01-25 | bash    |   1727 |    49 |     35 |        0 |        1814
 2004-01-26 | bash    |   1731 |    52 |     41 |        0 |        1828
 2004-01-27 | bash    |   1724 |    53 |     54 |        0 |        1835
 2004-01-28 | bash    |   1727 |    52 |     61 |        0 |        1844
 2004-01-29 | bash    |   1620 |    50 |     65 |        0 |        1739
 2004-01-30 | bash    |   1631 |    48 |     67 |        0 |        1750
 2004-01-31 | bash    |   1639 |    49 |     74 |        0 |        1766
 2004-02-01 | bash    |   1871 |    43 |    103 |        0 |        2020
 2004-02-02 | bash    |   1890 |    34 |    112 |        0 |        2039
 2004-02-03 | bash    |   1910 |    33 |    106 |        0 |        2052
 2004-02-04 | bash    |   1909 |    33 |    103 |        0 |        2047

(Please remind me if I forget to send this to you)

- Lucas


Reply to: