[Pkg-octave-devel] Bug#706376: Bug#706376: Bug#706376: Bug#706376: Bug#706376: Bug#706376: octave: sparse matrix n*2^16

To: David Bateman <david@bateman.eu>, 706376@bugs.debian.org
Cc: Miroslaw Kwasniak <Miroslaw.Kwasniak@pwr.wroc.pl>, Octave Maintainers List <octave-maintainers@octave.org>
Subject: [Pkg-octave-devel] Bug#706376: Bug#706376: Bug#706376: Bug#706376: Bug#706376: Bug#706376: octave: sparse matrix n*2^16
From: Jordi Gutiérrez Hermoso <jordigh@octave.org>
Date: Wed, 19 Jun 2013 15:53:42 -0400
Message-id: <CAPHS2gypQ_m7H_dZZ_atXVUP_-f+sQQY+ne7+LZAU4YzDXnFyg@mail.gmail.com>
Reply-to: Jordi Gutiérrez Hermoso <jordigh@octave.org>, 706376@bugs.debian.org
In-reply-to: <51C1F316.1080404@bateman.eu>
References: <20130429102522.GA28312@uv.ikem.pwr.wroc.pl> <CAPHS2gxr6tYVkJFrQqCUd=aYu93b1uV-4tG_o46vnjZ5kHw6UQ@mail.gmail.com> <CAKWV3Rr1J0LQO3Vx_ASs5epq6U1tn1+nSgjAEecePhUAKga7WA@mail.gmail.com> <CAPHS2gyTkO7BrmTO6Q9o-Tc01wuGr_LeM=0fLLiCabh+GPjdtA@mail.gmail.com> <CAKWV3RogRArufv-F8dt5ftEm5SqAmYbjdot13O2LVADGgpQu7Q@mail.gmail.com> <CAPHS2gwsVbvQuaeQLaUMosCjp4pdZgj7noEohveyGb2-tV6u3A@mail.gmail.com> <CAPHS2gzdPYpp2XscvcdOo+gC3D81JHBXujm8bfWyyxJKrdfoJA@mail.gmail.com> <CAKWV3Rr_MP6Dcqqu1HSSqjCH-3-WJhuG-ncLtXQYQjhV1kgM+A@mail.gmail.com> <CAPHS2gxqDAr6gDpkPV+4_x3v8NM=rZ9LEmnKkQWewdheSLU9-Q@mail.gmail.com> <CAKWV3RpWZ2XTnUX8t0qCmUs-YZ2zEh_YFurxbCpK=1gCCjd4cw@mail.gmail.com> <CAPHS2gxBTZpcwrUpvDJTDKBEiG+qx7GCTsWB3LkMR0ZYZw1zTg@mail.gmail.com> <CAKWV3Rp5_FLhmUHN9XD5A2Ts2AsFuc1jqNpnyixiicf-AvVj_A@mail.gmail.com> <CAPHS2gyTFfBPRyrhEQLid0xtWeCAj3M2cewbd_5dPtw6oUeVKw@mail.gmail.com> <51BC817E.7080401@bateman.eu> <CAPHS2gx6wiy4o-Hj5tMJ7sq3aVh80Z=6_t_Y26vzVy9HAX9QoQ@mail.gmail.com> <51C1F316.1080404@bateman.eu>

On 19 June 2013 14:06, David Bateman <david@bateman.eu> wrote:
> On 06/16/2013 03:59 AM, Jordi Gutiérrez Hermoso wrote:
>> I'm saying the real problem is that we assume linear indexing works
>> for all matrix types, including sparse matrices. I claim that this is
>> the real problem.
>
> Who is assuming linear indexing works for all matrix types ? Where
> exactly is that stated ?

We are assuming it in our code: in numel, for one, and in places like
whatever processes A(idx) for a logical index idx. We're not making
special cases in these places, such as "if (sparse_type)
dont_linearly_index();"

Each of these places that assume that linear indexing works needs to
be patched to check for sparse types.
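
To make the failure concrete, here is a minimal sketch of the arithmetic
behind the assumption. It is not Octave's code: idx32 and linear_index are
hypothetical names, idx32 stands in for a 32-bit octave_idx_type, and the
indices are assumed to be zero-based and non-negative. For a 2^16 x 2^16
sparse matrix, as in the bug title, most elements simply have no
representable linear index.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical stand-in for a 32-bit octave_idx_type.
using idx32 = std::int32_t;

// Compute the zero-based, column-major linear index of element (i, j)
// in a matrix with `rows` rows.  Returns false, leaving `out` untouched,
// when the index cannot be represented in idx32.  The arithmetic is done
// in 64 bits so that the check itself cannot overflow.
// Assumes rows, i and j are non-negative.
bool linear_index (idx32 rows, idx32 i, idx32 j, idx32& out)
{
  const std::int64_t k = static_cast<std::int64_t> (j) * rows + i;

  if (k > std::numeric_limits<idx32>::max ())
    return false;

  out = static_cast<idx32> (k);
  return true;
}
```

With rows = 65536, element (65535, 32767) is the last one whose linear
index still fits; everything from column 32768 onwards does not.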

>> We can patch around this problem by avoiding linear
>> indexing,
>
> The bug report was in "trace" that called "isempty" on a sparse matrix.
> Neither function needs "numel" or "linear indexing". We aren't patching
> around anything, we are fixing a bug

We are fixing one symptom of a larger bug, a bug that is present in
many different locations.

>>  but this is just treating the symptoms, not the disease.
>
> So you advocate everyone moving to 64-bit indexing ?

That would delay the problem for a nontrivial amount of time. It would
be nice, but it wouldn't fix the problem.

There are other things we could do.

(1) Avoid linear indexing for sparse matrices as much as possible,
i.e. check everywhere we can think of for sparse matrix types. You've
mentioned a few more places where this should be checked.

(2) Warn when creating sparse matrices with large indices that some
operations may not work, or clearly error out when those operations
are attempted.

(3) Abstract octave_idx_type so that it doesn't actually use 32-bit
ints for sparse matrices.

>> While I don't deny that we can make some progress masking the
>> symptoms, the disease itself should also be treated somehow.
>
> There is no disease, unless you want to artificially limit the
> size of sparse matrices that can be treated so that numel is less
> than 2^31 for 32-bit indexing and 2^63 for 64-bit indexing. Why do
> this, when it makes sparse matrices much less useful? So there is no
> solution for what you call a disease.

Well, numel needs "if(sparse_type) {weep();}" or whatever.

>>> So essentially you're saying that sparse matrices with
>>> 32-bit indexing and numel larger than 2^31 are useless!!
>> I'm saying that they will fail in other unexpected ways,
>
> Isn't that the definition of a bug?

Yes, the real bug: that we have a tacit assumption in our code that
linear indexing works.

>>  and we shouldn't mask symptoms.
>
> We never tried to. Look at the code in dim-vector.h
>
> <quote>
>   // The following function will throw a std::bad_alloc ()
>   // exception if the requested size is larger than can be indexed by
>   // octave_idx_type. This may be smaller than the actual amount of
>   // memory that can be safely allocated on a system.  However, if we
>   // don't fail here, we can end up with a mysterious crash inside a
>   // function that is iterating over an array using octave_idx_type
>   // indices.
>
>   octave_idx_type safe_numel (void) const;
> </quote>
>
> The numel method of Sparse<T> calls this method that is supposed to
> throw an error. However as the builtin Fnumel is calling args(1).numel()
> which is calling dims().numel() the sparse safe version of numel isn't
> being called. The solution is to add a numel method to
> octave_base_sparse that calls dims().safe_numel() instead. So this is a
> bug as well.

Yep, I suppose that's my dont_linearly_index() function suggested
above.
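
For readers following along, a checked element count of the kind described
in that comment could look roughly like the sketch below. This is only an
illustration under stated assumptions, not the actual dim_vector code:
safe_numel_sketch and idx_type are hypothetical names, idx_type is assumed
to be 32 bits wide, and the dimensions are assumed to be non-negative.

```cpp
#include <cstdint>
#include <limits>
#include <new>
#include <vector>

// Assumption: a build with 32-bit indices.
using idx_type = std::int32_t;

// Multiply the dimensions together, throwing std::bad_alloc as soon as
// the running product can no longer be represented in idx_type.
// Assumes every dimension is non-negative.
idx_type safe_numel_sketch (const std::vector<idx_type>& dims)
{
  const idx_type idx_max = std::numeric_limits<idx_type>::max ();
  idx_type n = 1;

  for (idx_type d : dims)
    {
      // n * d exceeds idx_max exactly when n > idx_max / d (for d > 0),
      // so test before multiplying rather than after.
      if (d != 0 && n > idx_max / d)
        throw std::bad_alloc ();

      n *= d;
    }

  return n;
}
```

Called with dimensions {65536, 65536} this throws instead of silently
wrapping around, which is the behaviour the unchecked numel path lacks.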

> As for linear indexing, if you look in idx-vector.cc you'll see that in
> the convert_index functions an error is returned like
>
> octave_idx_type idx = static_cast<octave_idx_type>(d);
> bool err = static_cast<double> (idx) != d;
>
> So as expected
>
> s = speye (2^17);
> s (2^32)
>
> throws an error
>
> error: subscript indices must be either positive integers or logicals
>
> You might think this is a little cryptic but I wouldn't call it "masking
> a symptom". I propose modifying this error to read
>
> error: subscript indices must be either positive integers less than 2^31
> or logicals
>
> for 32 bit indexing and
>
> error: subscript indices must be either positive integers less than 2^63
> or logicals
>
> for 64 bit indexing.

I think you'll still need to do something about the more realistic
situation of indexing with a logical matrix, which also ends up
translating into linear indexing thanks to our underlying bug: the
assumption that linear indexing works. In this case, there shouldn't
be an error at all, as Ed suggested, since we have enough
information in a logical matrix to avoid linear indexing.
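
As a rough sketch of that idea, and not how Octave's Sparse<T> is actually
laid out: if both the matrix and the logical mask are kept as sets of
(column, row) coordinates, then A(mask) can be evaluated by looking up each
coordinate pair directly, and no linear index is ever formed. The names
sparse_sketch, coord and index_by_mask are hypothetical, and the result is
returned as a plain vector of values for simplicity.

```cpp
#include <cstdint>
#include <map>
#include <set>
#include <utility>
#include <vector>

using idx_type = std::int32_t;

// (column, row), so that the default ordering of pairs is column-major.
using coord = std::pair<idx_type, idx_type>;

// Toy sparse matrix: only the nonzero entries are stored, keyed by
// their (column, row) coordinates.
struct sparse_sketch
{
  idx_type rows;
  idx_type cols;
  std::map<coord, double> nz;
};

// Values of `a` at the true entries of a logical mask, in column-major
// order.  Each lookup uses the coordinate pair itself, so no product of
// dimensions is computed and nothing can overflow idx_type.
std::vector<double>
index_by_mask (const sparse_sketch& a, const std::set<coord>& mask_true)
{
  std::vector<double> out;
  out.reserve (mask_true.size ());

  for (const coord& c : mask_true)
    {
      auto it = a.nz.find (c);
      out.push_back (it == a.nz.end () ? 0.0 : it->second);
    }

  return out;
}
```

This works for a 65536 x 65536 matrix even with 32-bit indices, because
every row and column index fits individually even though their combined
linear index would not.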

> See the attached changeset

It looks good. Can you push it? Also, note that we have an actual
Octave bug for it:

    https://savannah.gnu.org/bugs/?38936

Contrary to appearances, I don't mean to be unhelpful, so please let me
know if you can't push the fix yourself.

- Jordi G. H.
