Re: Seeking a small group to package Apache Arrow (was: Bug#970021: RFP: apache-arrow -- cross-language development platform for in-memory analytics)

To: Julian Gilbey <julian@d-and-j.net>
Cc: Diane Trout <diane@ghic.org>, debian-science@lists.debian.org, 970021@bugs.debian.org, debian-python@lists.debian.org, debian-devel@lists.debian.org
Subject: Re: Seeking a small group to package Apache Arrow (was: Bug#970021: RFP: apache-arrow -- cross-language development platform for in-memory analytics)
From: Dirk Eddelbuettel <edd@debian.org>
Date: Sun, 31 Mar 2024 06:48:13 -0500
Message-id: <[🔎] 26121.19837.566425.4463@rob.eddelbuettel.com>
In-reply-to: <[🔎] ZglG14dK1GhjBpVc@d-and-j.net>
References: <[🔎] ZgG_uqfllUUlqH-l@d-and-j.net> <[🔎] 71718a3f86b42248de7cb21abf460b65d56ff61a.camel@ghic.org> <[🔎] Zgh0i71i36IGnrR8@d-and-j.net> <[🔎] feb43c466125f764d4f5108c25fd82dc5575a635.camel@ghic.org> <[🔎] ZglG14dK1GhjBpVc@d-and-j.net>

Julian,

Arrow is a complicated and large package. We use it at work (where there is a
fair amount of Python, also to Conda etc) and do have issues with more
complex builds especially because it is 'data infrastructure' and can come in
from different parts. I would recommend against packaging at old one -- we
also have seen issues with different (py)arrow version biting.

Have you seen https://github.com/apache/arrow-nanoarrow ?

It works via the C API to Arrow which interchanges data via two void* to the
the two structs for arrow array and schema -- and avoids linkage issue. (In
user space the pyarrow or R arrow packages can still be used also interfacing
via these.)  I have been using it for R package bindings for some time and we
plan to expand that (again, at work) -- as do others. It is already use by
duckdb, by the Arrow 'ADBC' interfaces (which are generic in the ODBC/JDBC
sense but for Arrow, and also by a python interface to snowflake.

Dirk

-- 
dirk.eddelbuettel.com | @eddelbuettel | edd@debian.org

Reply to:

References:
- Seeking a small group to package Apache Arrow (was: Bug#970021: RFP: apache-arrow -- cross-language development platform for in-memory analytics)
  - From: Julian Gilbey <jdg@debian.org>
- Re: Seeking a small group to package Apache Arrow (was: Bug#970021: RFP: apache-arrow -- cross-language development platform for in-memory analytics)
  - From: Diane Trout <diane@ghic.org>
- Re: Seeking a small group to package Apache Arrow (was: Bug#970021: RFP: apache-arrow -- cross-language development platform for in-memory analytics)
  - From: Julian Gilbey <julian@d-and-j.net>
- Re: Seeking a small group to package Apache Arrow (was: Bug#970021: RFP: apache-arrow -- cross-language development platform for in-memory analytics)
  - From: Diane Trout <diane@ghic.org>
- Re: Seeking a small group to package Apache Arrow (was: Bug#970021: RFP: apache-arrow -- cross-language development platform for in-memory analytics)
  - From: Julian Gilbey <julian@d-and-j.net>

Prev by Date: Re: morph's abandoned packages (list)
Next by Date: Re: morph's abandoned packages (list)
Previous by thread: Re: Seeking a small group to package Apache Arrow (was: Bug#970021: RFP: apache-arrow -- cross-language development platform for in-memory analytics)
Next by thread: Re: Seeking a small group to package Apache Arrow (was: Bug#970021: RFP: apache-arrow -- cross-language development platform for in-memory analytics)
Index(es):
- Date
- Thread