The Adoption of the Python Array API in scikit-learn
Date: May 5
Time: 14:20 - 14:40
Location: Open Stage
The Python array API is a standard inspired by NumPy's own API. It is being progressively adopted, on the one hand by array-producing libraries such as NumPy, PyTorch, CuPy, JAX, and DPNP, and on the other hand by array-consuming libraries such as SciPy and scikit-learn. Once adopted, it will allow scikit-learn to leverage the hardware acceleration (e.g. GPU support) of the library backing the input arrays, instead of always converting them to NumPy and running the computation on the CPU.
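To make the idea concrete, here is a minimal sketch of how an array-consuming function can be written against the standard rather than against NumPy directly. The helper names `get_namespace` and `standardize` are hypothetical and chosen for illustration only; this is not scikit-learn's actual implementation. The sketch assumes that the input either exposes the standard's `__array_namespace__` method (as NumPy 2.0 and later arrays do) or is a plain NumPy array.

```python
import numpy as np


def get_namespace(x):
    """Hypothetical helper: return the array namespace backing `x`."""
    # Arrays that follow the standard expose `__array_namespace__`.
    # Assumption: anything without it is treated as a plain NumPy array.
    if hasattr(x, "__array_namespace__"):
        return x.__array_namespace__()
    return np


def standardize(X):
    """Center and scale each column using only functions from the standard."""
    xp = get_namespace(X)
    mean = xp.mean(X, axis=0)
    std = xp.std(X, axis=0)
    # The arithmetic runs in whichever library produced `X`.
    return (X - mean) / std


X = np.asarray([[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]])
X_scaled = standardize(X)
```

Because `standardize` only calls functions looked up on `xp`, passing an array from another compliant library should keep the computation in that library, and therefore on that library's device. In scikit-learn itself this behaviour is opt-in through the `array_api_dispatch` configuration option, and which estimators support it depends on the installed version.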
In this presentation, we will give an overview of the adoption of this standard in scikit-learn, from both a technical point of view and the point of view of open source contribution dynamics, and then illustrate the performance impact on an example pipeline.