Support multiple tensor backends via NEP-47 (Python array API standard)

learning-chip · July 1, 2021, 1:37am

The recently proposed NEP-47 attempts to unify the APIs of various tensor frameworks (NumPy, TensorFlow, PyTorch, Dask, JAX, CuPy, MXNet, etc.) via the Python array API standard.

It is a much more compact version of the original NumPy API, removing functions that are not friendly to heterogeneous hardware like GPUs.

Since PyMC3 is currently using JAX via Aesara, it should be quite painless to adopt NEP-47 for multi-backend support, I guess?

Related topic: NumPy array protocols

learning-chip · July 1, 2021, 1:40am

References:

learning-chip · July 1, 2021, 2:01am

Looking at PyMC3 and Aesara source code, it seems relatively easy to switch to NEP-47 (basically numpy).

In Aesara, most of the JAX-related code is in aesara\link\jax\dispatch.py

Code snippet:

import jax
import jax.numpy as jnp
import jax.scipy as jsp

...

@jax_funcify.register(Cholesky)
def jax_funcify_Cholesky(op, **kwargs):
    lower = op.lower

    def cholesky(a, lower=lower):
        return jsp.linalg.cholesky(a, lower=lower).astype(a.dtype)

    return cholesky


@jax_funcify.register(Solve)
def jax_funcify_Solve(op, **kwargs):

    if op.assume_a != "gen" and op.lower:
        lower = True
    else:
        lower = False

    def solve(a, b, lower=lower):
        return jsp.linalg.solve(a, b, lower=lower)

    return solve


@jax_funcify.register(Det)
def jax_funcify_Det(op, **kwargs):
    def det(x):
        return jnp.linalg.det(x)

    return det


@jax_funcify.register(Eig)
def jax_funcify_Eig(op, **kwargs):
    def eig(x):
        return jnp.linalg.eig(x)

    return eig
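
The registration pattern above could probably be mirrored for an array-API backend. Here is a minimal sketch, assuming the dispatcher behaves like `functools.singledispatch`. The name `array_api_funcify` and the stand-in `Det` and `Solve` classes are hypothetical (not Aesara's actual code), and NumPy stands in for the array namespace `xp`:

```python
from functools import singledispatch

import numpy as np


class Det:
    """Hypothetical stand-in for an Aesara Op (not the real class)."""


class Solve:
    """Hypothetical stand-in for an Aesara Op (not the real class)."""


@singledispatch
def array_api_funcify(op, xp=np, **kwargs):
    # Fallback for Ops that have no registered conversion.
    raise NotImplementedError(f"No array-API conversion for {type(op).__name__}")


@array_api_funcify.register(Det)
def array_api_funcify_Det(op, xp=np, **kwargs):
    def det(x):
        # Only namespace-level calls, so any module exposing linalg.det works.
        return xp.linalg.det(x)

    return det


@array_api_funcify.register(Solve)
def array_api_funcify_Solve(op, xp=np, **kwargs):
    def solve(a, b):
        # General solve only; the structured cases of the JAX version are omitted.
        return xp.linalg.solve(a, b)

    return solve


det_fn = array_api_funcify(Det())
print(det_fn(np.array([[2.0, 0.0], [0.0, 3.0]])))
```

Swapping `xp` for another namespace that offers the same `linalg` functions would, in principle, be the only change needed per backend.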

In PyMC3, the JAX-related code is in pymc3\sampling_jax.py, for example:

def sample_numpyro_nuts(
...
):
    model = modelcontext(model)

    seed = jax.random.PRNGKey(random_seed)

    rv_names = [rv.name for rv in model.value_vars]
    init_state = [model.initial_point[rv_name] for rv_name in rv_names]
    init_state_batched = jax.tree_map(lambda x: np.repeat(x[None, ...], chains, axis=0), init_state)
    init_state_batched_at = [at.as_tensor(v) for v in init_state_batched]

If I understand correctly, JAX-specific code makes up only a small fraction of the PyMC3 source code. Most of the code still uses vanilla NumPy, which is actually great for NEP-47.

learning-chip · July 1, 2021, 2:09am

If a tensor framework supports basic array operations (the Array API core, see API specification), as well as higher-level solvers like linalg.cholesky, linalg.solve, linalg.det, and linalg.eig, then it should be quite straightforward to add it as a new backend, I guess?
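
To illustrate what "backend-agnostic" would mean in practice, here is a rough sketch that looks up the array's own namespace through `__array_namespace__` (the hook defined by the array API standard) and then calls only namespace-level functions. The helper names `get_namespace` and `quadratic_form` are made up for this example, and the fallback to NumPy for arrays lacking the hook is an assumption of the sketch:

```python
import numpy as np


def get_namespace(x):
    # Arrays that implement the standard expose __array_namespace__();
    # falling back to NumPy for anything else is an assumption of this sketch.
    if hasattr(x, "__array_namespace__"):
        return x.__array_namespace__()
    return np


def quadratic_form(cov, resid):
    """Compute resid^T cov^{-1} resid using only the array's own namespace."""
    xp = get_namespace(cov)
    sol = xp.linalg.solve(cov, resid)  # solves cov @ sol = resid
    return xp.sum(resid * sol)


cov = np.array([[2.0, 0.0], [0.0, 2.0]])
resid = np.array([2.0, 4.0])
print(quadratic_form(cov, resid))
```

The function body never imports a specific framework, so any library whose arrays return a namespace with `linalg.solve` and `sum` should be usable without changes.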

Any important features that the backend framework must support? For example static vs dynamic graph? Automatic differentiation?

twiecki · July 1, 2021, 3:23pm

That’s an interesting idea and could definitely be done. You already identified that we could probably just copy a lot from the JAX implementation. This is probably better discussed as an Aesara issue.
