Skip to content

Commit

Permalink
docs: initial prototype of exporting Lux models to Jax (#1088)
Browse files Browse the repository at this point in the history
* docs: initial prototype of exporting Lux models to Jax

* docs: update title

* fix: multiple fixes

* docs: add note on col-major
  • Loading branch information
avik-pal authored Nov 17, 2024
1 parent f1cb52c commit cf8fd61
Show file tree
Hide file tree
Showing 3 changed files with 147 additions and 0 deletions.
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -43,3 +43,4 @@ benchmarks/results

# Generated by tutorials
pinn_nested_ad.gif
*.mlir
4 changes: 4 additions & 0 deletions docs/src/.vitepress/config.mts
Original file line number Diff line number Diff line change
Expand Up @@ -306,6 +306,10 @@ export default defineConfig({
text: "Compiling Lux Models",
link: "/manual/compiling_lux_models",
},
{
text: "Exporting Lux Models to Jax",
link: "/manual/exporting_to_jax",
},
],
},
{
Expand Down
142 changes: 142 additions & 0 deletions docs/src/manual/exporting_to_jax.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,142 @@
# Exporting Lux Models to Jax (via EnzymeJAX & Reactant)

!!! danger "Experimental"

This feature is experimental and is subject to change without notice. Additionally,
this feature currently requires some manual setup for interacting with Jax, which we are
working on improving.

In this manual, we will go over how to export Lux models to StableHLO and use
[EnzymeJAX](https://github.com/EnzymeAD/Enzyme-JAX) to run integrate Lux models with
JAX. We assume that users are familiar with
[Reactant compilation of Lux models](@ref reactant-compilation).

```@example exporting_to_stablehlo
using Lux, Reactant, Random
const dev = reactant_device()
```

We simply define a Lux model and generate the stablehlo code using `Reactant.@code_hlo`.

```@example exporting_to_stablehlo
model = Chain(
Conv((5, 5), 1 => 6, relu),
MaxPool((2, 2)),
Conv((5, 5), 6 => 16, relu),
MaxPool((2, 2)),
FlattenLayer(3),
Chain(
Dense(256 => 128, relu),
Dense(128 => 84, relu),
Dense(84 => 10)
)
)
ps, st = Lux.setup(Random.default_rng(), model) |> dev;
```

Generate an example input.

```@example exporting_to_stablehlo
x = randn(Random.default_rng(), Float32, 28, 28, 1, 4) |> dev;
```

Now instead of compiling the model, we will use `Reactant.@code_hlo` to generate the
StableHLO code.

```@example exporting_to_stablehlo
hlo_code = @code_hlo model(x, ps, st)
```

Now we just save this into an `mlir` file.

```@example exporting_to_stablehlo
open("exported_lux_model.mlir", "w") do io
write(io, string(hlo_code))
end
```

Now we define a python script to run the model using EnzymeJAX.

```python
from enzyme_ad.jax import primitives

import jax
import jax.numpy as jnp

with open("exported_lux_model.mlir", "r") as file:
code = file.read()


def run_lux_model(
x,
weight1,
bias1,
weight3,
bias3,
weight6_1,
bias6_1,
weight6_2,
bias6_2,
weight6_3,
bias6_3,
):
return primitives.ffi_call(
x,
weight1,
bias1,
weight3,
bias3,
weight6_1,
bias6_1,
weight6_2,
bias6_2,
weight6_3,
bias6_3,
out_shapes=[
jax.core.ShapedArray([4, 10], jnp.float32),
],
fn="main",
source=code,
lang=primitives.LANG_MHLO,
pipeline_options=primitives.JaXPipeline(""),
)


# Note that all the inputs must be transposed, i.e. if the julia function has an input of
# shape (28, 28, 1, 4), then the input to the exported function called from python must be
# of shape (4, 1, 28, 28). This is because multi-dimensional arrays in Julia are stored in
# column-major order, while in JAX/Python they are stored in row-major order.

# Input as defined in our exported Lux model
x = jax.random.normal(jax.random.PRNGKey(0), (4, 1, 28, 28))

# Weights and biases corresponding to `ps` and `st` in our exported Lux model
weight1 = jax.random.normal(jax.random.PRNGKey(0), (6, 1, 5, 5))
bias1 = jax.random.normal(jax.random.PRNGKey(0), (6,))
weight3 = jax.random.normal(jax.random.PRNGKey(0), (16, 6, 5, 5))
bias3 = jax.random.normal(jax.random.PRNGKey(0), (16,))
weight6_1 = jax.random.normal(jax.random.PRNGKey(0), (256, 128))
bias6_1 = jax.random.normal(jax.random.PRNGKey(0), (128,))
weight6_2 = jax.random.normal(jax.random.PRNGKey(0), (128, 84))
bias6_2 = jax.random.normal(jax.random.PRNGKey(0), (84,))
weight6_3 = jax.random.normal(jax.random.PRNGKey(0), (84, 10))
bias6_3 = jax.random.normal(jax.random.PRNGKey(0), (10,))

# Run the exported Lux model
print(
jax.jit(run_lux_model)(
x,
weight1,
bias1,
weight3,
bias3,
weight6_1,
bias6_1,
weight6_2,
bias6_2,
weight6_3,
bias6_3,
)
)
```

0 comments on commit cf8fd61

Please sign in to comment.