Support for multidim outputs #266

wiseodd · 2024-12-06T16:34:03Z

Closes #252

Thanks to Kazuki for updating asdfghjkl on PyPI: we can now handle 3D tensor outputs (e.g. in LLMs) with the GLM predictive.

wiseodd · 2024-12-06T16:34:41Z

WIP adding an article based on lm_example.py to the docs.

wiseodd · 2024-12-06T17:32:14Z

@runame @aleximmer ready to review!

aleximmer

Thanks for the useful addition. To make sure this breaks nothing, I would recommend adding the test mentioned in the review; otherwise LGTM.

docs/lm_example.md

laplace/baselaplace.py

laplace/curvature/asdl.py

aleximmer · 2025-01-19T15:23:09Z

tests/test_baselaplace.py

Could you add a test that checks equivalence between a "trivial" multidimensional model and a previously supported model? This can be just a linear model that previously mapped [num_data, D] -> [num_data, K] and now maps [num_data / L, L, D] -> [num_data / L, L, K]; that is, the data set is simply reshaped. This should give an equivalent posterior and predictive. To be safe, it would be good to also have the option of using a NN in the test, not only the linear model.
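The premise of the requested test can be sketched in plain PyTorch. This is only an illustration of the reshaping idea, not the test added in this PR: the sizes, the `outputs_match` helper, and the small MLP are hypothetical, and the sketch only checks that the model outputs agree. A real test would go on to compare the Laplace posterior and predictive as well.

```python
import torch
from torch import nn

torch.manual_seed(0)
num_data, L, D, K = 12, 3, 4, 2  # illustrative sizes; L must divide num_data


def outputs_match(model: nn.Module) -> bool:
    """Compare outputs on the original data and on the same data reshaped."""
    X = torch.randn(num_data, D)
    X_multidim = X.reshape(num_data // L, L, D)  # same rows, extra batch dim
    with torch.no_grad():
        f_std = model(X)                # [num_data, K]
        f_multidim = model(X_multidim)  # [num_data / L, L, K]
    assert f_multidim.shape == (num_data // L, L, K)
    # Flatten the extra dim back and compare entry by entry.
    return torch.allclose(f_std, f_multidim.reshape(num_data, K), atol=1e-5)


# Both layers types below act on the last dimension only, so reshaping the
# leading dims should not change any individual output.
linear = nn.Linear(D, K)
mlp = nn.Sequential(nn.Linear(D, 8), nn.Tanh(), nn.Linear(8, K))

linear_ok = outputs_match(linear)
mlp_ok = outputs_match(mlp)
```

Because the outputs coincide row by row, any difference in the posterior or predictive between the two setups would have to come from how the extra dimension is handled downstream, which is what the requested test is meant to catch.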

wiseodd · 2025-02-04T19:34:23Z

laplace/curvature/asdl.py

-                return_outputs=True,
-                batch_size=self._get_batch_size(x),
-            )
+            Ji, f = batch_gradient(self.model, closure, return_outputs=True)


@runame @aleximmer do you know what happened here?

...
Ji, f = batch_gradient(self.model, closure, return_outputs=True)
print("Jac")
print(Ji)
...

with this setup:

X, Y = torch.randn(10, 3), torch.randn(10, 1)
X_multidim, Y_multidim = X.reshape(5, 2, 3), Y.reshape(5, 2, 1)
model_std = nn.Linear(3, 1)
model_multidim = deepcopy(model_std)
...
la = Laplace(model_multidim, ...)
la.fit(...)
la(X_multidim)

prints

Jac
tensor([[ 0.2302,  0.1006,  0.4197,  1.0000],
        [ 0.0000,  0.0000,  0.0000,  0.0000],
        [ 0.3055,  1.1884, -0.4010,  1.0000],
        [ 0.0000,  0.0000,  0.0000,  0.0000],
        [-2.0167,  1.4216, -0.5488,  1.0000],
        [ 0.0000,  0.0000,  0.0000,  0.0000],
        [-0.4076,  0.2810, -1.7831,  1.0000],
        [ 0.0000,  0.0000,  0.0000,  0.0000],
        [ 0.1297,  0.2202, -0.5024,  1.0000],
        [ 0.0000,  0.0000,  0.0000,  0.0000]])

Notice that every other row of the returned Ji is all zeros. This looks like an ASDL issue?

To reproduce: uv run pytest -k "test_predictive_multidim"
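For comparison, a reference Jacobian for the same setup can be computed with `torch.func` instead of ASDL. This is a sketch for debugging, not part of the PR: the `reference_jacobian` helper is hypothetical, and it assumes the parameters are ordered as weight then bias, which is consistent with the column of ones in the printout above. For a linear model, each row should be `[x_i, 1]`, so no row should be all zeros.

```python
import torch
from torch import nn
from torch.func import functional_call, jacrev

torch.manual_seed(0)
X = torch.randn(10, 3)
X_multidim = X.reshape(5, 2, 3)
model = nn.Linear(3, 1)
params = {name: p.detach() for name, p in model.named_parameters()}


def reference_jacobian(x: torch.Tensor) -> torch.Tensor:
    """Jacobian of all (flattened) outputs w.r.t. the flattened parameters."""

    def f(p):
        # Flatten every output dim so each scalar output gets its own row.
        return functional_call(model, p, (x,)).reshape(-1)

    jac = jacrev(f)(params)  # dict: name -> [n_outputs, *param_shape]
    # Concatenate per-parameter blocks in the order of `params` (weight, bias).
    return torch.cat([jac[name].reshape(x.numel() // 3, -1) for name in params], dim=1)


J_std = reference_jacobian(X)                # [10, 4]
J_multidim = reference_jacobian(X_multidim)  # [10, 4], same rows expected
```

If `Ji` from `batch_gradient` is reshaped to the same layout, it can be compared against `J_multidim` with `torch.allclose` to narrow down whether the zero rows originate in ASDL or in the surrounding code.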

wiseodd added 5 commits November 20, 2024 15:51

Make functional (co)variance, curvlinops Jacobian and Hessian computa…

1010e1b

…tirespect multiple batch dims

Add GLM example to lm example

969848b

Address weight-sharing dims in ASDL Jacobian

6c310f1

Update ASDL dep

28cccaa

Merge branch 'main' into glm-multidim

822ed9c

wiseodd added the enhancement label Dec 6, 2024

wiseodd requested review from aleximmer and runame December 6, 2024 16:34

wiseodd self-assigned this Dec 6, 2024

wiseodd added 2 commits December 6, 2024 11:48

Skip LowRankLaplace in multidim tests

6511163

Add doc for multidim predictive

3af1cf0

wiseodd marked this pull request as ready for review December 6, 2024 17:31

aleximmer requested changes Jan 19, 2025

Add Jacobian dimension check

31898fb

wiseodd commented Feb 4, 2025

Add test Jacobian multidim

965b852
