Question on simplex bijector implementation #283

Open
Red-Portal opened this issue Aug 12, 2023 · 9 comments

@Red-Portal
Member

Hi,

It appears that torch.probability simply uses softmax for the simplex bijector.
Is there a reason our simplex transform is much more complicated?
I was also thinking about a GPU-friendly implementation, which seems hard to achieve with the current implementation.

@torfjelde
Member

Softmax isn't bijective. The one we have now is (it maps from d to d-1 dimensions).
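(A quick sketch of why, assuming NNlib's softmax is available: the map is invariant to shifting all logits by a constant, so distinct inputs collapse to the same simplex point.)

```julia
using NNlib: softmax

y = [0.3, -1.2, 2.0]
softmax(y) ≈ softmax(y .+ 5.0)   # true: two different inputs, one output
```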

@devmotion
Member

See also #51.

@Red-Portal
Member Author

Betanalpha does discuss a bijective softmax obtained by arbitrarily fixing the endpoint logit. Any experience with this?
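For reference, a minimal sketch of that idea in plain Julia (hypothetical helper names, not an existing Bijectors.jl API): pin the last logit to zero so the remaining d − 1 logits map bijectively onto the interior of the d-simplex.

```julia
using NNlib: softmax

# Forward: R^{d-1} -> interior of the d-simplex, with the last logit fixed at 0.
fixed_softmax(y::AbstractVector) = softmax(vcat(y, zero(eltype(y))))

# Inverse: log-ratios against the pinned component recover the free logits.
fixed_softmax_inv(x::AbstractVector) = log.(x[1:end-1]) .- log(x[end])
```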

@devmotion
Member

That's supported, e.g., in GPLikelihoods (see perhaps also the discussion in JuliaGaussianProcesses/GPLikelihoods.jl#55).

@Red-Portal
Member Author

Red-Portal commented Aug 13, 2023

Good to know, thanks. Though, back to my original intention: I really wish our simplex bijector could play nicely with GPUs out of the box. Among the non-NF bijectors, the simplex bijector seems like it will be the big challenge in that direction. Do we have any plans for how to pursue this? It does seem to me that the softmax approach would make this much easier.

@Red-Portal
Member Author

Actually, never mind. I just wrote a stick-breaking bijector using array operations, based on the implementations in numpyro and tensorflow. If this were to be added to Bijectors.jl, we'd probably have to add a CUDA array specialization. Let me know how to proceed on this.
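Not the code referenced in this comment, but a rough CPU sketch of the cumsum-based stick-breaking forward map along the lines of the numpyro/TensorFlow Probability implementations (a real GPU version would also need to avoid scalar indexing):

```julia
using LogExpFunctions: logistic

# Unconstrained y ∈ R^{K-1} -> point on the K-simplex, with no sequential loop.
function stickbreak(y::AbstractVector)
    K = length(y) + 1
    z = logistic.(y .- log.(K .- (1:K-1)))              # break proportions in (0, 1)
    logrem = vcat(zero(eltype(z)), cumsum(log1p.(-z)))  # log of the stick remaining
    return vcat(z .* exp.(logrem[1:end-1]), exp(logrem[end]))
end

stickbreak(zeros(3))  # ≈ [0.25, 0.25, 0.25, 0.25]
```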

@devmotion
Member

On Julia >= 1.9, a CUDA specialization could be put in an extension (it could possibly even just be an extension on GPUArrays).
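A rough sketch of what that could look like (file and module names are hypothetical); it would be paired with matching [weakdeps] and [extensions] entries in Project.toml:

```julia
# ext/BijectorsGPUArraysExt.jl -- hypothetical weak-dependency extension
module BijectorsGPUArraysExt

using Bijectors
using GPUArraysCore: AbstractGPUArray

# GPU-friendly method specializations (e.g. for the stick-breaking transform)
# would be defined here, dispatching on AbstractGPUArray inputs.

end
```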

@Red-Portal
Member Author

I do have the feeling that this will have to wait until the batch operation interface is finalized. @torfjelde Do we have an expectation on when that would be?

@sethaxen
Member

There are three main ways to use softmax for simplex transforms. One uses parameter expansion to retain bijectivity: f(y) = [softmax(y); logsumexp(y)]. The other two come from the compositional data analysis literature and are called the additive log-ratio, f(y) = softmax(vcat(y, 0)), and the isometric log-ratio, f(y) = softmax(V * y), for a particular choice of semi-orthogonal matrix V. I'm currently testing the performance of each of these against stick-breaking.
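For concreteness, a rough sketch of the expanded-softmax and ILR maps in plain Julia (not an existing Bijectors.jl API; the additive log-ratio map is just the fixed-logit softmax discussed above):

```julia
using LinearAlgebra
using NNlib: softmax
using LogExpFunctions: logsumexp

# Parameter-expanded softmax: R^d -> (d-simplex) × R, bijective.
expanded_softmax(y) = vcat(softmax(y), logsumexp(y))
expanded_softmax_inv(z) = log.(z[1:end-1]) .+ z[end]

# A semi-orthogonal V with V'V = I and V'1 = 0, taken from the full Q factor
# of a QR factorization of the all-ones column.
function ilr_basis(d)
    Q = qr(ones(d, 1)).Q * Matrix{Float64}(I, d, d)  # materialize the full d×d Q
    return Q[:, 2:end]                               # d × (d-1)
end

# Isometric log-ratio: R^{d-1} -> interior of the d-simplex.
ilr(y, V) = softmax(V * y)
ilr_inv(x, V) = V' * log.(x)   # valid because V'V = I and V'1 = 0
```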
