Conform to PyTorch convention in the loss #163

Open
wiseodd opened this issue Apr 18, 2024 · 1 comment · May be fixed by #177

wiseodd (Collaborator) commented Apr 18, 2024

Currently we always assume that the class index in the model's outputs is -1. While this works for standard models and Huggingface LLMs, there are important models for which this is false, e.g. image-output models where the logit tensor has shape (n_batch, n_classes, height, width), i.e. the class index is 1.
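
For context, a minimal sketch contrasting the two layouts (shapes here are illustrative). Note that PyTorch's own `cross_entropy` already expects the class dimension at index 1 for dense outputs:

```python
import torch

# Standard classifier / Huggingface LLM: class index is the last dim (-1).
logits_flat = torch.randn(8, 10)          # (n_batch, n_classes)

# Image-output model (e.g. segmentation): class index is dim 1, which is
# what torch.nn.functional.cross_entropy expects for (N, C, d1, d2, ...).
logits_img = torch.randn(8, 10, 32, 32)   # (n_batch, n_classes, height, width)
targets_img = torch.randint(0, 10, (8, 32, 32))

loss = torch.nn.functional.cross_entropy(logits_img, targets_img)  # class dim at 1
```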

@wiseodd wiseodd added this to the 0.2 milestone Apr 18, 2024
@wiseodd wiseodd self-assigned this Apr 18, 2024
wiseodd (Collaborator, Author) commented Apr 26, 2024

The current idea is to add an arg in BaseLaplace: logit_class_idx: int = -1. Then, whenever Laplace flattens the logits, it will use that argument to locate the class dimension.

Test cases for a conv last layer, which yields logits of shape (batch_size, n_classes, height, width), should be added to cover this use case; see the sketch below.
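
A minimal sketch of how such flattening could work, assuming a hypothetical helper flatten_logits (not the library's actual API):

```python
import torch

def flatten_logits(logits: torch.Tensor, logit_class_idx: int = -1) -> torch.Tensor:
    """Move the class dim to the end, then flatten all other dims.

    Hypothetical helper illustrating the proposal above.
    """
    n_classes = logits.shape[logit_class_idx]
    # movedim puts the class axis last so reshape yields (n_points, n_classes).
    return logits.movedim(logit_class_idx, -1).reshape(-1, n_classes)

# Conv last layer: (batch_size, n_classes, height, width)
logits = torch.randn(8, 10, 32, 32)
out = flatten_logits(logits, logit_class_idx=1)
assert out.shape == (8 * 32 * 32, 10)

# Default keeps today's behavior for (batch_size, n_classes) outputs.
assert flatten_logits(torch.randn(8, 10)).shape == (8, 10)
```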

@wiseodd wiseodd modified the milestones: 0.2, 0.3 Jul 4, 2024