🐛 Bug
In the LogisticRegression model, loss is computed as the negative log-likelihood of the log-softmax of the softmax of the linear classifier outputs, i.e. loss = F.nll_loss(F.log_softmax(F.softmax(self.linear(x))), y)
To Reproduce
Code sample
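The issue as filed left this section empty; below is a minimal self-contained sketch (using random stand-in logits instead of the actual LogisticRegression module, purely for illustration) that reproduces the discrepancy between the two formulations:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
logits = torch.randn(4, 3)           # stand-in for self.linear(x): raw class scores
targets = torch.tensor([0, 2, 1, 0])

# Current (buggy) computation: softmax applied before log_softmax
buggy = F.nll_loss(F.log_softmax(F.softmax(logits, dim=-1), dim=-1), targets)

# Expected computation: log_softmax applied directly to the raw scores
fixed = F.nll_loss(F.log_softmax(logits, dim=-1), targets)

print(buggy.item(), fixed.item())    # the two losses differ
assert not torch.isclose(buggy, fixed)
```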
Expected behavior
Network loss should be computed as the cross entropy of predictions and targets (negative log-likelihood of the log-softmax of the linear classifier outputs), i.e. loss = F.nll_loss(F.log_softmax(self.linear(x)), y)
Environment
PyTorch Version (e.g., 1.0): 1.8.1
OS (e.g., Linux): RHEL 8.4
How you installed PyTorch (conda, pip, source): pip
Build command you used (if compiling from source): N/A
Python version: 3.9.2
CUDA/cuDNN version: N/A
GPU models and configuration: N/A
Any other relevant information: None
Additional context
This bug should not cause any serious issues, but it deviates from the expected behaviour and reduces the separation between output values, which has the potential to negatively impact training.
The PyTorch implementation of cross-entropy loss includes the log-softmax, so it expects raw unnormalized scores for each class (docs, functional).
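As a quick sanity check of that claim, the following sketch (again with stand-in random logits, for illustration only) confirms that F.cross_entropy on raw scores matches the nll_loss + log_softmax composition:

```python
import torch
import torch.nn.functional as F

logits = torch.randn(4, 3)
targets = torch.tensor([0, 2, 1, 0])

# cross_entropy applies log_softmax internally, so it takes raw scores
ce = F.cross_entropy(logits, targets)
nll = F.nll_loss(F.log_softmax(logits, dim=-1), targets)
assert torch.isclose(ce, nll)
```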
Happy to submit a PR to resolve this if deemed appropriate.