Use handwritten rules for `zero` and `one` #773

MasonProtter · 2024-01-24T12:05:42Z

Attempt to fix the problems raised in SciML/NeuralPDE.jl#791 that was caused by #770

I think this is the right way to generalize the scalar rule to any input type, and always return a ZeroTangent() instead of a NoTangent() like the old @non_differentiable rule did which was quite mathematically suspect.

MasonProtter · 2024-01-24T17:11:38Z

Appears to have the same failure profile as #774. @oxinabox any thoughts on what we should do? If there's problems in NeuralPDE, there's probably also issues in other parts of the ecosystem.

Here's a demo of these methods fixing the problem that NeuralPDE had: SciML/NeuralPDE.jl#792

oxinabox · 2024-01-25T04:09:52Z

Why is this not:

function frule((_, _), ::typeof(zero), x)
    return (zero(x), ZeroTangent())
end

function rrule(::typeof(zero), x)
    zero_pullback(_) = (NoTangent(), ZeroTangent())
    return (zero(x), zero_pullback)
end

# `one`

function frule((_, _), ::typeof(one), x)
    return (one(x), ZeroTangent())
end

function rrule(::typeof(one), x)
    one_pullback(_) = (NoTangent(), ZeroTangent())
    return (one(x), one_pullback)
end

This extra stuff with projection and multiplication all should just cancel away for if the tangent is ZeroTangent().

I agree the NoTangent before was mathematically wrong, and correct is ZeroTangent(), though because of how they act similarly, and under Zygote identically it worked out fine.

We should probably add a test for this since it has caused bugs now.
Should be very simple taking the zero([1,2,3])

Thank you for playing wack-a-mole with this.

MasonProtter · 2024-01-25T09:21:19Z

Why is this not:

Mostly because I was copy-pasting and then tweaking the output from @scalar_rule, I can simplify it though.

We should probably add a test for this since it has caused bugs now.
Should be very simple taking the zero([1,2,3])

can do

oxinabox · 2024-01-25T11:22:20Z

test failures on 1.x are unrelated.
1.6 is passing which is enough

Use handwritten rules for zero and one

9972f36

formatting

275b93b

MasonProtter added 3 commits January 25, 2024 10:23

simplify zero and one rules

0df040f

add some zero/one tests

6e2f430

bump version

cc93409

oxinabox approved these changes Jan 25, 2024

View reviewed changes

oxinabox merged commit e569283 into JuliaDiff:main Jan 25, 2024
6 of 11 checks passed

MasonProtter deleted the patch-1 branch January 25, 2024 11:24

MasonProtter mentioned this pull request Jan 25, 2024

refactor: add rule for zero(::Any) SciML/NeuralPDE.jl#791

Closed

christiangnrd mentioned this pull request Jan 27, 2024

CompatHelper: bump compat for Adapt to 4, (keep existing compat) #762

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use handwritten rules for `zero` and `one` #773

Use handwritten rules for `zero` and `one` #773

MasonProtter commented Jan 24, 2024

MasonProtter commented Jan 24, 2024 •

edited

Loading

oxinabox commented Jan 25, 2024 •

edited

Loading

MasonProtter commented Jan 25, 2024 •

edited

Loading

oxinabox commented Jan 25, 2024 •

edited

Loading

Use handwritten rules for zero and one #773

Use handwritten rules for zero and one #773

Conversation

MasonProtter commented Jan 24, 2024

MasonProtter commented Jan 24, 2024 • edited Loading

oxinabox commented Jan 25, 2024 • edited Loading

MasonProtter commented Jan 25, 2024 • edited Loading

oxinabox commented Jan 25, 2024 • edited Loading

Use handwritten rules for `zero` and `one` #773

Use handwritten rules for `zero` and `one` #773

MasonProtter commented Jan 24, 2024 •

edited

Loading

oxinabox commented Jan 25, 2024 •

edited

Loading

MasonProtter commented Jan 25, 2024 •

edited

Loading

oxinabox commented Jan 25, 2024 •

edited

Loading