Core: Multiple input/param gradient modification #168

chr5tphr · 2022-10-13T16:58:44Z

change the core Hook to support the modification of multiple inputs and params
for this, now each input and parameter that requires a gradient will be hooked, and a backward, which is aware of which the current 'sink' is, will be called for each
use View instead of custom Identity to produce a .grad_fn

Note:

this may be a breaking change for custom hooks based on the old implementation

TODO:

finish implementation:
- parameters have no grad_fn, and we cannot simply overwrite them with a view; hooking directly with tensor hooks is problematic when the parameters are used in different functions
- there may be potentially a better approach than calling the backward function once per 'sink', although the current implementation may allow for better modularity - multiple outputs are still not supported, it may be worth to think how to do it, however, it may also be better to do this at a later stage
implement tests
- new tests for the new functionality: multiple inputs and params in hooks
- fix old tests that assume the use of Identity and are not sink-aware
add documentation

- change the core Hook to support the modification of multiple inputs and params - for this, now each input and parameter that requires a gradient will be hooked, and a backward, which is aware of which the current 'sink' is, will be called for each - use View instead of custom Identity to produce a .grad_fn Note: - this may be a breaking change for custom hooks based on the old implementation TODO: - finish implementation: - parameters have no grad_fn, and we cannot simply overwrite them with a view; hooking directly with tensor hooks is problematic when the parameters are used in different functions - there may be potentially a better approach than calling the backward function once per 'sink', although the current implementation may allow for better modularity - multiple outputs are still not supported, it may be worth to think how to do it, however, it may also be better to do this at a later stage - implement tests - new tests for the new functionality: multiple inputs and params in hooks - fix old tests that assume the use of Identity and are not sink-aware - add documentation

- use additions to forward hooks in torch 2.0.0 to pass kwargs to pass keyword arguments - handle multiple inputs and outputs in core.Hook and core.BasicHook, by passing all required grad_outputs and inputs to the backward implementation TODO: - finish draft and test implementation - add tests - add documentation - This stands in conflict with #168, but promises a better implementation by handling inputs and outpus as common to a single function, rather than individually as proposed in #168

- use additions to forward hooks in torch 2.0.0 to pass kwargs to pass keyword arguments - handle multiple inputs and outputs in core.Hook and core.BasicHook, by passing all required grad_outputs and inputs to the backward implementation TODO: - attribution scores are currently wrong in BasicHook, likely an issue with the gradient inside BasicHook? Might be some cross-terms interacting that should not interact - finish draft and test implementation - add tests - add documentation - This stands in conflict with #168, but promises a better implementation by handling inputs and outpus as common to a single function, rather than individually as proposed in #168

chr5tphr mentioned this pull request Oct 13, 2022

Layer-wise LRP score #167

Open

chr5tphr force-pushed the hook-multi-input-param branch from 5672160 to 8f11583 Compare November 4, 2022 17:01

chr5tphr mentioned this pull request Feb 15, 2023

Module with Multiple Inputs #176

Open

chr5tphr mentioned this pull request May 2, 2023

Obtanining graident of LRP otput w.r.t. network parameters #183

Closed

This was referenced Aug 10, 2023

Core: Multiple Inputs, Outputs, and Keyword Arguments #196

Draft

Proper way of handling classifier heads in Transformers #192

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Core: Multiple input/param gradient modification #168

Core: Multiple input/param gradient modification #168

chr5tphr commented Oct 13, 2022 •

edited

Loading

Core: Multiple input/param gradient modification #168

Are you sure you want to change the base?

Core: Multiple input/param gradient modification #168

Conversation

chr5tphr commented Oct 13, 2022 • edited Loading

chr5tphr commented Oct 13, 2022 •

edited

Loading