Stellarpower.fix.new optimiser api for gradient monitoring #7611

Draft
stellarpower wants to merge 2 commits into main

Conversation

stellarpower

Description

Beginning of some changes needed for compatibility with Keras v3. Feel free to use this as a springboard and cherry-pick some lines:

- The main addition is changing the gradient callback to use the non-legacy optimiser API.
- Also bundled in is a fix for !7578.

The legacy optimiser API is gone in Keras v3. This version uses the current API, available in v2 and v3.

The _resource_apply_dense() etc. functions are no longer overridable ("virtual") in the parent class, so another mechanism is needed. This change simply overrides the base apply_gradients() function to perform the work instead.
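For illustration, a minimal sketch of that pattern, assuming a subclass of the stock SGD optimiser; the class name and the list used to record gradients are hypothetical, not this PR's actual code:

    import tensorflow as tf

    class _GradientMonitorOptimizer(tf.keras.optimizers.SGD):
        # Records incoming gradients before applying them. Uses only the
        # current (non-legacy) optimiser API, so the same subclass should
        # work under Keras v2.11+ and Keras v3, where
        # _resource_apply_dense() can no longer be overridden.

        def __init__(self, *args, **kwargs):
            super().__init__(*args, **kwargs)
            self.observed_gradients = []  # one entry per training step

        def apply_gradients(self, grads_and_vars, **kwargs):
            grads_and_vars = list(grads_and_vars)
            # Stash the raw gradients for later inspection. Appending to a
            # Python list only happens step-by-step under eager execution,
            # hence run_eagerly=True in the compile() call further down.
            self.observed_gradients.append([g for g, _ in grads_and_vars])
            # Defer the actual variable updates to the base class.
            return super().apply_gradients(grads_and_vars, **kwargs)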

Performed some basic tests using [this gist](https://gist.github.com/stellarpower/24fd6b1cbd864a088ec2a5f3e8a9fb26/edit), but it was pasted together in the browser and may be far from perfect in general. Providing it as something to work from at least.

Overriding the config and then calling the parent also did nothing, so that override has been deleted.

To facilitate debugging, change:

    grad_acc_model.compile(
        loss=self.Model.loss,
        optimizer=_CustomOptimizer(),
        jit_compile=False,  # disable XLA compilation so Python-level hooks run
        run_eagerly=True,   # execute eagerly so apply_gradients() can be stepped through
    )

to run eagerly.

self.model is a property, so the attribute needs another name.
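A minimal sketch of the clash, with hypothetical class names (Parent stands in for whichever base class exposes model as a read-only property):

    class Parent:
        @property
        def model(self):
            return self._model

    class GradientMonitor(Parent):
        def __init__(self, keras_model):
            # self.model = keras_model  # AttributeError: can't set attribute
            self.Model = keras_model    # hence a differently-named attribute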

The gradient accumulator used the legacy optimisers, which are now removed. The gradients themselves have not been tested yet, but this runs.
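As a rough illustration of what porting the accumulator to the current API could look like, a sketch under stated assumptions (the class name, step counter, and accumulation logic are all assumptions, and it presumes eager execution as configured above, not this PR's actual code):

    import tensorflow as tf

    class _GradientAccumulator(tf.keras.optimizers.SGD):
        # Accumulates gradients over several steps before applying them.
        # The legacy-API version could hook _resource_apply_dense(); with
        # that hook gone, accumulation moves into apply_gradients() instead.

        def __init__(self, accumulation_steps=4, **kwargs):
            super().__init__(**kwargs)
            self.accumulation_steps = accumulation_steps
            self._step = 0
            self._accumulated = None

        def apply_gradients(self, grads_and_vars, **kwargs):
            grads_and_vars = list(grads_and_vars)
            if self._accumulated is None:
                # Lazily create one buffer per variable (eager mode only).
                self._accumulated = [
                    tf.Variable(tf.zeros_like(v), trainable=False)
                    for _, v in grads_and_vars
                ]
            for buffer, (g, _) in zip(self._accumulated, grads_and_vars):
                buffer.assign_add(g)
            self._step += 1
            if self._step % self.accumulation_steps == 0:
                variables = [v for _, v in grads_and_vars]
                means = [b / self.accumulation_steps for b in self._accumulated]
                result = super().apply_gradients(zip(means, variables), **kwargs)
                for buffer in self._accumulated:
                    buffer.assign(tf.zeros_like(buffer))
                return result
            return self.iterations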

codecov bot commented May 8, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 55.66%. Comparing base (5612829) to head (2bf4a56).
Report is 11 commits behind head on main.

Additional details and impacted files


@@             Coverage Diff             @@
##             main    #7611       +/-   ##
===========================================
- Coverage   75.76%   55.66%   -20.10%     
===========================================
  Files         502      500        -2     
  Lines       54003    53056      -947     
===========================================
- Hits        40914    29533    -11381     
- Misses      12676    23163    +10487     
+ Partials      413      360       -53     
Flag     Coverage Δ
func     ?
system   ?
unit     55.66% <ø> (-0.67%) ⬇️

Flags with carried forward coverage won't be shown.

see 241 files with indirect coverage changes
