[Bug] Revise the _remove_state_dict_prefix and _add_state_dict_prefix functions in timm.py to adapt to the case of multiple submodels.#1295
Open
wilxy wants to merge 3 commits into open-mmlab:dev-1.x
Conversation
Collaborator
Please sign the CLA so that I can review your PR.
Member
Hello, can you sign the CLA and fix the lint problem? Then we can merge the PR. @wilxy
Author
Thanks for the reminder, I've signed the CLA and fixed the lint problem.
Collaborator
Hi @wilxy, can you migrate this PR to the main branch?
When using `TimmClassifier` as the student or teacher model in a knowledge distillation algorithm, there are bugs in `save_checkpoint` and `load_checkpoint`.

`save_checkpoint`

When saving a checkpoint with `save_checkpoint(self.state_dict(), 'xxx.pth')`, where `self` is a knowledge distillation algorithm containing the submodels `self.student` and `self.teacher`, `self.state_dict()` recursively calls the `state_dict` function. The `_remove_state_dict_prefix` function of the `TimmClassifier` class is registered as a hook to modify the original `destination`. Specifically, `_remove_state_dict_prefix` creates a `new_state_dict`, whose memory address differs from that of the original `destination`, and returns it as the `hook_result` for the `student` and `teacher` submodels. However, the `state_dict` function of the knowledge distillation algorithm model never receives this modification, so the memory address and values of `destination` remain unchanged. To solve this problem, we change `_remove_state_dict_prefix` to modify the `state_dict` in place instead of creating a `new_state_dict`.
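The difference between the two behaviors can be sketched with plain dicts (a hypothetical simulation of the hook mechanics; the key names, the `model.` wrapper prefix, and the helper names are illustrative, not the actual timm.py code):

```python
# Buggy version: builds a new dict and returns it. PyTorch's state_dict()
# only rebinds its local `destination` to a hook's return value inside the
# submodule's own call; the parent keeps its original dict, so for nested
# submodels (student/teacher) the renamed keys are lost.
def remove_prefix_buggy(state_dict, prefix):
    internal = prefix + 'model.'
    return {prefix + k[len(internal):] if k.startswith(internal) else k: v
            for k, v in state_dict.items()}

# Fixed version: mutates the shared dict in place, so the parent's
# destination reflects the change without relying on the return value.
def remove_prefix_fixed(state_dict, prefix):
    internal = prefix + 'model.'
    for k in list(state_dict):
        if k.startswith(internal):
            state_dict[prefix + k[len(internal):]] = state_dict.pop(k)

def parent_state_dict(hook):
    # Simulate the distillation algorithm collecting keys from two submodels.
    destination = {'student.model.weight': 1, 'teacher.model.weight': 2}
    # The parent ignores any return value, exactly as nn.Module.state_dict
    # does when it recurses into its submodules.
    hook(destination, 'student.')
    hook(destination, 'teacher.')
    return destination
```

With the in-place version, `parent_state_dict(remove_prefix_fixed)` yields keys `student.weight` and `teacher.weight`; with the buggy version the original `*.model.*` keys survive unchanged because the returned dict is discarded.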
`load_checkpoint`

When loading a checkpoint of a knowledge distillation algorithm model whose student and teacher are both `TimmClassifier`, the `_add_state_dict_prefix` function of the `TimmClassifier` class is used as a hook to modify the `state_dict` for each submodel. While processing the student submodel, `_add_state_dict_prefix` also deletes all keys belonging to the `teacher` submodel: for a key that does not contain the current prefix, the computed `new_key` equals the original key, so the unconditional deletion removes the entry it has just written. To solve this problem, we change `_add_state_dict_prefix` to delete a key only when it differs from its `new_key`.