Memoize network calls in model registry by nico-martin · Pull Request #1547 · huggingface/transformers.js

nico-martin · 2026-02-28T06:51:13Z

The ModelRegistry makes calls to the model host at two points: fetching the model config (config.json) and fetching file metadata. When a consumer first calls ModelRegistry.get_model_files to inspect what a model needs (for example, to display the total size on the download button) and then creates the pipeline, both of those network calls are made twice, which is unnecessary.

This PR introduces a small memoizePromise utility that deduplicates promises by key. Whether the first call is still in-flight or already resolved, any subsequent call with the same key gets the same promise back. This is then applied to get_file_metadata and the new get_config helper in get_model_files.js, so the config and metadata fetches each happen at most once per unique set of arguments.
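To illustrate the idea, here is a minimal sketch of promise deduplication by key. It is not the code from this PR: the `(key, factory)` signature, the module-level `promiseCache` map, the `fakeFetchConfig` stand-in, and the choice to drop rejected promises from the cache are all assumptions made for the example.

```javascript
// Hypothetical sketch of key-based promise memoization (not the PR's actual code).
const promiseCache = new Map();

/**
 * Return the cached promise for `key`, or create it with `factory` and cache it.
 * The promise itself is stored (not the resolved value), so callers that arrive
 * while the first request is still in flight share the same promise.
 */
function memoizePromise(key, factory) {
  if (promiseCache.has(key)) {
    return promiseCache.get(key);
  }
  const promise = factory().catch((error) => {
    // Assumption for this sketch: evict failures so a later call can retry.
    promiseCache.delete(key);
    throw error;
  });
  promiseCache.set(key, promise);
  return promise;
}

// Stand-in for a network request; counts how often it actually runs.
let fetchCount = 0;
function fakeFetchConfig(modelId) {
  fetchCount += 1;
  return Promise.resolve({ model_id: modelId });
}

// Hypothetical wrapper: the key encodes the arguments that identify the request.
function getConfig(modelId) {
  return memoizePromise(`config:${modelId}`, () => fakeFetchConfig(modelId));
}

const firstCall = getConfig("example/model");
const secondCall = getConfig("example/model"); // reuses the in-flight promise
```

Both calls return the same promise object, and `fakeFetchConfig` runs only once for that key. A different model id produces a different key and therefore a separate request.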

HuggingFaceDocBuilderDev · 2026-02-28T06:54:28Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

xenova · 2026-03-01T00:24:05Z

packages/transformers/src/utils/memoize_promise.js

Nice idea! I think -- in any case, we should add a max cache size, kind of like how we do here: https://github.com/huggingface/tokenizers.js/blob/488865017c65e9b84b84ede4d7fa89f348dc0fea/src/utils/data-structures/LRUCache.ts#L4
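One way a size limit could look is sketched below, using the insertion order of a `Map` for least-recently-used eviction. This is only an illustration of the suggestion, not the linked LRUCache implementation and not code from this PR; `createBoundedPromiseCache`, `maxSize`, and the `loadMetadata` helper are hypothetical names.

```javascript
// Hypothetical sketch: a promise cache bounded by LRU eviction.
function createBoundedPromiseCache(maxSize) {
  // A Map iterates in insertion order, so the first key is the least recently used.
  const entries = new Map();
  return {
    get(key, factory) {
      if (entries.has(key)) {
        const hit = entries.get(key);
        // Re-insert to mark this key as most recently used.
        entries.delete(key);
        entries.set(key, hit);
        return hit;
      }
      const promise = factory();
      entries.set(key, promise);
      if (entries.size > maxSize) {
        // Evict the least recently used entry.
        const oldestKey = entries.keys().next().value;
        entries.delete(oldestKey);
      }
      return promise;
    },
    has(key) {
      return entries.has(key);
    },
    get size() {
      return entries.size;
    },
  };
}

// Usage with a stand-in loader that counts how often it runs.
const metadataCache = createBoundedPromiseCache(2);
let loadCount = 0;
function loadMetadata(fileName) {
  return metadataCache.get(fileName, () => {
    loadCount += 1;
    return Promise.resolve({ fileName });
  });
}

loadMetadata("a.onnx");
loadMetadata("b.onnx");
loadMetadata("a.onnx"); // cache hit; "a.onnx" becomes most recently used
loadMetadata("c.onnx"); // exceeds maxSize, so "b.onnx" is evicted
```

An evicted key simply triggers a fresh request the next time it is asked for, so the bound trades a possible repeat fetch for a predictable memory footprint.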

nico-martin and others added 30 commits January 29, 2026 22:56

added progress_total progress callback status info

dfebb4a

added get_file_metadata helper

9216326

some clean up

17f9855

improved get_file_metadata and get_files

4fbe9a1

added functions to main export

ba4a4ba

removed dynamic import

6249f31

restructuring

be8d7bb

refactored the pipeline tasks so I can have a get_pipeline_files func…

9f3e224

…tion that does not check for tokenizer files or processor files if the task does not use them

updated doc

7b327a6

Update packages/transformers/src/utils/core.js

32dad76

Co-authored-by: Joshua Lochner <admin@xenova.com>

added is_pipeline_cached and improved return object

46433d6

fixes after review

4eeed39

added ModelRegistry to doc

f19ccfd

added clear_cache and clear_pipeline_cache

3430421

Update packages/transformers/src/utils/cache/clear_cache.js

be5a6b2

Co-authored-by: Joshua Lochner <admin@xenova.com>

small doc fix

563e872

Merge branch 'v4-cache-handler' of github.com:huggingface/transformer…

01d5b23

…s.js into v4-cache-handler

changed delete logic for cache

856d302

fixed examples in cache utilitiy files

4cb293d

fixed examples

83170d1

renamed type

b9d8c33

refactoring get_file_metadata

056843f

moved src/utils/pipeline-tasks.js to src/pipelines/index.js

4f38c6d

fixed doc builder

138a6d9

fixed doc builder

e45bbbb

created shared getFetchHeaders function

ef6c196

added case for DecoderOnlyWithoutHead and DecoderOnly

dfdfa41

Merge branch 'main' into v4-cache-handler

ffc3870

improved console.warn

3a19ca9

changed to modelType = MODEL_TYPES.EncoderOnly if not foundInMapping

ded0a38

xenova and others added 25 commits February 26, 2026 17:32

Remove test file

c376149

pnpm format

5b64758

Cleanup

333d8a7

use config from_pretrained logic for ensuring config is of correct type

842334c

Reorder file acquisition

34d1299

Add example JSDoc to file header

7ef10ad

Add ModelRegistry tests

0248d90

FIXME: skip cache clearing tests

131d1df

breaks simultaneous loading

Formatting

3ba1c30

Unify model-loader.js, get_model_files.js, and session.js

f8679af

console.warn to logger.warn

1c34775

Only resolve dtype once in session.js

8bbe229

Use map + Promise.all

d794990

map -> forEach

499ae00

Use env.fetch instead of global fetch

e7cb64d

Add comment to clear_cache for clarity

bbc0fec

Add model_file_name support in cache operations

47ea158

Update cache tests

14400e8

Fix TOCTOU race condition

e641672

Remove dead code

59b2e64

cleanup

a49c0bf

Cleanup pipeline import/exports

36bde74

renamed folder cache to model_registry

9e10bf7

added memoization for promises

0c18ba5

clean up

9793d3b

nico-martin requested a review from xenova February 28, 2026 07:08

xenova reviewed Mar 1, 2026

View reviewed changes

Base automatically changed from v4-cache-handler to main March 1, 2026 00:27

Merge branch 'main' into v4-cache-handler-memo

ab16231

nico-martin wants to merge 62 commits into main from v4-cache-handler-memo

3 participants
