
Refactor loaders module#2719

Open
evansd wants to merge 15 commits into main from evansd/refactor-loaders

Conversation


@evansd evansd commented Mar 17, 2026

This refactors some of the code for loading user-supplied modules to use a single ModuleDetails type that represents all the values we might care about in one of these modules, including potential errors. The aim is more consistent handling of these modules, so that more features can work with either datasets or measures (see #2313).

We recently added measures support to create-dummy-tables, but this was made unnecessarily difficult by the way loaders were implemented. This refactoring aims to change that.

We also treat running user-supplied modules in "debug" mode as a separate task from serializing their contents, which removes some incidental complexity.
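As a rough illustration of the shape the PR describes, a single type can collect everything we might want from a user-supplied module, with errors captured as values rather than raised. This is a hypothetical sketch; the field names are invented for the example, not taken from the actual codebase:

```python
from dataclasses import dataclass


@dataclass
class ModuleDetails:
    # Everything we might care about from a user-supplied module. Errors are
    # stored rather than raised so the caller can decide which ones matter.
    dataset: object = None
    measures: object = None
    error: Exception = None
```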

@evansd evansd changed the title Evansd/refactor loaders Refactor loaders module Mar 17, 2026
@evansd evansd force-pushed the evansd/refactor-loaders branch from 3ab2394 to 41cde8b Compare March 17, 2026 17:56
@cloudflare-workers-and-pages

cloudflare-workers-and-pages bot commented Mar 17, 2026

Deploying databuilder-docs with Cloudflare Pages

Latest commit: 0fd99bc
Status: ✅  Deploy successful!
Preview URL: https://7476cbf1.databuilder.pages.dev
Branch Preview URL: https://evansd-refactor-loaders.databuilder.pages.dev


This pulls a generic `run_ehrql_command_in_subprocess()` function out of
the more specific `load_definition_in_subprocess()` function.
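The extraction described above might look something like the following sketch. The two function names come from the commit message; the bodies and the `serialize-definition` argument are assumptions made for illustration:

```python
import subprocess
import sys


def run_ehrql_command_in_subprocess(args):
    # Generic: run any ehrql command in a fresh interpreter, capturing output.
    return subprocess.run(
        [sys.executable, "-m", "ehrql", *args],
        capture_output=True,
        text=True,
    )


def load_definition_in_subprocess(definition_file):
    # The specific loader now just delegates to the generic runner.
    return run_ehrql_command_in_subprocess(
        ["serialize-definition", str(definition_file)]
    )
```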
evansd added 13 commits March 24, 2026 11:07
We're not trying to retrieve a value when running `debug`: we just want
to execute the Python and collect any output. Making this a separate
function removes special-casing elsewhere.
We can also simplify the argument handling here by using `choices`.
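The `choices` simplification refers to argparse's built-in validation: instead of checking the value by hand, argparse rejects anything outside the allowed set. A minimal sketch, with an invented option name:

```python
import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--display-format",
    # argparse validates the value for us and rejects anything else,
    # so no special-casing is needed downstream.
    choices=["dataset", "measures"],
    default="dataset",
)
args = parser.parse_args(["--display-format", "measures"])
```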
This is designed to represent all the various values we may be
interested in from a user-supplied definition module.

We need to support serializing errors because the pre-serialization code
will no longer know which kinds of error the calling code cares about.
Instead of having specialised loaders for each definition type we now
have a single function which loads the module, calls various
`populate_*_details` functions to grab potentially relevant attributes
from the module, and returns a `ModuleDetails` object which collects all
these attributes.

This `ModuleDetails` object can then be serialized and passed back to the
parent process, which can then decide which details it cares about.
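The single-loader shape described above can be sketched as follows. The `populate_*_details` names come from the commit message; the `ModuleDetails` fields and function bodies are invented for illustration:

```python
from dataclasses import dataclass
from types import SimpleNamespace


@dataclass
class ModuleDetails:
    dataset: object = None
    measures: object = None


def populate_dataset_details(details, module):
    details.dataset = getattr(module, "dataset", None)


def populate_measure_details(details, module):
    details.measures = getattr(module, "measures", None)


def load_module_details(module):
    # One generic loader: grab every potentially relevant attribute and
    # let the caller decide which ones it cares about.
    details = ModuleDetails()
    populate_dataset_details(details, module)
    populate_measure_details(details, module)
    return details
```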

Note that we now return exceptions rather than raising them immediately.
For example, if we are loading measure definitions we don't care if
`dataset` doesn't have a population defined yet. But we can't just go
ahead and serialize it because it's not possible to construct a
serialized dataset without a population definition. So we return the
error and let the caller decide whether to raise it or not.
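The return-rather-than-raise pattern described above might look like this sketch, where all names are invented for illustration:

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    population: object = None


def serialize_dataset(dataset):
    # Whether a missing population is fatal depends on the caller (loading
    # measure definitions doesn't mind), so return the error rather than
    # raising it and let the caller decide.
    if dataset.population is None:
        return None, ValueError("dataset has no population defined")
    return {"population": dataset.population}, None
```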

To make the change easier to read, we continue passing around the now-useless
`definition_type` arguments. We'll remove these in a later commit.
The serializer, and functions upstream of it, no longer care what type
of definition module they're working with.
The old name made it sound like it was loading a particular "debug" type
of definition, whereas it's not really loading anything (it doesn't
return a value); it's running an arbitrary definition in a particular
mode.