Skip to content

Performance testing: GP activation filtering #2619

@suzannehamilton

Description

@suzannehamilton

Investigate the impact on ehrQL job duration if we implement the more complex version of GP activation filtering.

This probably means implementing version 1b as described here, but we should double-check with Will and Catherine in case there is another proposal which sounds more likely. Alternatively, we could implement the most complex version that has been discussed, to try to get an upper bound on performance issues.

We will have to implement the filtering for all GP tables. See #2602 for the spike we've done on a single table.

For performance testing, we can run jobs on the secure environment, so that we're testing against the real backend. We have a way of running jobs in a testing mode which does not overwrite existing outputs (@evansd can you explain or post a link about this?) We should run each job before and after the changes, rather than comparing the job duration to the last time the study code was run.

Decisions:

  • Exactly which version should we implement in this spike?
  • Which study repos should we use for performance testing?

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions