No-fuss DB connection configuration via libpq PG* environment variables by brontolosone · Pull Request #1748 · getodk/central-backend

brontolosone · 2026-02-11T15:01:24Z

Related

Fixes "Additional SSL configuration for database" (central#1245) 🥳 We now "support" EVERYTHING that PostgreSQL supports 🥳 🌈 🦄
PR "DB config, including port, through libpq env vars" (central#1647), for the corresponding Central docker-compose configuration changes.
Closed PR "improve: add support for database connection URIs" (#1168); this comment introduces the idea behind this PR. This was a PR of mine through which, by allowing connection info to be passed as a URI, I wanted to enable central-backend to connect over a Unix socket. It was my introduction to the whole DB configuration mess. I closed it, as solving the compatibility problems seemed like a Sisyphean task, but now, 1½ years later, I'm back!
Currently unmerged PR "support postgresql other port number" (central#915). A contributor wants to be able to use a database port other than 5432 for an externally provisioned DB in the Docker setup.
Issue "Postgres connection string" (#377) and PR "improve #377: support more ways of connecting to the database." (#394): examples of the fiddly work involved in maintaining a config interface ourselves and tracking down all the knex/slonik presumptions.

Motivation, background

Our code, config, knex (+deptree), and slonik (+deptree) are all miming to each other about DB connection configuration. It's easy to forget that *eventually*, that information just needs to be conveyed to `libpq`, the *actual* bedrock database driver. None of these middlemen inherently cares about the specifics. It's like a bucket brigade of passing and mangling information, but in the end we just want a DB connection.

That whole wrapper mille-feuille is rife with misconceptions (eg, how SSL works / should be conveyed), incomplete exposure of connection features (eg, TCP connection timeouts), and incompatibilities (again, SSL, and also Unix domain sockets).

But it just so happens that libpq (the thing that actually does the thing) is designed so that one can talk directly to it to configure its connections. This works through a set of environment variables that libpq reads from the process environment. It will use that information unless something else is explicitly configured in the connection setup call to libpq. The documentation says about these env vars:

These are useful to avoid hard-coding database connection information into simple client applications, for example.

And indeed one can imagine that it is nice to be able to write a client application without having to re-expose the complete postgres connection parameter interface.
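
To make that precedence concrete, here is a toy model of the lookup order described above: an explicitly passed parameter first, then the PG* environment variable, then a built-in default. It is an illustration only, not libpq's code; `effectiveParam` and `ENV_VAR_FOR` are names made up for this sketch, while the PG* variable names are the documented libpq ones.

```javascript
// Toy model of how a libpq-style client picks a connection parameter value.
// Illustration only: effectiveParam and ENV_VAR_FOR are hypothetical names.
const ENV_VAR_FOR = {
  host: 'PGHOST',
  port: 'PGPORT',
  dbname: 'PGDATABASE',
  user: 'PGUSER',
};

function effectiveParam(keyword, explicitParams, env) {
  // 1. A value passed explicitly in the connection call always wins.
  if (explicitParams[keyword] !== undefined) return explicitParams[keyword];
  // 2. Otherwise the corresponding PG* environment variable is consulted.
  const fromEnv = env[ENV_VAR_FOR[keyword]];
  if (fromEnv !== undefined) return fromEnv;
  // 3. Otherwise the client uses its built-in default (not modelled here).
  return undefined;
}

const exampleEnv = { PGHOST: 'db.example.org', PGPORT: '5433' };
console.log(effectiveParam('port', {}, exampleEnv));               // 5433
console.log(effectiveParam('port', { port: '6432' }, exampleEnv)); // 6432
console.log(effectiveParam('dbname', {}, exampleEnv));             // undefined
```

The point of this PR is to stay out of step 1 entirely, so that step 2 decides.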

So if we adopt that configuration mechanism (which means: we get out of the way!), we can stop worrying about what to support, what configuration interface to present to users, and how to convey things to knex/slonik in their idiosyncratic way (if even supported). 🥳

Advantages would be:

The complete configuration interface can be understood from the documentation straight from the people who make the thing that does the actual thing; no more digging through issues in half-abandoned NPM projects to understand why something doesn't work or how to coax one lib (but then coax the other lib slightly differently). 🥳 Less ambiguity, simpler support.
It's easy for people to test connection setups, without having to fiddle with ODK. One can just set the appropriate environment variables and start psql. If you get a connection, you did it right; if you don't, you did it wrong (and there will probably be some output from the tool hinting at what's wrong). psql, libpq, and postgresql are all from the same vendor; they're not miming to each other, so there's much less ambiguity around why something isn't working. That uniform configuration interface also makes it easy for a sysadmin to communicate connection details and support their users.

Approach

Get out of the way. Don't supply any connection information directly.
Preserve the legacy interface for setting DB connection info. Our legacy DB config interface is the config module, with its cascading inheritance. Initially I toyed a bit with sourcing different files with different environment variables (eg, through the Makefile) depending on whether we're running tests or not, but I didn't quite get good DX that way, plus I think that having to get used to a new config interface would be an impediment to getting this PR in. Thus, I wrapped that interface (oh no! wrapping! interpretation!): one can still set the same limited set of DB config variables that way, but they'll go into env vars rather than into connection setup call parameters. So everything still works! Yet one can also override things through environment variables: eg, if one runs PGDATABASE=someotherdatabase make dev, then that's what you'll get, and it won't be overridden with what's in config. Example:
```
PGDATABASE=staging_odkcentral make dev
node lib/bin/enforce-node-version.js
node lib/bin/run-migrations.js
Warning: Environment variable "PGDATABASE" already set, with value: "staging_odkcentral".
Not applying DB configuration of "database: odkcentral"
```
Here we're warned that the value from the config (odkcentral in this case) is not being applied, because we've explicitly set PGDATABASE. So you'll be made aware of what's going on. No surprises. You're going to connect to the staging_odkcentral database.
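The precedence rule above can be sketched roughly as follows. This is a minimal illustration, not the code in lib/util/load-db-env.js: the function name `applyLegacyDbConfig`, the key-to-variable table, the treatment of empty values, and the password redaction are all assumptions made for this example. Only the PG* variable names and the wording of the warning come from libpq and from the output shown above.

```javascript
// Hypothetical mapping from legacy config keys to libpq environment variables.
const LEGACY_KEY_TO_ENV = {
  host: 'PGHOST',
  port: 'PGPORT',
  database: 'PGDATABASE',
  user: 'PGUSER',
  password: 'PGPASSWORD',
};

// Assumption of this sketch: never echo passwords in warnings.
const redact = (key, value) => (key === 'password' ? '***' : value);

function applyLegacyDbConfig(dbConfig, env = process.env, warn = console.warn) {
  for (const [key, envName] of Object.entries(LEGACY_KEY_TO_ENV)) {
    const value = dbConfig[key];
    // Assumption: unset or empty config values mean "nothing to convey".
    if (value === undefined || value === null || value === '') continue;
    if (env[envName] !== undefined) {
      // The environment wins; tell the user which config value was skipped.
      warn(`Warning: Environment variable "${envName}" already set, with value: "${redact(key, env[envName])}".\n`
        + `Not applying DB configuration of "${key}: ${redact(key, value)}"`);
      continue;
    }
    env[envName] = String(value);
  }
  return env;
}

// Usage: PGDATABASE comes from the environment, the rest from config.
const demoEnv = { PGDATABASE: 'staging_odkcentral' };
applyLegacyDbConfig({ host: 'localhost', database: 'odkcentral' }, demoEnv);
console.log(demoEnv); // { PGDATABASE: 'staging_odkcentral', PGHOST: 'localhost' }
```

Passing `env` and `warn` as parameters keeps the sketch easy to unit test without touching the real process environment.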
As an example of the flexibility, my own default.database config in config/local.json is, with this PR:
```
"database": {
  "host": "/run/postgresql",
  "database": "odkcentral",
  "user": "",
  "password": ""
}
```
which gives me a peer-authenticated Unix socket connection, with my DB user the same as my Unix user. I need to blank the user and password fields, otherwise I'd get the values from higher up in the config precedence waterfall (default.json etc).
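Why the blanking is needed can be shown with a simplified model of the config cascade. The objects below are hypothetical stand-ins for config/default.json and config/local.json (the default values are placeholders, not the real ones), and `cascade` is a one-level merge written for this example rather than the config module's own merge logic.

```javascript
// Hypothetical stand-ins for config/default.json and config/local.json.
const defaults = {
  database: { host: 'localhost', database: 'example_db', user: 'example_user', password: 'example_password' },
};
const localOmitting = {
  database: { host: '/run/postgresql', database: 'odkcentral' },
};
const localBlanking = {
  database: { host: '/run/postgresql', database: 'odkcentral', user: '', password: '' },
};

// Simplified cascade: keys from the more specific file override the defaults,
// and keys that are left out are inherited.
const cascade = (base, override) => ({
  database: { ...base.database, ...override.database },
});

console.log(cascade(defaults, localOmitting).database.user);        // example_user
console.log(cascade(defaults, localBlanking).database.user === ''); // true
```

Leaving a key out inherits the default; setting it to an empty string overrides it.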
The "ssl": true config fragment is also supported, but I thought it appropriate to print a warning, as I don't feel doing SSL without authenticating the other party is on par with the warm feeling of safety that "ssl": true instills. So you'll see this helpful warning:
```
Warning: Database SSL mode set to "require", which means that while the connection will be *encrypted*, it will not be *authenticated*.
However, you can set up strong authentication through the use of libpq environment variables:
https://www.postgresql.org/docs/current/libpq-envars.html#LIBPQ-ENVARS
to configure server certificate verification:
https://www.postgresql.org/docs/current/libpq-ssl.html#LIBQ-SSL-CERTIFICATES.
```
For this config-sourcing mechanism, for which we have to do some interpretation, the guiding principle is: in case of ambiguity, crash or warn.
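As a sketch of that principle applied to the legacy "ssl" setting: `true` maps to the libpq SSL mode "require" with a warning, an absent value means nothing to convey, and anything else is rejected. The function name `sslModeFromLegacyConfig` and the decision to treat every other value (including `false` and option objects) as ambiguous are assumptions of this example, not necessarily what the PR's code does.

```javascript
// Hypothetical helper: translate the legacy "ssl" config value into a value
// for the libpq PGSSLMODE environment variable, or refuse to guess.
function sslModeFromLegacyConfig(ssl, warn = console.warn) {
  // Not configured: nothing to convey, libpq decides.
  if (ssl === undefined || ssl === null) return undefined;
  if (ssl === true) {
    warn('Warning: Database SSL mode set to "require", which means that while '
      + 'the connection will be *encrypted*, it will not be *authenticated*.');
    return 'require';
  }
  // Assumption of this sketch: anything else is ambiguous, so crash.
  throw new Error(`Ambiguous database "ssl" setting: ${JSON.stringify(ssl)}. `
    + 'Use libpq environment variables such as PGSSLMODE instead.');
}

console.log(sslModeFromLegacyConfig(true));      // require (after the warning)
console.log(sslModeFromLegacyConfig(undefined)); // undefined
```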
Now for the Docker setup (central repo), the corresponding PR is central#1647. Via that Central branch, this code is also running on dev.getodk.cloud at the moment.

What has been done to verify that this works as intended?

Tests still pass, but see note at the top re #1746.

Why is this the best possible solution? Were any other approaches considered?

Don't get me started 😆. See background/rationale at the top of this PR.

How does this change affect users? Describe intentional changes to behavior and behavior that could have accidentally been affected by code changes. In other words, what are the regression risks?

For all quotidian setups there should be no regression: legacy config will continue to work. But if people have been patching code as a workaround to get in connection parameters that we didn't support, there may be surprises that are hard to predict 🤷‍♂️

Does this change require updates to the API documentation? If so, please update docs/api.yaml as part of this PR.

Not to the API documentation.
But the central install howto and the .env.example file should be brought in line with this code. It could also be useful to zombiepost into the various forum posts where people have been asking about DB configuration, so that newcomers searching for info will find the up-to-date New Way of Configuring the Database Any Way You Like.

Before submitting this PR, please make sure you have:

run make test and confirmed all checks still pass OR confirm CircleCI build passes
verified that any code from external sources are properly credited in comments or that everything is internally sourced


matthew-white · 2026-02-22T05:36:51Z

lib/util/load-db-env.js

Curious whether you considered adding the function here to the existing lib/util/db.js? That file has been getting kind of big, so I'm open to splitting it up like this, but I'd be curious to learn more about your thinking in this area.

brontolosone · 2026-02-23

I like files as an organizational unit, and they're pretty much free. Well, at least for me they are: df -i tells me I'm not nearly out of inodes yet ;-)

Yes, this functionality is related to the DB, in a sense, but lib/util/db.js by and large has to do with SQL templating and pseudo-ORM functions, while this new configuration functionality doesn't.

This new load-db-env module often needs to be loaded early on, and for that my instinct is that it should be small, simple, and have minimal dependencies (also to avoid cyclic dependencies).

everything in ODK Central backend is related to ODK, maybe we should put everything in one file odk.js, it'd be perfectly organized 🤡

one thing you may have noticed as well is that in this codebase there are quite a few large (to my taste) modules that are informally sectioned using quite decorative header comments, eg in lib/util/db.js we have:
////////////////////////////////////////////////////////////////////////////////
// SLONIK UTIL
which to me is a faint code smell, I feel one should use the language's formal modularization features for modularization, not comments, which impose a cognitive load on human readers. Since I don't like that smell I err on the side of making a module rather than following the (anti-?) pattern of cordoning off a "module section" with elaborate decorative comments as section headers (it even has its own typographical mini-style) within a module.


brontolosone changed the title ~~Db unconfig~~ No-fuss DB connection configuration via libpq PG* environment variablesconfig Feb 11, 2026

brontolosone changed the title ~~No-fuss DB connection configuration via libpq PG* environment variablesconfig~~ No-fuss DB connection configuration via libpq PG* environment variables Feb 11, 2026

brontolosone force-pushed the db-unconfig branch 2 times, most recently from 9a7bb60 to 088afaf, February 11, 2026 18:03

brontolosone requested a review from matthew-white February 11, 2026 18:47

brontolosone added this to ODK Central Feb 11, 2026

brontolosone moved this to ✏️ in progress in ODK Central Feb 11, 2026

brontolosone marked this pull request as ready for review February 11, 2026 18:48

brontolosone force-pushed the db-unconfig branch 2 times, most recently from 007711a to 3d75b9b, February 11, 2026 19:22

matthew-white assigned brontolosone Feb 11, 2026

brontolosone force-pushed the db-unconfig branch 4 times, most recently from 6885630 to c0be247, February 12, 2026 09:11

brontolosone marked this pull request as draft February 13, 2026 07:16

brontolosone force-pushed the db-unconfig branch from c0be247 to b9606dc, February 14, 2026 08:15

brontolosone mentioned this pull request Feb 14, 2026

DB config, including port, through libpq env vars getodk/central#1647

Open

2 tasks

brontolosone marked this pull request as ready for review February 14, 2026 12:36

matthew-white reviewed Feb 18, 2026


brontolosone force-pushed the db-unconfig branch 2 times, most recently from 0c20f66 to 26137b5, February 21, 2026 07:25

matthew-white mentioned this pull request Feb 22, 2026

Configure database via libpq environment variables getodk/central#1661

Open

matthew-white removed this from ODK Central Feb 22, 2026

matthew-white mentioned this pull request Feb 22, 2026

support postgresql other port number getodk/central#915

Closed

matthew-white reviewed Feb 22, 2026


brontolosone force-pushed the db-unconfig branch from 26137b5 to 8f4aaa9, February 23, 2026 15:33

brontolosone added 3 commits February 24, 2026 18:28

un-meddle in DB config affairs: let libpq use PG* environment variables, and support legacy config by setting those variables when defined in config yet unset in environment

71c5694

remove DB connection string formulation tests: we use PG* libpq environment variables now

1e0abd9

port backup/restore to use libpq PG* env variables

0fc9c8b

brontolosone added 2 commits February 24, 2026 18:28

remove SSL configuration comment noise: we're using a setting consistent with the observed earlier behaviour, and that's that.

57a109c

address PR comments

a49b262

brontolosone force-pushed the db-unconfig branch from 8f4aaa9 to 9a4528d, February 24, 2026 18:29

ssl handling fixups, & unit tests

7626a2e

brontolosone force-pushed the db-unconfig branch from 9a4528d to 7626a2e, February 24, 2026 18:58

sans SSL support

aaa6fb2

brontolosone wants to merge 7 commits into getodk:master from brontolosone:db-unconfig

3 participants