Added UMA infrastructure by alongd · Pull Request #895 · ReactionMechanismGenerator/ARC

alongd · 2026-06-07T17:34:25Z

Currently sits on top of #836

kfir4444

Thanks @alongd!
I added some comments, and please note this has to be rebased onto main, since we had some relatively big changes in the ase branch.

kfir4444 · 2026-06-18T20:06:32Z

+# 4) HuggingFace authentication for the gated uma-s-1p1 model.
+if [ "$SKIP_HF_LOGIN" -eq 0 ]; then
+    if $COMMAND_PKG run -n "$ENV_NAME" huggingface-cli whoami &>/dev/null; then
+        echo "✔️  Already authenticated to HuggingFace."


The workflow is not clear to me. Are we not able to utilize the HF_TOKEN if the user has it, or parse it into the script?
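One possible ordering of the authentication checks, as a minimal sketch: honor an explicit skip first, then a token already present in the environment, then an existing cached login, and prompt interactively only as a last resort. The helper `choose_hf_auth` and its return labels are hypothetical and not part of ARC or `huggingface_hub`; whether `install_uma.sh` reads `HF_TOKEN` at all is not settled in this thread.

```python
def choose_hf_auth(env, already_logged_in, skip_login=False):
    """Return which authentication path a setup script could take.

    Hypothetical helper for illustration only.
    env: a mapping of environment variables (e.g. os.environ).
    already_logged_in: result of a prior 'whoami'-style check.
    """
    if skip_login:
        return "skip"          # the user explicitly asked to skip the login step
    if env.get("HF_TOKEN"):
        return "token"         # a non-empty token is already exported
    if already_logged_in:
        return "cached"        # a previous login is still valid
    return "interactive"       # nothing found, so prompt the user
```

Calling `choose_hf_auth(os.environ, already_logged_in=False)` would then tell the script whether a prompt is needed at all.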

kfir4444 · 2026-06-18T20:07:07Z

+PYCODE
+
+# 4) HuggingFace authentication for the gated uma-s-1p1 model.
+if [ "$SKIP_HF_LOGIN" -eq 0 ]; then


SKIP_HF_LOGIN is potentially confusing: are we skipping the login for testing, or because the user is already logged in?

kfir4444 · 2026-06-18T20:09:20Z

+  - setuptools
+  - pip
+  - pip:
+      # fairchem-core pulls in a CUDA-enabled PyTorch by default on Linux.


We should probably let them know that. Also, can we allow users to choose which device they are using? like uma-cpu and uma-gpu?
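A device suffix on the method name could be handled along these lines. This is only a sketch: `split_uma_method` is a hypothetical helper, the `-cpu`/`-gpu` suffixes come from the suggestion above rather than from the PR, and mapping `gpu` to the string `cuda` is an assumption based on the usual PyTorch device naming.

```python
def split_uma_method(method, default_device="cpu"):
    """Split a level method such as 'uma-gpu' into (model_alias, device).

    Hypothetical helper; the suffix convention is an assumption.
    """
    devices = {"cpu": "cpu", "gpu": "cuda"}
    method = method.lower()
    base, sep, suffix = method.rpartition("-")
    if sep and suffix in devices:
        return base, devices[suffix]
    # No recognized device suffix: keep the method as is and use the default.
    return method, default_device
```

With this layout `uma-s-1p1` keeps its default device, while `uma-s-1p1-gpu` selects the GPU without changing the model alias.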

kfir4444 · 2026-06-18T20:11:30Z

        return MOPAC(**kwargs)
-
+
+    elif name in ('uma', 'fairchem'):


Does fairchem have a model that is not UMA? Do we want to leave the option to call fairchem in the input?

kfir4444 · 2026-06-18T20:12:49Z

+            # A TS search needs a saddle-point optimizer; UMA ships none, so use Sella.
            from sella import Sella
-            opt_class = Sella
+            opt = Sella(atoms, order=1 if is_ts else 0, logfile=logfile)


We use Sella when is_ts == True, so why isn't order always 1?
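For context, Sella's `order` is the order of the stationary point being sought: 0 for a minimum, 1 for a first-order saddle point. A conditional is therefore only needed if Sella can also be reached for a non-TS optimization. The sketch below separates the choice from the construction; `pick_optimizer` and the string names it returns are hypothetical and not the code in `ase_script.py`.

```python
def pick_optimizer(engine_name, is_ts):
    """Return (optimizer_name, extra_kwargs) for an ASE-style optimizer.

    Hypothetical helper for illustration only.
    """
    if is_ts:
        # A TS search always targets a first-order saddle point.
        return "sella", {"order": 1}
    if engine_name == "sella":
        # Sella requested for a plain minimization.
        return "sella", {"order": 0}
    # Fall back to BFGS when no engine is given, as the original code did.
    return engine_name or "bfgs", {}
```

If the Sella branch is only ever entered when `is_ts` is true, the second case never fires and `order=1` could be hard-coded.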

kfir4444 · 2026-06-18T20:14:44Z

        save_current_geometry(output, atoms, xyz)

+    if job_type == 'irc':
+        from sella import IRC  # VERIFY the Sella IRC API in the installed sella


Can you drop the comment? Did we verify the IRC API?

kfir4444 · 2026-06-18T20:15:47Z

-            opt_class = engine_dict.get(engine_name, BFGS)
-        opt = opt_class(atoms, logfile=os.path.join(os.path.dirname(input_path), 'opt.log'))
-
+            opt = engine_dict.get(engine_name, BFGS)(atoms, logfile=logfile)


I think the original implementation was more readable.

kfir4444 · 2026-06-18T20:18:50Z

+        if is_atom or is_triplet_o2:
+            label = self.species[0].label if self.species else 'species'
+            logger.warning(f'Computing a UMA single point for {label} (an isolated atom or triplet O2). '
+                           f'UMA absolute energies are unreliable for these under-represented species; '


Can you add a source for that please? Is that like a known issue?

kfir4444 · 2026-06-18T20:20:47Z

+        """
+        scan_coords = self.data.get('scan_coords')
+        if scan_coords:
+            return [xyz if isinstance(xyz, dict) else str_to_xyz(xyz) for xyz in scan_coords]


Amazing! I missed that we didn't know how to interact with scans from ASE. Can you please add tests for these functions too?
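A test for the scan-coordinate parsing might look roughly like the sketch below. It is not the test added in this PR: `str_to_xyz_stub` is a simplified stand-in for ARC's `str_to_xyz`, `parse_scan_coords` only mirrors the logic visible in the diff, and returning `None` when the key is missing is an assumption.

```python
import unittest


def str_to_xyz_stub(xyz_str):
    """Simplified stand-in for ARC's str_to_xyz: parse 'El x y z' lines."""
    symbols, coords = [], []
    for line in xyz_str.strip().splitlines():
        el, x, y, z = line.split()
        symbols.append(el)
        coords.append((float(x), float(y), float(z)))
    return {"symbols": tuple(symbols), "coords": tuple(coords)}


def parse_scan_coords(data):
    """Mirror of the diff: dict entries pass through, strings are converted."""
    scan_coords = data.get("scan_coords")
    if scan_coords:
        return [xyz if isinstance(xyz, dict) else str_to_xyz_stub(xyz)
                for xyz in scan_coords]
    return None  # assumed behavior when no scan coordinates are stored


class TestParseScanCoords(unittest.TestCase):
    def test_mixed_entries(self):
        as_dict = {"symbols": ("H", "H"),
                   "coords": ((0.0, 0.0, 0.0), (0.0, 0.0, 0.74))}
        as_str = "H 0.0 0.0 0.0\nH 0.0 0.0 0.74"
        result = parse_scan_coords({"scan_coords": [as_dict, as_str]})
        self.assertEqual(result, [as_dict, as_dict])

    def test_missing_key(self):
        self.assertIsNone(parse_scan_coords({}))
```

Covering both the dict and the string form in one case checks that mixed inputs normalize to the same structure.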

codecov · 2026-06-21T16:54:40Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 63.14%. Comparing base (07da21d) to head (a555ec5).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #895      +/-   ##
==========================================
+ Coverage   63.02%   63.14%   +0.11%     
==========================================
  Files         114      114              
  Lines       38178    38218      +40     
  Branches     9990     9999       +9     
==========================================
+ Hits        24063    24132      +69     
+ Misses      11227    11194      -33     
- Partials     2888     2892       +4

Flag	Coverage Δ
functionaltests	`63.14% <ø> (+0.11%)`	⬆️
unittests	`63.14% <ø> (+0.11%)`	⬆️


Add devtools/uma_environment.yml and devtools/install_uma.sh: a documented, user-driven setup script that wraps every step (create uma_env, verify fairchem/Sella/ASE imports, interactive HuggingFace login for the gated model, and the runtime exports for invoking UMA from arc_env). Supports --test to run the model-dependent unit tests and --skip-hf-login. Wire 'make install-uma' and .PHONY, and document UMA setup in installation.rst. Deliberately NOT added to devtools/install_all.sh / install-ci: the UMA model is gated (Meta license + HuggingFace token) and heavy, so it is a manual setup; CI coverage comes from the env-independent UMA tests.

Build UMA (Meta FAIR fairchem-core, uma-s-1p1, task omol) on top of the generic ASE adapter (PR #836) instead of a standalone adapter:
- ase_script.py: add a uma/fairchem branch to get_calculator; set total charge and spin (=multiplicity) on atoms.info (omol conditioning); use Sella order=1 for TS saddle-point searches when is_ts; add an irc job type via Sella IRC.
- ase.py: derive the calculator from the level method (so method='uma' works with no args), resolve UMA defaults (latest model, omol, cpu) via determine_settings, pass is_ts and irc_direction to the script, and warn on a UMA single point for an isolated atom or triplet O2 (unreliable absolute energy).
- settings.py: UMA_PYTHON=find_executable('uma_env'), ASE_CALCULATORS_ENV['uma'], and UMA_LATEST_MODEL.
- level.py: route method 'uma'/'uma-s-1'/'uma-s-1p1' to the 'ase' software.
- yaml.py: implement parse_irc_traj and parse_1d_scan_coords so UMA IRC/scan outputs round-trip.

Rotor scans run through ARC's directed_scan (constrained opt), already supported by the ASE adapter. fairchem/Sella-IRC API points only confirmable inside uma_env are marked with # VERIFY. Adds env-independent unit tests (routing, calculator/settings resolution, input writing, sp warning, output round-trip) plus skip-guarded model tests.

Made `warn_if_unreliable_uma_sp` a boolean function; check the return value. Doesn't interfere with the current implementation.

alongd

Looks good, I added a minor comment. Did you check it with UMA? How's the performance?

alongd · 2026-06-27T09:25:24Z


+    def test_yaml_parser(self):
+        """Test the YAMLParser adapter for all its parse methods."""
+        import tempfile


Please put imports at module level, unless we cannot for some reason.

Copilot

Pull request overview

Adds optional UMA (Meta FAIR fairchem-core) infrastructure on top of the existing ASE job adapter integration, including environment/setup tooling and ARC-side routing/parsing/tests to run UMA via ASE.

Changes:

Add make install-uma plus devtools/install_uma.sh and devtools/uma_environment.yml for user-driven UMA setup (gated model; not CI).
Route UMA levels (method='uma', uma-s-1, uma-s-1p1) through the ASE adapter and add UMA-specific defaults/warnings.
Extend YAML parsing + tests for scan/IRC keys and add UMA-via-ASE adapter tests (with gated model tests behind an opt-in flag).

Reviewed changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 4 comments.


File	Description
Makefile	Adds `install-uma` target for manual UMA setup.
docs/source/installation.rst	Documents optional UMA installation and usage.
devtools/uma_environment.yml	Defines the conda env skeleton for `uma_env`.
devtools/install_uma.sh	Automates UMA env creation, dependency install, HF login, and optional tests.
arc/settings/settings.py	Registers UMA env discovery and defines `UMA_LATEST_MODEL`.
arc/parser/parser_test.py	Adds YAML parser tests covering new scan/IRC-related keys.
arc/parser/adapters/yaml.py	Implements YAML parsing for 1D scan coords and IRC trajectories.
arc/level.py	Adds UMA-to-ASE routing in software deduction.
arc/job/adapters/uma_test.py	Adds UMA-via-ASE wiring tests + opt-in model tests.
arc/job/adapters/scripts/ase_script.py	Adds fairchem/UMA calculator support and IRC execution via Sella.
arc/job/adapters/ase_adapter.py	Adds UMA calculator detection, default settings, and a reliability warning for UMA SPs.


+        # UMA (run via the ASE adapter; 'uma' resolves to the latest model)
+        if self.method in ('uma', 'uma-s-1', 'uma-s-1p1'):
+            self.software = 'ase'
+


    atoms = Atoms(symbols=xyz['symbols'], positions=xyz['coords'])
+    atoms.info.update({'charge': charge, 'spin': multiplicity})  # UMA (omol) conditions on these
    calc = get_calculator(settings, charge, multiplicity)


+ARC_ENV_PY="$($COMMAND_PKG run -n arc_env python -c 'import sys; print(sys.executable)')"
+ARC_ENV_PREFIX="$(dirname "$(dirname "$ARC_ENV_PY")")"
+BABEL_VERSION_DIR="$(ls -d "$ARC_ENV_PREFIX"/lib/openbabel/*/ 2>/dev/null | head -1)"
+
+export_block() {
+    echo "export BABEL_LIBDIR=${BABEL_VERSION_DIR%/}"
+    echo "export BABEL_DATADIR=${ARC_ENV_PREFIX}/share/openbabel/$(basename "${BABEL_VERSION_DIR%/}")"
+    echo "export PYTHONPATH=${ARC_DIR}:\$PYTHONPATH"
+}


+if [ "$RUN_TESTS" -eq 1 ]; then
+    echo ">>> Running the UMA model-dependent unit tests (first run downloads the model; this is slow)..."
+    export BABEL_LIBDIR="${BABEL_VERSION_DIR%/}"
+    export BABEL_DATADIR="${ARC_ENV_PREFIX}/share/openbabel/$(basename "${BABEL_VERSION_DIR%/}")"
+    export PYTHONPATH="${ARC_DIR}:${PYTHONPATH}"
+    UMA_RUN_MODEL=1 "$ARC_ENV_PY" -m pytest "$ARC_DIR/arc/job/adapters/uma_test.py" -v
+fi


alongd requested a review from Copilot June 7, 2026 17:34

Copilot started reviewing on behalf of alongd June 7, 2026 17:34

alongd removed the request for review from Copilot June 7, 2026 17:34

kfir4444 reviewed Jun 18, 2026


alongd force-pushed the uma branch from 259912e to c3e13f3 Compare June 19, 2026 03:48

kfir4444 force-pushed the uma branch 4 times, most recently from b628acc to 2549a2c Compare June 21, 2026 09:35

github-advanced-security AI found potential problems Jun 21, 2026


Comment thread arc/job/adapters/uma_test.py Fixed

kfir4444 force-pushed the uma branch from 2549a2c to c66bdf0 Compare June 21, 2026 12:43

kfir4444 added 3 commits June 21, 2026 21:47

with self.assertLogs(level='WARNING'): Doesn't work well in parallel.

a555ec5

Made `warn_if_unreliable_uma_sp` a boolean function; check the return value. Doesn't interfere with the current implementation.
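The pattern these two commits describe can be sketched as follows: the function still logs, but also returns a boolean, so a test can assert on the return value instead of capturing log records with `assertLogs`. The name `warn_if_unreliable_sp`, its signature, and the logger name are hypothetical; the real `warn_if_unreliable_uma_sp` in ARC may take different arguments.

```python
import logging
import unittest

logger = logging.getLogger("uma_sketch")  # hypothetical logger name


def warn_if_unreliable_sp(n_atoms, is_triplet_o2=False):
    """Log a warning and return True for species flagged as unreliable."""
    if n_atoms == 1 or is_triplet_o2:
        logger.warning("UMA absolute energies are unreliable for this species.")
        return True
    return False


class TestWarnIfUnreliableSp(unittest.TestCase):
    def test_return_value(self):
        # Asserting on the return value avoids relying on log capture.
        self.assertTrue(warn_if_unreliable_sp(1))
        self.assertTrue(warn_if_unreliable_sp(2, is_triplet_o2=True))
        self.assertFalse(warn_if_unreliable_sp(5))
```

Callers that ignore the return value behave as before, which is consistent with the note that the change does not interfere with the current implementation.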

kfir4444 force-pushed the uma branch from b664afd to a555ec5 Compare June 21, 2026 18:56

alongd requested a review from Copilot June 27, 2026 09:22

Copilot started reviewing on behalf of alongd June 27, 2026 09:22

alongd commented Jun 27, 2026


Copilot AI reviewed Jun 27, 2026



4 participants
