
BrainMT: A Hybrid Mamba‑Transformer Architecture for Modeling Long‑Range Dependencies in Functional MRI Data

Arunkumar Kannan, Martin A. Lindquist, Brian Caffo

Johns Hopkins University


🎉 BrainMT has been accepted to MICCAI'25 🎉

This is an official PyTorch implementation of BrainMT: A Hybrid Mamba‑Transformer Architecture for Modeling Long‑Range Dependencies in Functional MRI Data.

Contact: akannan7@jhu.edu (Arunkumar Kannan)

Give us a ⭐️ if you find our repository helpful!

✨ Highlights

🔍 Motivation: Can we develop deep learning models that operate efficiently on voxel-level fMRI data, just as we do with other medical imaging modalities?

🧠 Architecture: We introduce BrainMT, a novel hybrid framework designed to efficiently learn and integrate long-range spatiotemporal attributes in fMRI data. The BrainMT framework operates in two stages (sketched in code after this list):

  • 1️⃣ A bidirectional Mamba block with a temporal-first scanning mechanism to capture global temporal interactions in a computationally efficient manner; and
  • 2️⃣ A transformer block leveraging self-attention to model global spatial relationships across the deep features processed by the Mamba block.
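
For intuition, the two-stage design can be summarized in pseudo-PyTorch. This is a minimal sketch, not the repository's actual classes: a bidirectional GRU stands in for the Mamba scan purely so the snippet runs without the mamba-ssm package, and the tensor shapes are illustrative.

import torch
import torch.nn as nn

class BrainMTSketch(nn.Module):
    def __init__(self, dim=256, n_heads=8):
        super().__init__()
        # Stage 1 stand-in: bidirectional sequence model scanned along time
        # (the paper uses a bidirectional Mamba block here, not a GRU).
        self.temporal_mixer = nn.GRU(dim, dim // 2, bidirectional=True, batch_first=True)
        # Stage 2: global self-attention across spatial tokens.
        self.spatial_attn = nn.TransformerEncoderLayer(dim, n_heads, batch_first=True)

    def forward(self, x):
        # x: (batch, time, space, dim) patch features
        b, t, s, d = x.shape
        # Temporal-first scan: fold space into the batch, run along time.
        xt = x.permute(0, 2, 1, 3).reshape(b * s, t, d)
        xt, _ = self.temporal_mixer(xt)
        # Pool over time, then model global spatial relationships.
        xs = xt.mean(dim=1).reshape(b, s, d)
        return self.spatial_attn(xs)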

📈 Results: Through extensive experiments and ablation studies on two large-scale public datasets, the UK Biobank (UKB) and the Human Connectome Project (HCP), we demonstrate that BrainMT outperforms existing methods and generalizes robustly across diverse tasks, improving phenotypic prediction in neuroimaging.

BrainMT Teaser Figure

Getting Started

This section will guide you through setting up the environment, preprocessing data, and running the BrainMT model.

1. Environment Setup

This code is implemented with Python 3.9.18, PyTorch 2.6.0, and CUDA 12.4.

Step 1: Create and activate virtual environment

# Create virtual environment
python -m venv brainmt_env

# Activate virtual environment
source brainmt_env/bin/activate

Step 2: Install dependencies

# Install from requirements.txt
pip install -r requirements.txt

2. Data Preparation

Our workflow begins with data that has already been processed through the standardized fMRI preprocessing pipelines of the UK Biobank (UKB) and the Human Connectome Project (HCP). From there, data preparation is two-fold: converting the fMRI volumes into a model-friendly format and preparing the corresponding phenotype targets for the downstream tasks.

🧠 Preprocessing fMRI Volumes

The primary goal here is to convert the NIfTI files into a more efficient format for our model. The preprocessing script, located in src/brainmt/preprocessing/, handles the following:

  • Normalization: Applies voxel-wise normalization across the time dimension (either z-score or min-max).
  • Masking: Removes background voxels to reduce computational overhead.
  • Conversion: Transforms the 4D fMRI volumes into PyTorch tensors and saves them in fp16 format to significantly reduce storage space and accelerate data loading during training (a minimal sketch of these steps follows the usage instructions below).
Usage
  1. Configure paths and parameters in src/brainmt/preprocessing/preprocess_fmri.py.
    • load_root: Set this to the directory containing preprocessed fMRI NIfTI files.
    • save_root: Set this to the output directory where the processed PyTorch tensors will be stored.
  2. Run the script:
    python src/brainmt/preprocessing/preprocess_fmri.py
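
For reference, the core transformation performed by the script looks roughly like the following. This is a minimal sketch (z-score variant only; the function name and exact masking logic are illustrative, not the repository's code):

import nibabel as nib
import numpy as np
import torch

def preprocess_volume(nifti_path, out_path, eps=1e-6):
    # Load the 4D fMRI volume: shape (X, Y, Z, T).
    data = nib.load(nifti_path).get_fdata().astype(np.float32)
    # Voxel-wise z-score normalization across the time dimension.
    mean = data.mean(axis=-1, keepdims=True)
    std = data.std(axis=-1, keepdims=True)
    data = (data - mean) / (std + eps)
    # Zero out background voxels (no temporal variance) to reduce compute.
    data[std[..., 0] < eps] = 0.0
    # Save as a half-precision tensor to cut storage and speed up loading.
    torch.save(torch.from_numpy(data).to(torch.float16), out_path)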

🎯 Preparing Target Phenotypes

We also prepare the target data for our two downstream tasks: regression and classification.

  • Regression (Cognitive Intelligence):

    • For the UKB dataset, we use fluid intelligence scores from data-field 20016.
    • For the HCP dataset, we use the age-adjusted cognitive composite score from CogTotalComp_AgeAdj.
    • To stabilize model training, we z-normalize these scores for each dataset independently.
  • Classification (Sex):

    • For the UKB dataset, we use the sex field 31.
    • For the HCP dataset, we use the corresponding gender field.
    • We encode the labels numerically: 'male' is mapped to 1 and 'female' is mapped to 0.

    The final output for each task is a pickle file that contains a dictionary mapping each subject's ID to their corresponding target value. This file is used directly by the data loader during model training.
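
Concretely, building such a target file can look like the following. This is a hypothetical example: the input file and column names (phenotypes.csv, subject_id, fluid_intelligence) are placeholders, not the actual UKB/HCP field names.

import pickle
import pandas as pd

# Hypothetical phenotype table with one row per subject.
df = pd.read_csv("phenotypes.csv")  # columns: subject_id, fluid_intelligence

# Regression target: z-normalize the scores per dataset.
scores = df["fluid_intelligence"]
df["target"] = (scores - scores.mean()) / scores.std()

# Dictionary mapping subject ID -> target value, consumed by the data loader.
targets = dict(zip(df["subject_id"].astype(str), df["target"]))
with open("regression_targets.pkl", "wb") as f:
    pickle.dump(targets, f)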

3. Running the Model

We use Hydra to manage configurations, making it easy to customize runs from the command line. The configuration files are located in the configs/ directory; a short sketch of how they are consumed follows the list below.

Configuration Files

  • configs/base.yaml: The main configuration file. It sets default parameters for the model, dataset, training schedule, optimizer, and logging.
  • configs/model/brain_mt.yaml: Defines the BrainMT model architecture.
  • configs/dataset/fmri.yaml: Specifies the dataset paths and properties.
  • configs/task/regression.yaml: Sets the task to regression.
  • configs/task/classification.yaml: Sets the task to classification.
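
As a quick orientation to Hydra, an entry point typically consumes these configs as shown below. This is a generic sketch of Hydra usage, not necessarily the exact signature in train.py:

import hydra
from omegaconf import DictConfig, OmegaConf

@hydra.main(config_path="configs", config_name="base", version_base=None)
def main(cfg: DictConfig) -> None:
    # cfg merges base.yaml with any group selections and CLI overrides,
    # e.g. `python train.py task=regression`.
    print(OmegaConf.to_yaml(cfg))

if __name__ == "__main__":
    main()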

Training

The train.py script handles model training using Distributed Data Parallel (DDP) for efficient multi-GPU training; a generic sketch of this pattern appears at the end of this subsection.

1. Configure Training Run: Open configs/dataset/fmri.yaml and update img_path and target_path to point to your preprocessed fMRI data and phenotype target files.

2. Start Training: Launch the run with the torchrun command, overriding configuration parameters directly on the command line as needed.

Example: Training for Regression

torchrun --nproc_per_node=2 train.py task=regression

Checkpoints for the best-performing model on the validation set will be saved in the directory specified by checkpoint.dir in configs/base.yaml.

Note: If you encounter NCCL P2P communication issues on multi-GPU systems, prefix the command with NCCL_P2P_DISABLE=1:

NCCL_P2P_DISABLE=1 torchrun --nproc_per_node=2 train.py task=regression
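
For context, torchrun-launched DDP training follows a standard pattern like the sketch below (generic PyTorch boilerplate, not the repository's exact train.py):

import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def setup_ddp(model):
    # torchrun sets RANK, LOCAL_RANK, and WORLD_SIZE for each process.
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)
    # Wrap the model; gradients sync across GPUs on backward().
    return DDP(model.cuda(local_rank), device_ids=[local_rank])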

Inference

Run the inference.py script, specifying the task and the path to a trained model checkpoint.

Example: Inference for Regression

python inference.py task=regression inference.checkpoint_path=/path/to/your/best_model.pth
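
In outline, inference restores the checkpoint and runs the model in eval mode. The sketch below is illustrative (the checkpoint key and data loader are assumptions; the real script builds the model from the Hydra config):

import torch

def run_inference(model, checkpoint_path, loader, device="cuda"):
    # Restore the best validation checkpoint saved during training.
    state = torch.load(checkpoint_path, map_location=device)
    # Weights may be stored under a "model" key (an assumption here).
    model.load_state_dict(state.get("model", state))
    model.to(device).eval()
    preds = []
    with torch.no_grad():
        for x, _ in loader:
            preds.append(model(x.to(device)).cpu())
    return torch.cat(preds)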

4. Checkpoints

Task                                  HCP        UKB
Regression (cognitive intelligence)   Download   Download
Classification (sex)                  Download   Download

✅ To‑Do List for Code Release

  • Create repository
  • Installation guide – provide requirements.txt / environment.yml and setup instructions
  • Training scripts – release reproducible training pipeline (train.py, configs)
  • Evaluation scripts – include scripts for validation and test‑set evaluation
  • Dataset prep – share preprocessing scripts
  • Config files – upload YAML config templates for different tasks
  • Release model checkpoints

Citation

If you find this repository useful, please consider citing:

@inproceedings{KanAru_BrainMT_MICCAI2025,
        author    = {Kannan, Arunkumar and Lindquist, Martin A. and Caffo, Brian},
        title     = {BrainMT: A Hybrid Mamba-Transformer Architecture for Modeling Long-Range Dependencies in Functional MRI Data},
        booktitle = {International Conference on Medical Image Computing and Computer-Assisted Intervention},
        year      = {2025},
        publisher = {Springer Nature Switzerland},
        volume    = {LNCS 15971},
        month     = {September},
        pages     = {151--161}
}

Acknowledgements

We would like to thank the following repositories for their great work: VideoMamba, MambaVision, SwiFT, TFF.

This work used data from the Human Connectome Project, WU-Minn Consortium (PIs David Van Essen and Kamil Ugurbil; NIH grant 1U54 MH091657, funded by the 16 NIH Institutes and Centers supporting the NIH Blueprint for Neuroscience Research and the McDonnell Center for Systems Neuroscience at Washington University), as well as from UK Biobank (Project ID 33278), a major biomedical database.
