Skip to content

DavePower-cloud/Laparoscopic-Assessment-using-Deep-Learning-

Repository files navigation

๐Ÿง  Laparoscopic Skill Assessment using Deep Learning

Python
Deep Learning
Domain
Status
License
DOI

Automated assessment of simulated laparoscopic surgical performance using computer vision and deep learning (3DCNN).

๐Ÿ“Œ Overview

This repository implements a deep learning framework for objective assessment of laparoscopic surgical skill using simulation video data. The system uses a 3D Convolutional Neural Network (3DCNN) to extract spatiotemporal features and classify operator expertise:

  • Novice

  • Trainee

  • Expert

This work demonstrates how AI can:

  • reduce reliance on subjective human assessment

  • scale evaluation in simulation-based training

  • provide consistent, reproducible performance metrics


๐Ÿ“„ Associated Publications

Automated Assessment of Simulated Laparoscopic Surgical Performance using 3DCNN. IEEE EMBC (2024) https://doi.org/10.1109/EMBC53108.2024.10782160.

Automated assessment of simulated laparoscopic surgical skill performance using deep learning. Scientific Reports (Nature Portfolio, 2025) https://doi.org/10.1038/s41598-025-96336-5


๐ŸŽฏ Motivation

Traditional surgical skill assessment:

  • relies on expert raters

  • is time-intensive

  • introduces subjectivity

Even structured frameworks (e.g. OSATS) require manual scoring. This project explores whether deep learning can directly infer skill from video, enabling scalable and objective assessment.


๐Ÿ—‚ Dataset: LSPD (Laparoscopic Surgical Performance Dataset)

Characteristics

Tasks:

  • Bands

  • Stack

  • Tower

Skills Image

Participants:

  • Novice

  • Trainee

  • Expert

~100 videos expanded to 2244 samples via augmentation

Weak supervision:

  • labels applied at video level

โš ๏ธ The dataset used in this study is not publicly available due to ethical, privacy, and institutional governance constraints. Access may be considered upon reasonable request to the corresponding author, subject to appropriate approvals and data sharing agreements.


๐Ÿงช Methodology

Preprocessing

  • Video trimming (OpenCV)

  • Frame resizing (128ร—128)

  • Normalisation

  • Data augmentation:

    • Gaussian blur

    • Brightness / contrast

    • Salt & pepper noise

    • Horizontal flipping


Model Architecture (3DCNN)

  • 4 ร— Conv3D layers (64 โ†’ 512 filters)

  • Kernel size: 3ร—3ร—3

  • Batch Normalisation

  • Max Pooling (2ร—2ร—2)

  • Dropout (0.1โ€“0.5)

  • Fully connected layers (1024 โ†’ 512) Outputs:

    • Multi-class: novice / trainee / expert

    • Binary: novice vs expert

Model Diagram


Training

  • 5-fold cross-validation

  • Train/test split: 80/20

  • Validation split: 20% of training

  • Optimiser: Adam

  • Loss: Binary cross-entropy


๐Ÿ“Š Results

Multi-class Classification

Skill Accuracy
Stack 79%
Tower 49%
Bands 54%

Multi Con Matrix

Performance was highest for the stack task, while classification of the trainee group remained challenging due to heterogeneity.

Binary Classification (Expert vs Novice)

Skill Accuracy
Stack 91%
Tower 97%
Bands 79%

Binary Con Matrix

Binary classification demonstrated strong discrimination between expert and novice participants, particularly for the tower task.

Area PLot


๐Ÿ“ˆ Statistical Analysis of Task Difficulty

  • Stack time: H = 14.08, p = 0.0087

  • Stack score: H = 11.70, p = 0.0028

  • Tower score: H = 19.83, p = 0.000049

  • Bands time: H = 12.53, p = 0.00189

Not significant:

  • Tower time: H = 1.28, p = 0.52

  • Bands score: H = 1.858, p = 0.39

Summary Results


๐Ÿ‘จโ€โš•๏ธ Inter-Rater Reliability

  • Stack: ฮบ = 1.00

  • Tower: ฮบ = 0.76

  • Bands: ฮบ = 0.72


๐Ÿ‘จโ€โš•๏ธ Human vs Model Performance

Multi-class agreement:

  • Stack: ฮบ = 0.40

  • Tower: ฮบ = 0.41

  • Bands: ฮบ = 0.12

Binary agreement:

  • Stack: ฮบ = 0.53

  • Tower: ฮบ = 0.90

  • Bands: ฮบ = -0.18

The model achieved higher and more consistent performance, particularly in binary classification.


๐Ÿ”‘ Key Insights

  • Strong discrimination between expert vs novice

  • Tower task provides the clearest separation

  • Trainee group remains difficult to classify

  • Human agreement varies significantly

  • AI enables consistent, scalable evaluation


๐Ÿ–ผ Visual Overview

System Pipeline
Raw Video โ†’ Preprocessing โ†’ 3DCNN โ†’ Feature Extraction โ†’ Classification โ†’ Metrics
Model vs Human
Human: variable, subjective
Model: consistent, reproducible
Task Difficulty:

  • Tower โ†’ Strong
  • Stack โ†’ Moderate
  • Bands โ†’ Weak

Pipeline Diagram


๐Ÿ“Š Example Output

{
"task": "tower",
"prediction": "expert",
"confidence": 0.94
}


๐Ÿงฑ Repository Structure

src/

  • config.py
  • data.py
  • model.py
  • train.py
  • evaluate.py
  • utils.py

notebooks/

  • auto_skills_assess.ipynb

๐Ÿš€ Quick Start

Install dependencies:

  • pip install -r requirements.txt

Train model:

  • python src/train.py

Evaluate model:

  • python src/evaluate.py

๐Ÿ’ก Key Contributions

  • LSPD dataset for laparoscopic skill assessment

  • 3DCNN-based spatiotemporal modelling

  • Weakly supervised learning

  • Scalable AI-driven evaluation


โš ๏ธ Limitations

  • Small dataset

  • Trainee variability

  • No frame-level labels

  • Simulation-only data


๐Ÿ”ฎ Future Work

  • Attention-based video models

  • Multi-modal inputs

  • Real-world deployment

  • Integration into simulation systems


๐Ÿ“œ License

MIT License


๐Ÿ‘ค Author

David Power
Healthcare Simulation Specialist | MSc Artificial Intelligence


๐Ÿ“š Citation

If you use this code or build upon this work, please cite:

@article{power2025laparoscopic,
  title   = {Automated assessment of simulated laparoscopic surgical skill performance using deep learning},
  author  = {Power, David and Burke, C. and Madden, Michael G. and others},
  journal = {Scientific Reports},
  volume  = {15},
  pages   = {13591},
  year    = {2025},
  doi     = {10.1038/s41598-025-96336-5},
  publisher = {Nature Publishing Group}
}


@inproceedings{power2024laparoscopic,
  title     = {Automated Assessment of Simulated Laparoscopic Surgical Performance using 3DCNN},
  author    = {Power, D. and Ullah, I.},
  booktitle = {2024 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)},
  address   = {Orlando, FL, USA},
  pages     = {1--4},
  year      = {2024},
  doi       = {10.1109/EMBC53108.2024.10782160},
  publisher = {IEEE}
}

About

Automated Assessment of Simulated Laparoscopic Surgical Performance using 3DCNN

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors