OpenPAD SDK

On-device Presentation Attack Detection (face liveness) for Android. Detects face spoofing -- photos, screens, video replays, face swaps -- using a multi-layer ML pipeline. All processing runs locally. No server dependency, no cloud calls, no data leaves the device.

Requirements: Android 8.0+ (API 26), front-facing camera.

** This is an AI gen photo, source: https://this-person-does-not-exist.com/en

Integration Guide

1. Add the dependency

Add the JitPack repository to your root settings.gradle.kts:

dependencyResolutionManagement {
    repositoriesMode.set(RepositoriesMode.FAIL_ON_PROJECT_REPOS)
    repositories {
        google()
        mavenCentral()
        maven { url = uri("https://jitpack.io") }
    }
}

Add the dependency in your app build.gradle.kts:

dependencies {
    implementation("com.github.iamjosephmj:OpenPAD:1.0.6")
}

2. Initialize

Call initialize() once before starting verification (e.g. in Activity.onCreate()). This loads all ML models asynchronously.

OpenPad.initialize(context) {
    // SDK is ready
}

With a preset configuration:

OpenPad.initialize(context, config = OpenPadConfig.Banking)

With custom parameters:

OpenPad.initialize(
    context = this,
    config = OpenPadConfig(
        livenessThreshold = 0.70f,
        faceMatchThreshold = 0.70f
    ),
    onReady = { /* ready */ },
    onError = { error -> Log.e("PAD", error.toString()) }
)

3a. UI Mode -- Camera Verification Flow

Launch the built-in camera UI. The SDK handles camera, challenge-response, and delivers the result via callback. There is no intro or verdict screen -- the camera starts immediately and the activity finishes as soon as a verdict is reached. Your app decides how to present the result.

OpenPad.analyze(activity, object : OpenPadListener {
    override fun onLiveConfirmed(result: OpenPadResult) {
        // Verified live: result.confidence, result.durationMs
    }

    override fun onSpoofDetected(result: OpenPadResult) {
        // Spoof detected: result.spoofAttempts
    }

    override fun onError(error: OpenPadError) {
        // Initialization, camera, or permission error
    }

    override fun onCancelled() {
        // User closed the flow
    }
})

3b. Headless Mode -- Bring Your Own Camera

For integrators who want to use their own camera UI. The SDK provides a frame analyzer and reactive state flows.

val session = OpenPad.createSession(listener) ?: return

// Plug into CameraX
val imageAnalysis = ImageAnalysis.Builder().build()
imageAnalysis.setAnalyzer(executor, session.frameAnalyzer)

// Observe state to drive your own UI
lifecycleScope.launch {
    session.status.collect { status -> /* ANALYZING, LIVE, SPOOF_SUSPECTED */ }
}
lifecycleScope.launch {
    session.phase.collect { phase -> /* CHALLENGE_CLOSER, EVALUATING, etc. */ }
}
lifecycleScope.launch {
    session.instruction.collect { text -> /* "Move closer", "Hold still" */ }
}
lifecycleScope.launch {
    session.challengeProgress.collect { progress -> /* 0.0 to 1.0 */ }
}

// When done
session.release()

4. Theme Customization (UI Mode)

The SDK camera screen uses a customizable color theme. Set it before calling analyze().

OpenPad.theme = OpenPadThemeConfig(
    primary = 0xFF1565C0,        // Buttons, progress arcs
    success = 0xFF2E7D32,        // Live-confirmed accent
    error = 0xFFD32F2F,          // Spoof-detected accent
    surface = 0xFF121212,        // Background
    surfaceVariant = 0xFF1E1E1E, // Elevated panels
    onSurface = 0xFFE0E0E0,     // Text color
    onSurfaceHigh = 0xFFFFFFFF,  // High-emphasis text
    scrim = 0xFF121212,          // Overlay outside face oval
    frostGlass = 0xFF1E1E1E,    // Frosted panel backgrounds
    ovalIdle = 0xFFE0E0E0,      // Face oval border (idle)
    divider = 0xFF333333         // Separator lines
)

Colors are ARGB hex Long values to keep the public API free of Compose dependencies.

5. Cleanup

OpenPad.release()

Configuration Reference

All parameters are optional. Defaults are tuned for balanced security/usability.

OpenPadConfig(
    // --- Verdict ---
    livenessThreshold = 0.70f,           // Minimum overall confidence to accept as live
    faceMatchThreshold = 0.70f,          // Minimum face similarity between checkpoints (face swap detection)
    spoofAttemptPenalty = 0.08f,          // Extra threshold per consecutive failed attempt
    maxSpoofAttempts = 2,                // Maximum retry attempts before final verdict

    // --- Detection ---
    faceDetectionConfidence = 0.55f,     // Minimum face detection confidence

    // --- Scoring Weights (should sum to ~1.0) ---
    textureAnalysisWeight = 0.15f,       // Surface texture patterns (print/screen artifacts)
    depthGateWeight = 0.20f,             // Fast depth pre-filter
    depthAnalysisWeight = 0.55f,         // Full 3D depth map analysis (primary discriminator)
    screenReflectionWeight = 0.08f,      // Screen reflection multi-class signal (YOLOv5n)

    // --- Model Thresholds ---
    depthGateMinScore = 0.20f,           // Score to trigger full depth analysis (lower = more permissive)
    depthFlatnessMinScore = 0.40f,       // Hard cutoff: faces flatter than this are rejected
    screenReflectionMinConfidence = 0.50f,// Screen reflection confidence threshold
    screenReflectionMinSignals = 2,      // Minimum spoof classes detected to trigger gate

    // --- Classical Signal Thresholds ---
    moireDetectionThreshold = 0.60f,     // Moire score above this flags screen artifacts
    screenPatternThreshold = 0.70f,      // LBP screen pattern score above this flags screen
    photometricMinScore = 0.30f,         // Combined photometric score below this flags spoof

    // --- Anti-Replay ---
    staticFrameThreshold = 0.997f,       // Frame similarity above this flags static/frozen feed
    minMotionVariance = 0.1f,            // Head movement variance below this flags lack of motion

    // --- Low-Light Adaptation ---
    lowLightThreshold = 0.40f,           // Brightness below this triggers threshold relaxation
    lowLightRelaxation = 0.30f,          // Amount to relax scoring gates in low-light conditions

    // --- Performance ---
    maxFramesPerSecond = 8,              // Frame processing rate
    enableDebugOverlay = false,          // Show real-time debug metrics
    sessionTimeoutMs = 10_000L,          // Overall session timeout
    challengeTimeoutMs = 10_000L,        // Challenge phase timeout

    // --- Frame Enhancement ---
    enableFrameEnhancement = true,       // ESPCN x2 super-resolution during challenge (ML quality gate)

    // --- Preprocessing ---
    enablePreprocessing = true,          // Gamma correction + CLAHE contrast enhancement
    preprocessingGammaTarget = 0.45f,    // Target gamma for correction (lower = brighter)
    preprocessingClaheClipLimit = 2.0f   // CLAHE clip limit (higher = more contrast)
)

Configuration Presets

The SDK ships 10 named presets for common scenarios. Use them directly or as a starting point for fine-tuning.

Preset	Security	Speed	Use Case
`Default`	Balanced	Balanced	General-purpose apps
`HighSecurity`	High	Standard	Identity verification, high-value transactions
`FastPass`	Low	Fast	Check-in, attendance, low-risk access
`Banking`	Very High	Standard	Financial services, regulatory compliance
`Onboarding`	Moderate	Standard	First-time user registration (minimize drop-off)
`Kiosk`	High	Standard	Fixed-mount terminals with controlled lighting
`LowEndDevice`	Moderate	Fast	Budget Android phones (disables frame enhancement)
`Development`	Minimal	Standard	Integration testing (debug overlay enabled)
`HighThroughput`	Moderate	Very Fast	Queues, turnstiles, batch processing (15 FPS)
`MaxAccuracy`	Maximum	Standard	Security-critical flows (highest false-rejection rate)

Usage

// Use a preset directly
OpenPad.initialize(context, config = OpenPadConfig.Banking)

// Use a preset as a starting point, then override specific parameters
OpenPad.initialize(
    context = this,
    config = OpenPadConfig.HighSecurity.copy(
        enableDebugOverlay = true,
        maxFramesPerSecond = 10
    )
)

Key parameter differences from Default

Parameter	Default	HighSecurity	FastPass	Banking	MaxAccuracy
`livenessThreshold`	0.70	0.78	0.55	0.80	0.82
`faceMatchThreshold`	0.70	0.78	0.60	0.80	0.82
`depthFlatnessMinScore`	0.40	0.45	0.35	0.45	0.50
`photometricMinScore`	0.30	0.32	0.25	0.35	0.38
`spoofAttemptPenalty`	0.08	0.10	0.05	0.12	0.12
`maxFramesPerSecond`	8	8	12	8	8
`enableFrameEnhancement`	true	true	true	true	true
`enableDebugOverlay`	false	false	false	false	false

Public API

Class	Purpose
`OpenPad`	Singleton entry point. `initialize()`, `analyze()`, `createSession()`, `release()`.
`OpenPadConfig`	All tunable thresholds, weights, and limits.
`OpenPadThemeConfig`	UI color customization (ARGB hex longs).
`OpenPadListener`	Callback interface: `onLiveConfirmed`, `onSpoofDetected`, `onError`, `onCancelled`.
`OpenPadResult`	Verdict: `isLive`, `confidence`, `durationMs`, `spoofAttempts`, `depthCharacteristics`, `faceAtNormalDistance`, `faceAtCloseDistance`.
`DepthCharacteristics`	3D depth statistics from the CDCN 32x32 depth map: `mean`, `standardDeviation`, `quadrantVariance`, `minDepth`, `maxDepth`.
`OpenPadError`	Sealed class: `NotInitialized`, `InitializationFailed`, `CameraUnavailable`, `PermissionDenied`, `AlreadyRunning`.
`OpenPadSession`	Headless session: `frameAnalyzer`, `status`, `phase`, `challengeProgress`, `instruction`, `release()`.

Architecture

Pipeline Overview

flowchart TB
    CAM[Camera Frame] --> FD[1. Face Detection]
    FD --> TEX[2. Texture]
    FD --> DEP[3. Depth]
    FD --> FREQ[4. Frequency]
    FD --> SCR[5. Screen Reflection]
    FD --> PHOTO[6. Photometric]
    FD --> TEMP[7. Temporal]
    FD --> EMB[8. Face Embedding]
    FD --> ENH[9. Frame Enhancement]
    TEX --> AGG[10. Aggregation]
    DEP --> AGG
    FREQ --> AGG
    SCR --> AGG
    PHOTO --> AGG
    TEMP --> AGG
    ENH --> AGG
    EMB --> CHAL[11. Challenge]
    AGG --> CHAL
    CHAL --> VERDICT[Verdict]

Layer	Component	Type
1	Face Detection (BlazeFace)	ML
2	Texture (MiniFASNet V2 + V1SE)	ML
3	Depth (MN3 gate → CDCN)	ML
4	Frequency (FFT moiré + LBP)	Native C
5	Screen Reflection (YOLOv5n)	ML
6	Photometric (specular, chrominance, DOF, lighting)	Native C
7	Temporal (movement, blink, similarity)	Native C
8	Face Embedding (MobileFaceNet)	ML
9	Frame Enhancement (ESPCN x2 super-resolution)	ML
10	Aggregation (gates + stabilizer)	Native C
11	Challenge-Response ("move closer")	Native C

Detection Layers

Layer	What It Detects	Type	Role in Pipeline
Texture	Paper grain, screen sub-pixels, skin micro-texture	Learned (CNN)	Gate + ML score (15%)
Depth	Flat surfaces vs 3D face geometry	Learned (CNN)	Gate + ML score (20% + 55%)
Frequency	Moire patterns, screen pixel grids	Classical (FFT + LBP)	Gate (moire + LBP both flagged)
Screen Reflection	Fingers holding device, screen bezels, reflections, artifacts	Learned (YOLOv5n)	Gate + ML score (8%)
Photometric	Specular reflections, color temperature, uniform DOF	Classical	Gate (combined score too low)
Temporal	Static images, missing blinks, replay patterns	Classical	Pre-classification (frames, face presence)
Face Match	Face swap mid-challenge	Learned (embedding)	Separate check at challenge evaluation
Frame Enhancement	Defocus blur on face regions	Learned (ESPCN SR)	Auto-enhances during challenge; ML quality gate decides keep/discard

Classification Gates

Per-frame classification uses sequential decision gates (first match wins):

flowchart TD
    A[Frame] --> B{Enough frames?}
    B -->|No| ANALYZING[ANALYZING]
    B -->|Yes| C{Face OK?}
    C -->|No| NO_FACE[NO_FACE]
    C -->|Yes| SF{Static frame?}
    SF -->|Yes| SPOOF[SPOOF_SUSPECTED]
    SF -->|No| LM{Low motion?}
    LM -->|Yes| SPOOF
    LM -->|No| D{Screen reflection?}
    D -->|Yes| SPOOF
    D -->|No| E{Moiré + LBP?}
    E -->|Yes| SPOOF
    E -->|No| F{Texture pass?}
    F -->|No| SPOOF
    F -->|Yes| G{CDCN depth?}
    G -->|No| SPOOF
    G -->|Yes| H{Photometric?}
    H -->|No| SPOOF
    H -->|Yes| LIVE[LIVE]

Not enough frames → ANALYZING
No face / low confidence → NO_FACE
Static frame gate: frame similarity ≥ staticFrameThreshold → SPOOF_SUSPECTED
Low motion gate: head movement variance < minMotionVariance → SPOOF_SUSPECTED
Screen reflection gate: 2+ PAD classes detected (finger, device, artifact, reflection) → SPOOF_SUSPECTED
Frequency gate: moire + LBP both above threshold → SPOOF_SUSPECTED
Texture gate: MiniFASNet genuine score below threshold → SPOOF_SUSPECTED
CDCN depth gate: depth below flatness threshold → SPOOF_SUSPECTED
Photometric gate: combined score below threshold → SPOOF_SUSPECTED
All signals pass → LIVE

ML Aggregate Score

The challenge evaluation score is a weighted sum of ML model signals:

Signal	Weight	Role
Texture analysis	15%	Surface micro-texture discrimination
Depth gate (MN3)	20%	Fast binary live/spoof signal
Depth map (CDCN)	55%	Primary depth discriminator
Screen reflection (YOLOv5n)	8%	Multi-class screen replay indicator

Dynamic weighting: if CDCN is unavailable, its weight redistributes to texture (2/3) and MN3 (1/3). Classical signals (frequency, photometric) act as gates but do not contribute to the continuous score.

Challenge-Response Flow

IDLE -> ANALYZING -> POSITIONING -> CHALLENGE_CLOSER -> EVALUATING -> LIVE -> DONE

ANALYZING -- Wait for stable face detection
POSITIONING -- Face must be centered and in range
CHALLENGE_CLOSER -- User must move closer (15%+ face area increase). ML scores collected during hold phase. ESPCN super-resolution auto-enhances blurry face regions (ML quality gate decides keep/discard).
EVALUATING -- Face match check (MobileFaceNet) + weighted score evaluation against threshold
LIVE -- 1s sustain timer, then verdict delivered
DONE -- Terminal state

Failed attempts restart from ANALYZING with escalating threshold (+0.08 per attempt). Retries are unlimited.

UI Flow (UI Mode)

The SDK provides only the camera screen. When the challenge completes (live or spoof), the result is delivered via OpenPadListener and the activity finishes immediately. Intro screens, verdict screens, and retry logic are the consuming app's responsibility.

CameraScreen (with FaceGuideOverlay) -> Result delivered via callback -> Activity finishes

CameraScreen: Live camera preview with animated face oval, phase indicators, instruction pill, gradient scrims

Model Assets

All models use float16 quantization for reduced size (~50% smaller than float32) with no measurable accuracy loss. They are stored as .pad files (brotli-compressed, XOR-scrambled) in pad-core/src/main/assets/models/. This prevents casual extraction and renaming of the TFLite models from the APK.

File	Original Size	Packed Size	Precision	Purpose
`face_detection.pad`	224 KB	182 KB	FP16	BlazeFace face detection
`texture_2x7.pad`	905 KB	748 KB	FP16	Texture analysis (2.7x crop scale)
`texture_4x0.pad`	908 KB	749 KB	FP16	Texture analysis (4.0x crop scale)
`depth_gate.pad`	5.8 MB	5.1 MB	FP16	MN3 fast depth gate
`depth_map.pad`	3.4 MB	3.0 MB	FP16	CDCN depth map
`screen_reflection.pad`	3.5 MB	3.0 MB	FP16	YOLOv5n screen replay detection
`face_embedding.pad`	2.5 MB	2.2 MB	FP16	MobileFaceNet face consistency
`face_enhance.pad`	44 KB	39 KB	FP16	ESPCN x2 face super-resolution
Total	17.3 MB	15.0 MB

Model Packing & Quantization

Models are quantized to float16 and packed using scripts in scripts/:

# Quantize FP32 models to FP16 (also repacks as .pad)
python scripts/quantize_fp16.py

# Pack only: .tflite -> .pad (brotli + XOR)
python scripts/pack_models.py

# Unpack: .pad -> .tflite (reverse)
python scripts/pack_models.py --unpack

FP32 backups of the original models are preserved in scripts/model_backups/ for reference. At runtime, ModelLoader.kt reverses the packing: XOR-descramble with a 32-byte key, then brotli-decompress into a ByteBuffer for the TFLite interpreter. The TFLite runtime transparently dequantizes float16 weights to float32 during inference.

Model Licenses

Model	License	Source
BlazeFace	Apache 2.0	Google/MediaPipe
MiniFASNet V2 + V1SE	Apache 2.0	MiniVision (Silent-Face-Anti-Spoofing)
Anti-Spoof MN3	MIT	kprokofi (PINTO Model Zoo)
CDCN depth map	In-house trained	Architecture from CDCN paper (arXiv:2003.04092)
YOLOv5n (screen reflection)	AGPL-3.0	Ultralytics; in-house trained on PAD dataset
MobileFaceNet	MIT	syaringan357
ESPCN x2	MIT	fannymonori/TF-ESPCN

Screen Reflection Model (YOLOv5n)

The screen reflection detector is a custom-trained YOLOv5n object detection model that identifies physical indicators of screen-based presentation attacks. It detects multiple PAD-specific classes simultaneously and uses their combination as a spoof signal.

Architecture: YOLOv5n (nano) -- 1.77M parameters, 4.1 GFLOPs, 384x384 input

Classes (5):

Class	What it detects	Spoof signal logic
`artifact`	Screen capture artifacts (moiré, color banding)	Counted when overlapping face
`device`	Physical device frame (phone, tablet, laptop)	Always counted when detected
`face`	Face displayed on a screen	Not counted (informational)
`finger`	Fingers holding a device in front of camera	Always counted when detected
`reflection`	Specular reflections on screen surface	Counted when overlapping face

The gate fires when 2+ spoof classes are detected in the same frame. On a live face, the model typically detects only a face class. On a screen replay, it detects combinations like finger + device + face or finger + reflection + face.

Training results (mAP on 108-image validation set):

Class	Precision	Recall	mAP@50	mAP@50-95
All	0.799	0.715	0.761	0.495
artifact	0.766	0.691	0.757	0.431
device	0.604	0.623	0.623	0.379
face	0.971	0.975	0.993	0.812
finger	0.853	0.765	0.824	0.447
reflection	0.800	0.520	0.609	0.406

Retraining:

The training pipeline is fully self-contained. To retrain with more data or a different dataset:

cd scripts
python3 -m venv venv
source venv/bin/activate
pip install torch torchvision pyyaml tensorflow

# Place YOLOv5-format dataset in scripts/test.v1i.yolov5pytorch/
python train_screen_detector.py             # full pipeline: train + export + install
python train_screen_detector.py --epochs 300  # custom epoch count
python train_screen_detector.py --export-only # re-export existing weights

The script handles dataset preparation (class remapping, val/test merge), YOLOv5 training, TFLite export, and .pad asset installation automatically.

Project Structure

OpenPAD/
├── app/                                    # Demo app
│   └── src/main/java/com/openpad/app/
│       ├── OpenPadApp.kt                   # Application class (theme setup)
│       ├── MainActivity.kt                 # UI mode + headless mode launcher + result display
│       ├── MainScreen.kt                   # Main UI composable
│       ├── MainViewModel.kt               # Main screen ViewModel
│       ├── ConfigBottomSheet.kt            # Runtime configuration editor (all OpenPadConfig params)
│       ├── ResultBottomSheet.kt            # Verification result details (depth stats + face crops)
│       └── HeadlessActivity.kt             # Headless integration demo
│
├── pad-core/                               # SDK library module
│   ├── src/main/cpp/                       # Native C layer (see cpp/README.md)
│   │   ├── include/openpad/                # Public headers
│   │   └── src/                            # JNI, core, image, frequency, photometric, temporal, decision
│   └── src/main/java/com/openpad/core/
│       │
│       ├── OpenPad.kt                      # Singleton entry point
│       ├── OpenPadConfig.kt                # Public configuration (10 named presets)
│       ├── OpenPadConfigMapper.kt          # Maps public config → internal config
│       ├── OpenPadThemeConfig.kt            # UI theme colors
│       ├── OpenPadListener.kt              # Result callback interface
│       ├── OpenPadResult.kt                # Verdict data class
│       ├── OpenPadError.kt                 # Error types
│       ├── OpenPadSession.kt               # Headless session interface + impl
│       ├── PadConfig.kt                    # Internal pipeline thresholds (InternalPadConfig)
│       ├── PadPipeline.kt                  # Pipeline factory
│       ├── PadPipelineContract.kt          # Pipeline interface
│       ├── DeviceCapabilityDetector.kt     # Device tier detection
│       ├── PadResult.kt                    # Per-frame result
│       │
│       ├── di/                             # Dependency injection
│       │   ├── PadModule.kt               #   Hilt DI module
│       │   └── PadSessionHolder.kt        #   Session lifecycle holder
│       │
│       ├── detection/                      # Layer 1: Face detection
│       │   ├── MediaPipeFaceDetector.kt    #   BlazeFace inference + SSD decode
│       │   ├── FaceDetection.kt            #   Face data class
│       │   └── FaceDetector.kt             #   Interface
│       │
│       ├── texture/                        # Layer 2: Texture analysis
│       │   ├── MiniFasNetAnalyzer.kt       #   Multi-scale MiniFASNet ensemble
│       │   ├── TextureAnalyzer.kt          #   Interface
│       │   └── TextureResult.kt            #   Result data class
│       │
│       ├── depth/                          # Layer 3: Depth analysis
│       │   ├── CdcnDepthAnalyzer.kt        #   MN3 + CDCN cascaded inference
│       │   ├── CascadedDepthAnalyzer.kt    #   Cascade orchestration
│       │   ├── DepthAnalyzer.kt            #   Interface
│       │   ├── DepthResult.kt              #   Result data class
│       │   └── DepthCharacteristics.kt     #   3D depth map statistics
│       │
│       ├── frequency/                      # Layer 4: Frequency analysis (native C)
│       │   ├── FrequencyAnalyzer.kt        #   Interface
│       │   ├── FrequencyResult.kt          #   FFT result data class
│       │   └── LbpResult.kt               #   LBP result data class
│       │
│       ├── device/                         # Layer 5: Screen reflection detection
│       │   ├── YoloScreenReflectionDetector.kt  # YOLOv5n screen replay detection (5 classes)
│       │   ├── ScreenReflectionDetector.kt #   Interface
│       │   └── ScreenReflectionResult.kt   #   Result data class
│       │
│       ├── photometric/                    # Layer 6: Photometric analysis
│       │   ├── PhotometricAnalyzer.kt      #   Specular, chrominance, DOF, lighting
│       │   └── PhotometricResult.kt        #   Result data class
│       │
│       ├── signals/                        # Layer 7: Temporal signals
│       │   ├── DefaultTemporalTracker.kt   #   Sliding window tracker
│       │   ├── TemporalSignalTracker.kt    #   Interface
│       │   └── TemporalFeatures.kt         #   Features data class
│       │
│       ├── embedding/                      # Layer 8: Face embedding
│       │   ├── MobileFaceNetAnalyzer.kt    #   Face consistency verification
│       │   ├── FaceEmbeddingAnalyzer.kt    #   Interface
│       │   └── FaceEmbeddingResult.kt      #   Result data class
│       │
│       ├── enhance/                        # Layer 9: Frame enhancement
│       │   ├── EspcnFrameEnhancer.kt      #   ESPCN x2 super-resolution (ML quality gate)
│       │   └── FrameEnhancer.kt           #   Interface
│       │
│       ├── aggregation/                    # Layer 10: Aggregation
│       │   ├── WeightedAggregator.kt       #   Rule gates + weighted fusion
│       │   ├── StateStabilizer.kt          #   Hysteresis state machine
│       │   ├── ScoreAggregator.kt          #   Interface
│       │   └── PadStatus.kt               #   Status enum (with fromInt companion)
│       │
│       ├── challenge/                      # Layer 11: Challenge-response
│       │   ├── MovementChallenge.kt        #   "Move closer" state machine
│       │   ├── ChallengeManager.kt         #   Interface
│       │   └── ChallengeState.kt           #   Phase enum + evidence
│       │
│       ├── evaluation/                     # Shared evaluation logic
│       │   ├── PositioningGuidance.kt      #   Face positioning guidance messages
│       │   ├── GenuineProbabilityCalculator.kt # Weighted ML score calculation
│       │   ├── FaceConsistencyChecker.kt   #   Face swap detection via embeddings
│       │   ├── ChallengeEvaluator.kt       #   Orchestrates full evaluation flow
│       │   └── OpenPadResultFactory.kt     #   Consistent result construction
│       │
│       ├── analyzer/                       # Frame processing
│       │   ├── PadFrameAnalyzer.kt         #   CameraX analyzer orchestration
│       │   ├── FramePreprocessor.kt        #   Gamma/CLAHE preprocessing
│       │   ├── BitmapConverter.kt          #   YUV conversion, crops, similarity
│       │   ├── NativeInputBuilder.kt       #   Native C input assembly
│       │   └── PadResultMapper.kt          #   Native output → PadResult mapping
│       │
│       ├── ndk/                            # Native bridge
│       │   ├── OpenPadNative.kt            #   JNI bridge + NativeFrameOutput
│       │   └── NativeChallengeManager.kt   #   Native challenge state bridge
│       │
│       ├── model/                          # Model loading
│       │   └── ModelLoader.kt              #   .pad decryption + TFLite loading
│       │
│       └── ui/                             # Built-in SDK UI (camera only)
│           ├── PadActivity.kt              #   Host activity (camera + result delivery)
│           ├── PadRoute.kt                 #   Navigation host composable
│           ├── IntroScreen.kt              #   Verification intro composable
│           ├── CameraScreen.kt             #   Camera preview with overlay
│           ├── FaceGuideOverlay.kt         #   Animated face oval
│           ├── VerdictScreen.kt            #   Verdict display composable
│           ├── theme/OpenPadTheme.kt       #   Compose theme (reads OpenPadThemeConfig)
│           └── viewmodel/                  #   MVI state management
│               ├── PadViewModel.kt
│               ├── PadSessionCallback.kt   #   Session result callback interface
│               ├── PadUiState.kt
│               ├── PadIntent.kt
│               └── VerdictState.kt         #   Verdict sealed class
│
├── scripts/
│   ├── pack_models.py                      # Brotli + XOR model packer
│   ├── quantize_fp16.py                    # FP16 model quantization
│   └── train_screen_detector.py            # YOLOv5n screen reflection training + TFLite export
│
└── gradle/libs.versions.toml              # Dependency version catalog

Building

# Build everything
./gradlew :pad-core:build :app:build

# Install demo app
./gradlew :app:installDebug

# Run unit tests
./gradlew :pad-core:test

Memory Footprint

On-Disk (APK Contribution)

All 8 models are FP16-quantized and Brotli-compressed (quality 11) with XOR scrambling. Total packed size: ~15 MB.

Model	Packed Size	Purpose
`depth_gate.pad`	5.1 MB	MobileNetV3 depth pre-filter
`depth_map.pad`	3.1 MB	CDCN full depth analysis
`screen_reflection.pad`	3.1 MB	YOLOv5n screen replay detection
`face_embedding.pad`	2.2 MB	MobileFaceNet face consistency
`texture_2x7.pad`	748 KB	MiniFASNet v2 texture
`texture_4x0.pad`	748 KB	MiniFASNet v1SE texture
`face_detection.pad`	183 KB	BlazeFace detection
`face_enhance.pad`	39 KB	ESPCN super-resolution

Runtime Memory

Models are loaded lazily on first use, not all at once. At inference time, FP16 weights are upcasted to FP32 by TFLite kernels.

Component	Estimated Peak
All 8 TFLite interpreters	~25–30 MB
Frame buffers (camera, crops, depth maps)	~5–10 MB
Total peak	~30–40 MB

The depth_map model uses the GPU delegate when available, offloading its weights to GPU VRAM instead of main RAM.
For comparison, a typical social media app uses 200–400 MB at runtime.

Dependencies

Dependency	Version	Purpose
AGP	8.10.0	Build system
Kotlin	2.2.0	Language
CameraX	1.5.3	Camera preview + frame analysis
Compose BOM	2026.01.01	UI (Material3)
Lifecycle	2.8.7	ViewModel + Compose integration
LiteRT	1.4.1	TFLite model inference
Brotli Dec	0.1.2	Model asset decompression
Timber	5.0.1	Logging

Detection Capabilities

Attack Type	Detection	Primary Signals
Printed photo (still)	Strong	Frame similarity + texture + temporal
Printed photo (hand-held)	Strong	Texture + depth + DOF variance
LCD screen (static)	Strong	FFT moire + LBP + texture + screen reflection
Phone screen (static image)	Strong	Texture + depth + screen reflection (finger + device classes)
Phone screen (video replay)	Strong	Depth + texture + screen reflection + LBP
High-PPI OLED replay	Moderate–Strong	Depth + screen reflection + photometric
Face swap mid-challenge	Strong	Face embedding consistency (MobileFaceNet)
3D printed/silicone mask	Weak–Moderate	Texture + photometric; training data collection underway

Android Compatibility

Edge-to-edge: The SDK camera screen handles system bar insets correctly (status bar, navigation bar)
16KB page alignment: Native libraries use useLegacyPackaging = false for Android 15+ compatibility
Minimum SDK: API 26 (Android 8.0)
Target SDK: API 35

Known Limitations

Single RGB camera -- no hardware depth sensing. 3D mask attacks (silicone, paper, 3D-printed) have limited detection. Training data is being collected to improve mask detection via texture and photometric signals.
Single fixed challenge -- "move closer" is predictable. Randomized multi-challenge is on the roadmap.
Client-side only -- all decisions on-device. A rooted device can bypass the pipeline.
Model weights are obfuscated, not encrypted -- XOR scrambling deters casual extraction but is not cryptographically secure.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.github/workflows		.github/workflows
app		app
build-logic		build-logic
gradle		gradle
pad-core		pad-core
resources		resources
scripts		scripts
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
README.md		README.md
build.gradle.kts		build.gradle.kts
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
jitpack.yml		jitpack.yml
settings.gradle.kts		settings.gradle.kts

Folders and files

Latest commit

History

Repository files navigation

OpenPAD SDK

** This is an AI gen photo, source: https://this-person-does-not-exist.com/en

Integration Guide

1. Add the dependency

2. Initialize

3a. UI Mode -- Camera Verification Flow

3b. Headless Mode -- Bring Your Own Camera

4. Theme Customization (UI Mode)

5. Cleanup

Configuration Reference

Configuration Presets

Usage

Key parameter differences from Default

Public API

Architecture

Pipeline Overview

Detection Layers

Classification Gates

ML Aggregate Score

Challenge-Response Flow

UI Flow (UI Mode)

Model Assets

Model Packing & Quantization

Model Licenses

Screen Reflection Model (YOLOv5n)

Project Structure

Building

Memory Footprint

On-Disk (APK Contribution)

Runtime Memory

Dependencies

Detection Capabilities

Android Compatibility

Known Limitations

See Also

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages