Photo Face Indexing Scripts

This repository contains the data preparation scripts used to build a face-searchable photo website.

These scripts are intended to be run locally, once, in sequence. The resulting outputs (faces.db and output/) are then used by a separate Flask web application.

Overview

Pipeline:

Generate face embeddings database from local photos
Generate thumbnails for fast web display
Fetch Google Drive shareable links using rclone
Parse rclone JSON output
Inject Drive URLs into the face database

Requirements

Python 3.8+
OpenCV
face_recognition (dlib)
NumPy
SQLite
rclone (configured with Google Drive)

Script Pipeline

1. Generate face database

python 01_generate_face_db.py

What it does:

    Recursively scans all local photo folders

    Detects all faces in every image

    Stores:

        image path

        face embedding

    Output:

        faces.db

2. Generate thumbnails

python 02_generate_thumbnails.py

What it does:

    Creates resized thumbnails for each photo

    Preserves folder structure

    Output:

        output/ directory

3. Get Google Drive links using rclone

First, configure rclone:

rclone config

Then run:

bash 03_get_drive_links.sh

What it does:

    Uses rclone lsjson

    Produces a JSON listing of all Drive files and paths

4. Parse rclone JSON output

python 04_parse_rclone_json.py

What it does:

    Converts rclone JSON output into clean
    local-path → Google Drive URL mappings

5. Inject Drive URLs into database

Before running this step, add a new column to the database:

ALTER TABLE faces ADD COLUMN drive_url TEXT;

Then run:

python 05_inject_drive_links.py

What it does:

    Matches local image paths with Drive URLs

    Injects URLs into faces.db

Final Outputs

After completing all steps, you will have:

    faces.db

        face embeddings

        image paths

        Google Drive URLs

    output/

        thumbnails for web display

These outputs are consumed by the web application, which lives in a
separate folder/repository.

🚀 Deploying the Web App to Railway

This section explains how to deploy the Flask web application using the pre-generated data (faces.db and thumbnails) on Railway.

The face indexing scripts are intended to be run locally. Only the web application is deployed to Railway.

Prerequisites

Before deploying, ensure you have:

faces.db generated by the scripts
output/ directory containing thumbnails
A working Flask web application
Docker installed locally
A Railway account
A GitHub account (for GitHub Container Registry)

Project Structure (Web App)

Your web app directory should look similar to this:

web/
├── app.py
├── wsgi.py
├── Dockerfile
├── requirements.txt
├── faces.db
├── output/
│ └── ...
├── templates/
│ ├── index.html
│ └── login.html

faces.db and output/ are copied from the script outputs.

1. Build Docker Image Locally

The Docker image must be built locally due to heavy dependencies. Railway will only run the image, not build it.

From inside the web app directory:

docker build -t ghcr.io/<your-username>/face-recog:latest .

2. Push Image to GitHub Container Registry (GHCR)

Login to GHCR

Create a GitHub Personal Access Token with the following scopes:

write:packages

read:packages

Then authenticate:

echo YOUR_GITHUB_TOKEN | docker login ghcr.io -u <your-username> --password-stdin

Push the Image

docker push ghcr.io/<your-username>/face-recog:latest

After pushing, set the package visibility to Public in GitHub → Packages → Package Settings. 3. Deploy on Railway

Open the Railway dashboard

Create a new project

Choose Deploy from Docker Image

Enter the image URL:

ghcr.io/<your-username>/face-recog:latest

Railway will pull and run the image directly.

4. Set Environment Variables

In the Railway service settings, add the following variables:

APP_PASSWORD=your_login_password
SECRET_KEY=some_long_random_string

Both variables are required for the app to start.

5. Access the Application

After deployment completes, Railway assigns a public URL:

https://<service-name>.up.railway.app

The URL can be found under:

Service → Settings → Domains

Open the URL, log in, and upload a photo to find matching images.

Notes on Performance and Usage

Only one face search is processed at a time to ensure stability

High-resolution uploads are automatically downscaled server-side

Google Drive serves full-quality images when thumbnails are clicked

Designed for private, not public-scale traffic

Updating the Deployment

After making changes to the web app:

docker build -t ghcr.io/<your-username>/face-recog:latest .
docker push ghcr.io/<your-username>/face-recog:latest

Then redeploy the service in Railway

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Photo Face Indexing Scripts

Overview

Requirements

Script Pipeline

1. Generate face database

2. Generate thumbnails

3. Get Google Drive links using rclone

4. Parse rclone JSON output

5. Inject Drive URLs into database

🚀 Deploying the Web App to Railway

Prerequisites

Project Structure (Web App)

1. Build Docker Image Locally

2. Push Image to GitHub Container Registry (GHCR)

4. Set Environment Variables

5. Access the Application

Notes on Performance and Usage

Updating the Deployment

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
website		website
01_generate_face_db.py		01_generate_face_db.py
02_generate_thumbnails.py		02_generate_thumbnails.py
03_get_drive_links.sh		03_get_drive_links.sh
04_parse_rclone_json.py		04_parse_rclone_json.py
05_inject_drive_links.py		05_inject_drive_links.py
README.md		README.md

Megh-Rana/face-photo-index

Folders and files

Latest commit

History

Repository files navigation

Photo Face Indexing Scripts

Overview

Requirements

Script Pipeline

1. Generate face database

2. Generate thumbnails

3. Get Google Drive links using rclone

4. Parse rclone JSON output

5. Inject Drive URLs into database

🚀 Deploying the Web App to Railway

Prerequisites

Project Structure (Web App)

1. Build Docker Image Locally

2. Push Image to GitHub Container Registry (GHCR)

4. Set Environment Variables

5. Access the Application

Notes on Performance and Usage

Updating the Deployment

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages