Keywords: image captioning, scene graph, DAG, FastAPI, multimodal, machine learning, open source, artificial intelligence, datasets, open model initiative, OMI
graphcap is an open source system for generating image captions and scene graphs using multiple analytical perspectives. The project combines a React-based user interface, a TypeScript data service, and a Python inference bridge to produce structured captions that conform to declarative JSON schemas.
- Multi-perspective captioning – captions are produced using declarative "perspectives" that describe prompts and output schemas.
- Modular architecture – separate microservices for the UI, data service, inference bridge and media processing, all coordinated through a local workspace volume.
- Provider abstraction – easily integrate OpenAI, Ollama, Gemini or other vision-language providers through the provider factory API.
- Extensible dataset management – upload, edit and organise images directly from the web interface.
- Sphinx documentation – full developer and user documentation is located in the doc/ directory.
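The "perspective" idea above can be illustrated with a minimal sketch. The field names (`name`, `prompt`, `schema`) and the validator below are assumptions for illustration only, not graphcap's actual perspective format:

```python
# Hypothetical sketch of a declarative perspective: a prompt paired with a
# JSON-style schema describing the structured caption it should produce.
# Field names are illustrative, not graphcap's real format.
ART_CRITIC_PERSPECTIVE = {
    "name": "art_critic",
    "prompt": "Describe the composition, palette, and mood of the image.",
    "schema": {
        "required": ["composition", "palette", "mood"],
    },
}


def missing_fields(perspective: dict, caption: dict) -> list[str]:
    """Return the schema keys absent from a generated caption."""
    required = perspective["schema"]["required"]
    return [key for key in required if key not in caption]


caption = {"composition": "rule of thirds", "palette": "warm earth tones"}
print(missing_fields(ART_CRITIC_PERSPECTIVE, caption))  # → ['mood']
```

Because each perspective is plain data rather than code, new captioning styles can be added without touching the inference bridge itself.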
The easiest way to run graphcap is with Docker Compose and the provided Taskfile commands. Ensure that Docker and the Task runner are installed, then execute:
```shell
# prepare configuration and build base images
task setup

# start all services in the background
task start
```

Once the services are running, visit http://localhost:32200 in your browser. The default workspace is stored inside the workspace/ directory of this repository. For more details on configuration and available services, see the installation guide.
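The provider abstraction mentioned earlier can be sketched as a small registry that maps provider names to client constructors. All class and function names here are hypothetical, assumed for illustration rather than taken from graphcap's actual provider factory API:

```python
# Hypothetical sketch of a provider factory: vision-language providers
# register a constructor under a name, and callers look them up at runtime.
# Names (register_provider, create_provider, OllamaClient) are illustrative.
from typing import Callable

_PROVIDERS: dict[str, Callable[..., object]] = {}


def register_provider(name: str):
    """Decorator that registers a provider client class under a lookup name."""
    def wrap(cls):
        _PROVIDERS[name] = cls
        return cls
    return wrap


def create_provider(name: str, **kwargs):
    """Instantiate the provider registered under `name`."""
    try:
        return _PROVIDERS[name](**kwargs)
    except KeyError:
        raise ValueError(f"unknown provider: {name!r}") from None


@register_provider("ollama")
class OllamaClient:
    # Default Ollama port is assumed here for illustration.
    def __init__(self, base_url: str = "http://localhost:11434"):
        self.base_url = base_url


client = create_provider("ollama")
```

A registry like this keeps the rest of the system decoupled from any single vendor: swapping OpenAI for Ollama becomes a one-line configuration change rather than a code change.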
```
apps/       # frontend and service applications
packages/   # shared libraries (TypeScript and Python)
doc/        # Sphinx documentation
workspace/  # local configuration and persistent volumes
```
Each package or application contains its own README with development instructions.
This project is licensed under the Apache 2.0 License. See the LICENSE file for details.
