Ka-Doe is a document processing and previewing service powered by FastAPI, OCR, and feline-grade modularity. It helps you extract, form, and chunk documents — gracefully and efficiently.
“A soft-pawed tabby whose whispers tamed tempests, and whose radiant eyes kindled love amid flames.”
- 📄 Document Parsing – Extracts content from PDFs and images
- 🧠 Preview Chunking – Breaks content into bite-sized, previewable pieces
- ⚙️ Customizable API – Modular endpoints using FastAPI
- 🐱 PurrfectKit Integration – Built-in utilities for smarter processing
- Python 3.12.4
- Tesseract OCR
- Poppler
- ffmpeg, libmagic1
- Git
sudo apt-get update
sudo apt-get install tesseract-ocr tesseract-ocr-tha poppler-utils ffmpeg libmagic1
pip install fastapi uvicorn python-multipart \
git+https://github.com/suwalutions/PurrfectKit.git@meowuvicorn main:appVisit http://127.0.0.1:8080/docs to explore the API using Swagger UI.
.
├── api/ # Core logic and routes
│ └── routes/v1/ # Versioned API endpoints
├── docs/ # MkDocs documentation
├── dockers/ # Docker & Compose setup
├── main.py # Application entrypoint
├── requirements.txt # Python dependencies
├── mkdocs.yml # Documentation site config
Full documentation is available via MkDocs:
pip install mkdocs mkdocs-material mkdocstrings[python]
mkdocs serve -a localhost:8001Then visit: http://127.0.0.1:8001
Contributions are welcome! To contribute:
- Fork this repo
- Create a new branch: git checkout -b feature-name
- Commit changes: git commit -am 'Add feature'
- Push: git push origin feature-name
- Open a pull request 🐾
MIT License. See LICENSE for more info.
Built with care, curiosity, and cat-like precision. Powered by Python, driven by passion.