DL and hypermedia systems

abdelkareemkobo/README.md

Abdelkareem Elkhateb

Arabic NLP Researcher • AI Engineer

Making AI models small, fast, and actually useful.

kareemai.com


About

I'm an NLP Engineer at Xbites, building the AI backend for Darin. I research Arabic embeddings with Hamza Salem Lab and contribute to NAMMA for open-source Arabic AI.

My thesis explores efficient transformer architectures for CPU deployment—focusing on model compression, quantization, and edge AI.

Research Metrics: 7 papers • 54 citations • 962 papers read


Current Focus

Arabic NLP — Building efficient language models for Arabic text understanding
Edge AI — Compressing transformers to run on resource-constrained devices
RAG Systems — Semantic search and retrieval with Qdrant for production
Research Engineering — Bridging academic research with production ML systems


Selected Work

BertHash-Femto

113× smaller than AraBERTv2 with 94% accuracy. Runs inference on edge devices.
GitHub • Python, PyTorch, ONNX

Zarra & Bojji

Tiny Arabic language models optimized for mobile devices.
Article • Model Compression, TensorFlow Lite

كم كالوري (KamCalorie)

Arabic-first nutrition search engine with NLP-powered food recognition.
Live • FastAPI, Astro.js, Arabic NLP

GPUVec

Real-time GPU pricing tracker and ML benchmarking platform.
Live • Data Engineering, Web Scraping

SEO Rat

SEO optimization tool for markdown-based static sites.
GitHub • Python, NLP, Markdown Processing


Technical Background

Languages: Python, Rust, C++, JavaScript
ML/AI: PyTorch, TensorFlow, Transformers, vLLM, FastAI
MLOps: Docker, Podman, CUDA, ONNX Runtime
Web: FastAPI, Astro.js, FastHTML, Svelte, Supabase
Databases: Qdrant, PostgreSQL, Vector Search


Writing

I write about Arabic NLP, model compression, and AI engineering at kareemai.com/blog.

Subscribe to my newsletter: gpuvec.substack.com


Research

Google Scholar • ResearchGate • Papers


Connect

LinkedIn • X/Twitter • Upwork • Email


"اللغة ليست عِلمًا .. بل هي شيء فوق العلم"
"Language is not a science — it is something above science."

Pinned

  1. seo_rat Public

    SEO Rat is an SEO tool that helps you optimize Markdown-based content for static sites built with tools like Astro or Quarto.

    Jupyter Notebook 1

  2. detectron_hacking Public

    Jupyter Notebook

  3. hamos Public

    Python

  4. Sariqat-al-Lahzat Public

    Jupyter Notebook

  5. arabic-tweets-classification Public

    Classify Arabic tweets as ham or spam with transformer models.

    Jupyter Notebook

  6. dinvo2-similar-image- Public

    An implementation showing how to use DINOv2 with FAISS to search for similar images in your dataset, or to find similar art the way Pinterest does :)

    Jupyter Notebook 3