Big Data Engineer · Data Scientist · ML Engineer
I build scalable ML systems on Apache Spark and production-grade data/AI pipelines with Docker/Kubernetes.
- 🧠 Focus: Big Data ML (SparkML/PySpark), MLOps, ML Engineering, Anomaly Detection
- 🏢 Current: TÜBİTAK BİLGEM — Cloud Computing and Big Data Research Laboratory (B3LAB)
- 🎓 MSc CS @ Istanbul Technical University
| Project ⚙️ | Description 📝 | Stars ⭐ |
|---|---|---|
| Building-Detection-MaskRCNN | Building detection with Mask R-CNN | |
| Federated-Learning-Comparative-Study | Federated learning experiments + comparative evaluation (Flower, fairness) | |
| Knowledge-and-Dataset-Distillation | Knowledge and Dataset Distillation on the MNIST Dataset |
| Project ⚙️ | Description 📝 | Stars ⭐ |
|---|---|---|
| Text-Similarity-Towhee-MilvusDB | Similarity search using embeddings and Milvus |
| Project ⚙️ | Description 📝 | Stars ⭐ |
|---|---|---|
| Dinosaur-Game-with-Deep-Q-Learning | DQN agent for the Chrome Dino game | |
| DRL-Competition 2024 | Deep Reinforcement Learning (DRL) agents to learn resource collection and combat |
Big Data / Lakehouse: PySpark · SparkML · Iceberg · S3 · SQL · MongoDB
Messaging: Kafka · MQTT
Monitoring: Airflow · Grafana
Analytics: Apache Druid · Superset
ML/LLM: PyTorch · (TensorFlow) · Ollama · LangChain
DevOps: Docker · Kubernetes · CI/CD · Git · Linux
Backend: Python · Go · REST · GraphQL
- Enhancing Credit Risk Assessment with Federated Learning (EAI ROSENET 2024) Access
- Small Object Detection and Tracking from Aerial Imagery (UBMK 2021) Access
- 💼 Preferred: LinkedIn — https://www.linkedin.com/in/mustafaaktasinfo
- 📫 Email: available on request via LinkedIn
- 🧑💻 GitHub: https://github.com/Mstfakts


