π» Backend Engineer | Distributed Systems | AI Systems
π M.S. Computer Science β University of Florida
π Building scalable infrastructure, developer tools, and intelligent systems
I'm a backend-focused software engineer specializing in distributed systems, scalable APIs, and AI-powered platforms.
My work spans cloud infrastructure, data pipelines, compiler design, and agentic AI systems, with a strong focus on building production-grade systems that scale.
Currently working on large-scale climate data pipelines and HPC-based research infrastructure at the University of Florida.
β’ Distributed Systems
β’ AI Agent Infrastructure
β’ Data Pipelines & Research Platforms
β’ Backend Architecture
β’ Compiler Design
Java β’ Go β’ Python β’ C β’ SQL β’ JavaScript
Spring Boot β’ REST APIs β’ Microservices β’ Kafka β’ Redis
LLM APIs β’ Vector Databases β’ Data Pipelines β’ Zarr
PostgreSQL β’ MySQL β’ MongoDB β’ Redis
AWS β’ Docker β’ CI/CD β’ GitHub Actions
Python Scientific Stack β’ Climate Data Processing β’ HPC Workloads
A distributed agentic AI execution system that converts user requests into coordinated tasks executed by distributed workers.
flowchart LR
A[User Query] --> B[API Gateway / FastAPI]
B --> C[AI Planner Agent]
C --> D[Task Queue]
D --> E1[Worker Node 1]
D --> E2[Worker Node 2]
D --> E3[Worker Node N]
E1 --> F[Processing Engine]
E2 --> F
E3 --> F
F --> G[Vector Database]
G --> H[LLM Summarization Engine]
H --> I[API Response Returned]
β’ AI planner that decomposes complex tasks
β’ Distributed worker execution system
β’ Vector database knowledge storage
β’ LLM summarization pipeline
β’ Scalable queue-based architecture
Python β’ FastAPI β’ Redis β’ Celery β’ Vector DB β’ LLM APIs
High-performance distributed web crawler designed for large-scale data collection and indexing.
flowchart LR
A[Seed URLs / Scheduler] --> B[Crawl Queue]
B --> C1[Worker Node 1]
B --> C2[Worker Node 2]
B --> C3[Worker Node N]
C1 --> D[Fetcher Engine]
C2 --> D
C3 --> D
D --> E[HTML Parser]
E --> F[URL Extractor]
F --> B
E --> G[Content Processing]
G --> H[Storage Layer]
H --> I[Search / Index DB]
H --> J[Metadata DB]
β’ Distributed crawling workers
β’ Queue-based URL scheduling
β’ Rate limiting and politeness policies
β’ Content parsing and indexing
β’ Scalable horizontal architecture
Go β’ Kafka β’ Redis β’ PostgreSQL β’ Docker
Large-scale climate data pipeline and research platform developed at the University of Florida.
The platform integrates:
β’ NASA POWER weather datasets
β’ CHIRPS precipitation datasets
β’ Geographic shapefile-based site analysis
flowchart LR
A[NASA POWER API] --> D[Data Ingestion Pipeline]
B[CHIRPS Dataset] --> D
C[User Provided Shapefile] --> F[Geospatial Processor]
D --> E[Climate Data Processing]
E --> G[Zarr Data Store]
G --> H[Dataset Index Manager]
H --> I[Auto Dataset Updates]
F --> J[Site Dataset Generator]
G --> J
J --> K[ICASA File Generator]
J --> L[Visualization Engine]
L --> M[Interactive UI Dashboard]
K --> N[Downloadable Site Packages]
M --> O[Research Insights]
J --> P[HPC Execution Engine]
P --> Q[HiPerGator Cluster]
β’ Automated ingestion of climate datasets
β’ Zarr-based data storage and indexing
β’ Automatic dataset updates when new data becomes available
β’ HPC deployment on UF HiPerGator cluster
β’ Scientific experiment execution workflows
β’ Site-specific dataset generation via shapefiles
β’ Automated ICASA file generation
β’ Interactive UI for data visualization and insights
β’ Plot generation and data downloads
This platform replaces a previously manual research workflow, significantly reducing time required for climate data preparation for agricultural and environmental scientists.
Python β’ Zarr β’ Climate Data APIs β’ HPC (HiPerGator) β’ React β’ Data Visualization
Designed and implemented an object-oriented extension to Pascal with a full compiler pipeline.
β’ Classes and inheritance
β’ Access modifiers
β’ Semantic analysis
β’ LLVM IR code generation
ANTLR4 β’ LLVM β’ Compiler Design β’ Language Engineering
πΌ LinkedIn https://linkedin.com/in/itswael
π Portfolio https://itswael.github.io
π§ Email errwael@gmail.com
I enjoy building systems that replace manual workflows with automation β from AI task orchestration engines to research data pipelines that process climate datasets on HPC clusters.
β Explore my repositories and feel free to collaborate!