Skip to content
View Shoko-official's full-sized avatar
Currently coding ...
Currently coding ...

Block or report Shoko-official

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Shoko-official/README.md

Hi, I'm Shoko

I build projects around machine learning, computer vision, and systems programming. My current focus is turning learning projects into cleaner, reproducible, and better-tested software.

Featured work

Project Stack What it shows Status
LightGBM-Cybersecurity Python, scikit-learn, LightGBM, pytest Packaged ML pipeline with preprocessing, baseline model, validation reports, runtime artifacts, and tests. Most production-like project
ML-Price-Guesser C++17, CMake, OpenMP Gradient-boosted decision tree pipeline for real-estate price prediction, built from scratch with temporal splitting and feature engineering. Active refactor / testing needed
LLM-From-Abs-Scratch C++17, CMake Neural-network layers and numerical checks built from scratch, with gradient checks for activation layers. Early research / learning project
Octobin Python, TensorFlow/Keras, OpenCV, Flask Real-time waste classification prototype using MobileNetV2 transfer learning and webcam-based correction workflow. Prototype
Infinite-Runner Python, Pygame 2D game loop, collisions, assets, scoring, pause/game-over states, and gameplay systems. Learning / game-dev project
Get_What_U_Need Python, Pygame Larger Pygame project with state management, asset loading, progression, audio, and fullscreen support. Collaborative / learning project

Experimental forks

These repositories are forks used for exploration. I do not present them as fully original framework rewrites unless the specific changes are documented in the repository.

Academic and learning repositories

These repositories are mainly coursework, exercises, or personal practice. They are kept public to show progression, but they are not the best representation of my current engineering standards.

Engineering standards I am improving

  • Add reproducible setup instructions to every serious project.
  • Keep generated files, IDE folders, datasets, and model artifacts out of Git when possible.
  • Add tests for core logic, especially C++ ML code and data preprocessing.
  • Document which repositories are original projects, forks, prototypes, or coursework.
  • Prefer clear configuration files over hardcoded local paths.
  • Add CI where the project can be built and tested reliably.

Tech I use

Languages: Python, C++, Java, C
ML/Data: scikit-learn, LightGBM, TensorFlow/Keras, pandas, NumPy
Systems/Tools: CMake, OpenMP, Git, pytest
Creative: Pygame, OpenCV


Thanks for visiting. The strongest repositories to review first are LightGBM-Cybersecurity, ML-Price-Guesser, and LLM-From-Abs-Scratch.

Pinned Loading

  1. ML-Price-Guesser ML-Price-Guesser Public

    C++ 10

  2. LLM-From-Abs-Scratch LLM-From-Abs-Scratch Public

    LLM-From-Abs-Scratch: A high-performance Large Language Model implementation built from the ground up in C++. Focuses on low-level optimization and architectural clarity.

    C++ 7

  3. tensorflow-turboquant tensorflow-turboquant Public

    Forked from tensorflow/tensorflow

    Turbocharged TensorFlow fork with experimental TurboQuant extension. High-performance weight-only, block-wise codebook quantization for Keras layers.

    C++ 7

  4. LightGBM-Cybersecurity LightGBM-Cybersecurity Public

    Cybersecurity Threat Detection: Leveraging LightGBM for high-speed, accurate network intrusion detection and anomaly analysis.

    Python 6

  5. Pytorch-TurboQuant Pytorch-TurboQuant Public

    Forked from pytorch/pytorch

    Pytorch-TurboQuant: High-performance weight-only quantization for PyTorch. Optimized for fast inference and reduced memory footprint.

    Python 5