Add Hugging Face Gemma fine-tuning Colab aligned with Kauldron example #1421

fuyuan-li · 2025-12-26T15:43:03Z

This PR is a small holiday follow-up to the OpenSpiel 2.0 year-end announcement, generalizing the Gemma + Kauldron example to the Hugging Face ecosystem (#1414).

The notebook mirrors the original task structure (prompt → response with a loss mask), while using publicly available Hugging Face checkpoints and GPU-friendly QLoRA fine-tuning to make the example easier to run on T4 GPU on colab.

Thanks again for the OpenSpiel 2.0 wrap-up and happy holidays!

lanctot · 2025-12-26T22:30:11Z

Wow, that's amazing... thanks!!!

Happy Holidays!

Add Hugging Face Gemma + QLoRA fine-tuning example

74affe2

lanctot mentioned this pull request Dec 26, 2025

2025 Wrap-up: Fine-tuning Gemma with Kauldron Example ✦︎ #1414

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Hugging Face Gemma fine-tuning Colab aligned with Kauldron example #1421

Add Hugging Face Gemma fine-tuning Colab aligned with Kauldron example #1421

Uh oh!

fuyuan-li commented Dec 26, 2025

Uh oh!

lanctot commented Dec 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add Hugging Face Gemma fine-tuning Colab aligned with Kauldron example #1421

Are you sure you want to change the base?

Add Hugging Face Gemma fine-tuning Colab aligned with Kauldron example #1421

Uh oh!

Conversation

fuyuan-li commented Dec 26, 2025

Uh oh!

lanctot commented Dec 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants