Oh Lab

Count-MORL Public

Official code for ICML'23 paper [Model-based Offline Reinforcement Learning with Count-based Conservatism]

Python 6 1

APPO Public

Official code for ICLR'25 paper [Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning]

Jupyter Notebook 2

An Official code for AAAI'24 paper [Learning Uncertainty-Aware Temporally-Extended Actions]

Python 1 2

LinearBandit Public

Simulation for linear bandits: LinUCB, LinTS, LinPHE

Jupyter Notebook 1

sparsity-agnostic-lasso-bandit Public

Sparsity-Agnostic Lasso Bandit

Python

OFU-MNL-plus Public

Official code for NeurIPS'24 paper [Nearly Minimax Optimal Regret for Multinomial Logistic Bandit]

Python 1

Provide feedback