Oh Lab
Popular repositories Loading
-
Count-MORL
Count-MORL PublicOfficial code for ICML'23 paper [Model-based Offline Reinforcement Learning with Count-based Conservatism]
-
UTE-Uncertainty-aware-Temporal-Extension
UTE-Uncertainty-aware-Temporal-Extension PublicAn Official code for AAAI'24 paper [Learning Uncertainty-Aware Temporally-Extended Actions]
-
LinearBandit
LinearBandit PublicSimulation for linear bandits: LinUCB, LinTS, LinPHE
Jupyter Notebook 1
-
sparsity-agnostic-lasso-bandit
sparsity-agnostic-lasso-bandit PublicSparsity-Agnostic Lasso Bandit
Python
-
OFU-MNL-plus
OFU-MNL-plus PublicOfficial code for NeurIPS'24 paper [Nearly Minimax Optimal Regret for Multinomial Logistic Bandit]
Python 1
Repositories
- CPQL Public
Official code for ICLR'26 paper [Peng's Q(λ) for Conservative Value Estimation in Offline Reinforcement Learning]
oh-lab/CPQL’s past year of commit activity - OFU-MNL-plus Public
Official code for NeurIPS'24 paper [Nearly Minimax Optimal Regret for Multinomial Logistic Bandit]
oh-lab/OFU-MNL-plus’s past year of commit activity - APPO Public
Official code for ICLR'25 paper [Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning]
oh-lab/APPO’s past year of commit activity - UTE-Uncertainty-aware-Temporal-Extension Public
An Official code for AAAI'24 paper [Learning Uncertainty-Aware Temporally-Extended Actions]
oh-lab/UTE-Uncertainty-aware-Temporal-Extension’s past year of commit activity - Count-MORL Public
Official code for ICML'23 paper [Model-based Offline Reinforcement Learning with Count-based Conservatism]
oh-lab/Count-MORL’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…