All

7 repositories

CPQL
Public
Official code for ICLR'26 paper [Peng's Q(λ) for Conservative Value Estimation in Offline Reinforcement Learning]
MIT License
•0•0•0•0•Updated Feb 27, 2026Feb 27, 2026
OFU-MNL-plus
Public
Official code for NeurIPS'24 paper [Nearly Minimax Optimal Regret for Multinomial Logistic Bandit]
Python
•
MIT License
•1•0•0•0•Updated Oct 23, 2025Oct 23, 2025
APPO
Public
Official code for ICLR'25 paper [Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning]
Jupyter Notebook
•0•2•0•0•Updated Feb 28, 2025Feb 28, 2025
UTE-Uncertainty-aware-Temporal-Extension
Public
An Official code for AAAI'24 paper [Learning Uncertainty-Aware Temporally-Extended Actions]
Python
•
Apache License 2.0
•2•1•2•0•Updated Mar 2, 2024Mar 2, 2024
LinearBandit
Public
Simulation for linear bandits: LinUCB, LinTS, LinPHE
Jupyter Notebook
•
Apache License 2.0
•1•0•0•0•Updated Mar 2, 2024Mar 2, 2024
sparsity-agnostic-lasso-bandit
Public
Sparsity-Agnostic Lasso Bandit
Python
•0•0•0•0•Updated Aug 7, 2023Aug 7, 2023
Count-MORL
Public
Official code for ICML'23 paper [Model-based Offline Reinforcement Learning with Count-based Conservatism]
Python
•
MIT License
•1•6•0•0•Updated May 29, 2023May 29, 2023

ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.