Skip to content
Change the repository type filter

All

    Repositories list

    • CPQL

      Public
      Official code for ICLR'26 paper [Peng's Q(λ) for Conservative Value Estimation in Offline Reinforcement Learning]
      MIT License
      0000Updated Feb 27, 2026Feb 27, 2026
    • Official code for NeurIPS'24 paper [Nearly Minimax Optimal Regret for Multinomial Logistic Bandit]
      Python
      MIT License
      1000Updated Oct 23, 2025Oct 23, 2025
    • APPO

      Public
      Official code for ICLR'25 paper [Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning]
      Jupyter Notebook
      0200Updated Feb 28, 2025Feb 28, 2025
    • An Official code for AAAI'24 paper [Learning Uncertainty-Aware Temporally-Extended Actions]
      Python
      Apache License 2.0
      2120Updated Mar 2, 2024Mar 2, 2024
    • Simulation for linear bandits: LinUCB, LinTS, LinPHE
      Jupyter Notebook
      Apache License 2.0
      1000Updated Mar 2, 2024Mar 2, 2024
    • Sparsity-Agnostic Lasso Bandit
      Python
      0000Updated Aug 7, 2023Aug 7, 2023
    • Official code for ICML'23 paper [Model-based Offline Reinforcement Learning with Count-based Conservatism]
      Python
      MIT License
      1600Updated May 29, 2023May 29, 2023
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.