zhxieml

Follow

🤡

Zhihui Xie zhxieml

🤡

Follow

90 followers · 118 following

SJTU
Shanghai
05:33 (UTC +08:00)
https://zhxie.site

Achievements

Achievements

Highlights

Pro

Pinned Loading

HKUNLP/critic-rl HKUNLP/critic-rl Public

Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

Python 70 3
vlf-silkie/VLFeedback vlf-silkie/VLFeedback Public

Python 94 2
PDT PDT Public

Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer

Python 27 3
remiss-jailbreak remiss-jailbreak Public

Python 21
LSAR LSAR Public

Implementation of EMNLP 2022 paper: Discovering Low-rank Subspaces for Language-agnostic Multilingual Representations

Python 7
RelativeConUCB RelativeConUCB Public

Implementation of SIGIR 2021 paper: Comparison-based Conversational Recommender System with Relative Bandit Feedback

Python 9 3