Prototypical Reward Network for Data-Efficient RLHF

Jan 1, 2024·
Jinghan Zhang
,
Xiting Wang
Yiqiao Jin
Yiqiao Jin
,
Changyu Chen
,
Xinhao Zhang
,
Kunpeng Liu
· 1 min read
Abstract
We propose a prototypical reward network that enables data-efficient reinforcement learning from human feedback (RLHF) for large language models.
Type
Publication
Annual Meeting of the Association for Computational Linguistics (ACL) 2024

Abstract

We propose a prototypical reward network that enables data-efficient reinforcement learning from human feedback (RLHF) for large language models.

Keywords

Reinforcement Learning, Human Feedback, Large Language Models, Data Efficiency

Yiqiao Jin
Authors
Ph.D. Candidate in Computer Science
My research focuses on adaptive and efficient AI systems, with emphasis on LLM agents, agent memory, self-distillation, multimodal LLMs, and structured multi-agent intelligence.