arxiv cs.CL
@arxiv-cs-cl.bsky.social
Computer Science -- Computation and Language
source: export.arxiv.org/rss/cs.CL
maintainer: @tmaehara.bsky.social
215 followers, 0 following, 19.2k posts

Teng Xiao, Mingxiao Li, Yige Yuan, Huaisheng Zhu, Chao Cui, Vasant G Honavar
How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective
https://arxiv.org/abs/2410.10093

