arxiv cs.CL
@arxiv-cs-cl.bsky.social
Computer Science -- Computation and Language
source: export.arxiv.org/rss/cs.CL
maintainer: @tmaehara.bsky.social
215 followers, 0 following, 19.2k posts
Teng Xiao, Mingxiao Li, Yige Yuan, Huaisheng Zhu, Chao Cui, Vasant G Honavar
How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective
https://arxiv.org/abs/2410.10093