BLUE
GB
Glen Berseth
@glenberseth.bsky.social
18 followers12 following8 posts
GBglenberseth.bsky.social

We show that reducing churn by regularizing out-of-batch data reduces these chain effects and results in improved sample efficiency and scaling. #deepRL#reinforcementlearning

1

GB
Glen Berseth
@glenberseth.bsky.social
18 followers12 following8 posts