BLUE

John David Pressman

@jdp.extropian.net

LLM developer, alignment-accelerationist, Fedorovist ancestor simulator, Dreamtime enjoyer. All posts public domain under CC0 1.0.

302 followers, 166 following, 481 posts

jdp.extropian.net · Sep 10, 2024 10:18pm

Part of the solution is basic redesign. I'm thinking that before the agent can mark a task complete, it needs to pass a test suite, and instead of making local evaluations, the evaluation stage should contribute to a namespace of tests for that part of the task. This inhibits confabulated completions.
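A minimal sketch of that idea, under loud assumptions: the names `CheckRegistry`, `Task`, `contribute`, and `mark_complete` are hypothetical and not taken from any real agent framework. Evaluation steps add checks to a namespace shared by one part of the task, and the task can only be marked complete if every accumulated check in that namespace passes.

```python
from collections import defaultdict
from typing import Callable, Dict, List

Check = Callable[[], bool]


class CheckRegistry:
    """Hypothetical shared store: task namespace -> accumulated checks."""

    def __init__(self) -> None:
        self._checks: Dict[str, List[Check]] = defaultdict(list)

    def contribute(self, namespace: str, check: Check) -> None:
        # Each evaluation stage adds to the namespace instead of judging locally.
        self._checks[namespace].append(check)

    def run(self, namespace: str) -> bool:
        checks = self._checks.get(namespace, [])
        # An empty namespace fails, so "no evidence" never counts as success.
        return bool(checks) and all(check() for check in checks)


class Task:
    """Hypothetical task whose completion is gated on its namespace of checks."""

    def __init__(self, namespace: str, registry: CheckRegistry) -> None:
        self.namespace = namespace
        self.registry = registry
        self.complete = False

    def mark_complete(self) -> bool:
        # The agent's claim is ignored; only the checks decide.
        self.complete = self.registry.run(self.namespace)
        return self.complete


registry = CheckRegistry()
task = Task("write_report", registry)
state = {"report": None}

registry.contribute("write_report", lambda: state["report"] is not None)
registry.contribute("write_report", lambda: len(state["report"] or "") > 10)

first_attempt = task.mark_complete()   # agent claims done before writing anything
state["report"] = "A finished report body."
second_attempt = task.mark_complete()  # same checks, now with real output
```

Because the checks live in a shared namespace and not in a single evaluation call, a later stage cannot pass simply by asserting success; it has to satisfy everything earlier stages contributed.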

jdp.extropian.net · Sep 10, 2024 10:23pm

But ultimately it's not clear to me that these models have a unified sense of the cyberspatial environment, and this hobbles their local understanding and intelligence. It's much like how a blind-deaf child whose parent picks them up and carries them too much never learns a connected environment.
