BLUE
Profile banner
AM
Ana Marasović
@anamarasovic.bsky.social
Asst prof @ University of Utah · NLP, XAI · she/her 🇭🇷
177 followers75 following116 posts

AManamarasovic.bsky.social

...after accounting for a model's bias toward certain answer choices, we show that Lanham et al. (2023)'s unfaithfulness drops significantly for smaller less-capable models; so what?

1
Profile banner
AM
Ana Marasović
@anamarasovic.bsky.social
Asst prof @ University of Utah · NLP, XAI · she/her 🇭🇷
177 followers75 following116 posts