BLUE
AD
Anil Doshi
@thedosh.bsky.social
Researcher: firms and genAI/ChatGPT/ innovation/ fake news/social media. Teacher: data analytics and strategy. Politics and news junkie. Board gamer.
63 followers90 following20 posts
ADthedosh.bsky.social

We have a set of 60 business models and we ask generative AI to rank the business models (based on a series of pairwise comparisons), and we also ask over 50 strategy professors to do the same. What do we find?

1

ADthedosh.bsky.social

First, *individual* evaluations are quite problematic. LLMs (just like people) tend to be biased and inconsistent. Second, when we put together the evaluations from *multiple* LLMs and/or prompt approaches, generative AI evaluations look pretty similar to the strategy professors.

1
AD
Anil Doshi
@thedosh.bsky.social
Researcher: firms and genAI/ChatGPT/ innovation/ fake news/social media. Teacher: data analytics and strategy. Politics and news junkie. Board gamer.
63 followers90 following20 posts