BLUE

Conspirador Norteño

@conspirator0.bsky.social

Data Scientist/Musician/Participant in the General Confusion. Resist autocracy and research/counter disinformation. I serve the realm. conspirator0.substack.com/ www.youtube.com/watch?v=F4_Li-f6DRQ

2k followers930 following696 posts

CNconspirator0.bsky.socialSep 20, 2024 4:07am

The results are extremely clear. Having the LLM generate Python code to do math and then running the resulting code was both faster and more accurate than having the LLM do the math itself, and the difference in performance widens as the numbers get larger.

table of results by operation and maximum operand value

line graph comparing accuracy of the two calculation methods

line graph comparing speed of the two calculation methods

CNconspirator0.bsky.socialSep 20, 2024 4:07am

More detail (and Python code) in this Substack post: conspirator0.substack.com/p/chatbots-a...

Chatbots and basic arithmetic

Prompting an LLM to write Python code to do simple math is more efficient and more accurate than having the LLM attempt to do the math itself

Conspirador Norteño

@conspirator0.bsky.social

Data Scientist/Musician/Participant in the General Confusion. Resist autocracy and research/counter disinformation. I serve the realm. conspirator0.substack.com/ www.youtube.com/watch?v=F4_Li-f6DRQ

2k followers930 following696 posts