Comment on Secret calculator hack brings ChatGPT to the TI-84, enabling easy cheating

<- View Parent
jacksilver@lemmy.world ⁨1⁩ ⁨month⁩ ago

LLMs do suck at math, if you look into it, the o1 models actually escape the LLM output and write a python function to calculate the output, I’ve been able to break their math functions by asking for functions that use math not in the standard Python library.

I know someone also wrote a wolfram integration to help solve LLMs math problems.

source
Sort:hotnewtop