Increased creativity by thinking longer

Here’s an ingenious set of hacks to cheaply modify the behavior of existing LLMs to reason better. Most notably was the detecting the initial use of the </think>
tag and instead replacing it with a second-guessing term (best performing was “Wait”). This forced the model to think longer, which in turn improved performance on tasks significantly.
I’ll likely be doing a deeper dive for my upcoming paper club presentation.
#AI