people have noticed that large language models get better at stuff like math and coding if the models have to spend time showing their work. apparently it can get excessive.
the prompt was “hi”
ALT
the eventual output was “Hi there! How can I assist you today?”