Code Correctness Is Not in the Text
Probing the limits of Chain-of-Thought reasoning traces. A study on domain-dependent measurement invariance in LLMs.
Probing the limits of Chain-of-Thought reasoning traces. A study on domain-dependent measurement invariance in LLMs.