5 months ago

Qinyan Zhang Xinping Lei Ruijie Miao Yu Fu Haojie Fan Le Chang Jiafan Hou Dingling Zhang Zhongfei Hou Ziqiang Yang

Abstract

Large Language Models (LLMs) achieve strong performance on diverse tasks butoften exhibit cognitive inertia, struggling to follow instructions thatconflict with the standardized patterns learned during supervised fine-tuning(SFT). To evaluate this limitation, we propose Inverse IFEval, a benchmark thatmeasures models Counter-intuitive Abilitytheir capacity to overridetraining-induced biases and comply with adversarial instructions. InverseIFEval introduces eight types of such challenges, including QuestionCorrection, Intentional Textual Flaws, Code without Comments, andCounterfactual Answering. Using a human-in-the-loop pipeline, we construct adataset of 1012 high-quality Chinese and English questions across 23 domains,evaluated under an optimized LLM-as-a-Judge framework. Experiments on existingleading LLMs demonstrate the necessity of our proposed Inverse IFEvalbenchmark. Our findings emphasize that future alignment efforts should not onlypursue fluency and factual correctness but also account for adaptability underunconventional contexts. We hope that Inverse IFEval serves as both adiagnostic tool and a foundation for developing methods that mitigate cognitiveinertia, reduce overfitting to narrow patterns, and ultimately enhance theinstruction-following reliability of LLMs in diverse and unpredictablereal-world scenarios.

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

5 months ago

Benchmarks

Supervised Fine-Tuning

Dataset

AI Infra

Method/Architecture

Qinyan Zhang Xinping Lei Ruijie Miao Yu Fu Haojie Fan Le Chang Jiafan Hou Dingling Zhang Zhongfei Hou Ziqiang Yang

Abstract

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

5 months ago

Benchmarks

Supervised Fine-Tuning

Dataset

AI Infra

Method/Architecture

Qinyan Zhang Xinping Lei Ruijie Miao Yu Fu Haojie Fan Le Chang Jiafan Hou Dingling Zhang Zhongfei Hou Ziqiang Yang

Abstract

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Qinyan Zhang Xinping Lei Ruijie Miao Yu Fu Haojie Fan Le Chang Jiafan Hou Dingling Zhang Zhongfei Hou Ziqiang Yang11 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Qinyan Zhang Xinping Lei Ruijie Miao Yu Fu Haojie Fan Le Chang Jiafan Hou Dingling Zhang Zhongfei Hou Ziqiang Yang11 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Qinyan Zhang Xinping Lei Ruijie Miao Yu Fu Haojie Fan Le Chang Jiafan Hou Dingling Zhang Zhongfei Hou Ziqiang Yang11 more

Abstract

Build AI with AI

HyperAI Newsletters

Qinyan Zhang Xinping Lei Ruijie Miao Yu Fu Haojie Fan Le Chang Jiafan Hou Dingling Zhang Zhongfei Hou Ziqiang Yang

Qinyan Zhang Xinping Lei Ruijie Miao Yu Fu Haojie Fan Le Chang Jiafan Hou Dingling Zhang Zhongfei Hou Ziqiang Yang

Qinyan Zhang Xinping Lei Ruijie Miao Yu Fu Haojie Fan Le Chang Jiafan Hou Dingling Zhang Zhongfei Hou Ziqiang Yang