条件	Condition	描述
Control	无 system prompt	No system prompt
Baseline-Concise	"请简洁"	"Be concise"
Moyu-Lite	Moyu 精简版	Moyu Lite variant
Moyu-Standard	Moyu 标准版	Moyu Standard
Moyu-Strict	Moyu 严格版	Moyu Strict variant

ID	类型	Type	描述
S1	A	修复 complete_task 空指针 bug	Fix complete_task null pointer bug
S2	A	添加 list_tasks_sorted 函数	Add list_tasks_sorted function
S3	A	给 search 加 status 参数	Add status param to search
S4	A	添加 export_csv 函数	Add export_csv function
S5	A	给 list_tasks 加 assignee 筛选	Add assignee filter to list_tasks
S6	A	添加 bulk_complete 函数	Add bulk_complete function
S7	A	修复 delete_task 返回值	Fix delete_task return value
S8	A	添加 get_tasks_by_assignee	Add get_tasks_by_assignee
S9	B	重构为 context manager	Refactor to context managers
S10	B	添加 docstring	Add docstrings
S11	B	编写单元测试	Write unit tests
S12	C	修复 bug + 新功能	Fix bug + new feature

A = 过度工程高发区 (应缩减), B = 用户显式请求 (不应抑制), C = 混合 A = Over-engineering hotspots (should reduce), B = Explicit user requests (should NOT suppress), C = Mixed

模型	Model	提供商	Provider
Claude Sonnet 4	Anthropic
Claude Sonnet 4.5	Anthropic
Claude Haiku 4.5	Anthropic
GPT-5.4	OpenAI
GPT-5 Codex	OpenAI
Grok 4.1 Fast	xAI
Grok 4.20 Beta	xAI
LongCat Flash Chat	DeepSeek
LongCat Flash Thinking	DeepSeek
LongCat Flash Lite	DeepSeek

"Be concise" 与 Moyu 的区别 "Be Concise" vs Moyu

简单的 "请写简洁" 提示和 Moyu 在汇总 LOC 上效果相似（差异无统计显著性，p>0.25）。但 Moyu 的价值在于结构性改善——它减少的是过度工程信号（多余的 docstring、try/except、isinstance 检查），而非仅仅缩短代码。对特定模型（如 Haiku 4.5），Moyu 的 diff 缩减达到 49%。 A simple "be concise" prompt and Moyu show similar aggregate LOC reduction (difference not statistically significant, p>0.25). But Moyu's value is structural — it reduces over-engineering signals (unnecessary docstrings, try/except, isinstance checks), not just code length. For specific models (e.g. Haiku 4.5), Moyu achieved 49% diff reduction.

Lite vs Standard vs Strict Lite vs Standard vs Strict

Moyu-Lite 的 diff 缩减效果与 Standard 相当，适合小模型（大规则集反而是信息过载）。Standard 在 OE 信号消除上表现最稳定。Strict 对已有边界感的模型无额外增益。 Moyu-Lite achieves comparable diff reduction to Standard — ideal for smaller models (large rulesets cause information overload). Standard is most consistent at eliminating OE signals. Strict shows no additional benefit for models that already have good boundaries.

关键发现 Key Findings

LLM 在未受约束时确实存在系统性过度工程倾向，尤其在简单任务上表现最为突出 LLMs exhibit systematic over-engineering tendencies when unconstrained, especially pronounced in simple tasks
Moyu 的汇总 LOC 减少不具统计显著性（p=0.25），但在特定高讨好模型上效果显著——Haiku 4.5 的 diff 缩减 49%，OE 信号消除 100% Moyu's aggregate LOC reduction is not statistically significant (p=0.25), but the effect is pronounced on specific high-pleasing models — Haiku 4.5 saw 49% diff reduction and 100% OE signal elimination
简单的 "简洁" 提示在 diff 层面效果与 Moyu 相当，但 Moyu 在结构层面（AST 节点、OE 信号分解）更优 Simple "concise" prompts match Moyu at the diff level, but Moyu outperforms at the structural level (AST nodes, OE signal decomposition)
不同模型对 Moyu 策略的响应程度存在差异，提示了模型间过度工程行为的差异 Different models respond to Moyu with varying effectiveness, revealing inter-model differences in over-engineering behavior

局限性 Limitations

本研究的场景设计偏向于独立的代码修改任务，大型项目中的长对话场景需要进一步研究。此外，过度工程的定义本身具有主观性，不同团队和项目可能有不同标准。 The study's scenarios lean towards isolated code modification tasks; long-conversation scenarios in large projects require further investigation. Additionally, the definition of over-engineering is inherently subjective, with different teams and projects potentially having different standards.

结论 Conclusion

Moyu 证明了一种简单的、基于规则的 Prompt 策略可以有效地抑制 AI 编码助手的过度工程行为。它不需要模型微调，不依赖特定工具链，可以无缝集成到任何开发工作流中。少即是多 -- 有时，最好的代码就是没有写出来的代码。 Moyu demonstrates that a simple, rule-based prompt strategy can effectively suppress over-engineering behavior in AI coding assistants. It requires no model fine-tuning, depends on no specific toolchain, and integrates seamlessly into any development workflow. Less is more -- sometimes the best code is the code that was never written.

指标	Metric	描述
LOC	代码行数（不含空行和注释）	Lines of code (excl. blanks/comments)
OE Score	过度工程评分（0-10）	Over-engineering score (0-10)
Correctness	功能正确性（通过测试比例）	Functional correctness (test pass rate)
OE Signals	过度工程信号分解	Over-engineering signal decomposition

你的 AI 加了班，你也加了班 Your AI Worked Overtime. So Did You.

一行需求，四十三行代码 One Requirement, Forty-Three Lines

Prompt 生态三角 The Prompt Ecosystem Triangle

三条铁律 Three Iron Rules

只改被要求的 Only Change What Was Asked

用最简方案 Use the Simplest Solution

不确定就问 When Unsure, Ask

研究方法 Methodology

实验结果 Results

代码行数（按条件） Lines of Code by Condition

过度工程信号分解 OE Signal Decomposition

正确率 Correctness

模型 x 条件交互 Model x Condition Interaction

消融实验 Ablation Study

"Be concise" 与 Moyu 的区别 "Be Concise" vs Moyu

Lite vs Standard vs Strict Lite vs Standard vs Strict

B 类任务结果 B-Type Task Results

讨论与结论 Discussion & Conclusion

关键发现 Key Findings

局限性 Limitations

结论 Conclusion

参考文献 References