Your AI Worked Overtime. So Did You.

Moyu is an anti-over-engineering prompt strategy that keeps AI coding assistants from doing more than they should.

npx moyu@latest init
540 Experiments
3 LLM Models
—% LOC Reduction
—% Correctness

One Requirement, Forty-Three Lines

You asked for a simple function. AI gave you an industrial-grade solution.

Without Moyu (43 lines)
import math


def add(a, b):
    """
    Add two numbers together.

    Args:
        a (int | float): The first number.
        b (int | float): The second number.

    Returns:
        int | float: The sum of a and b.

    Raises:
        TypeError: If inputs are not numbers.
        OverflowError: If result exceeds limits.

    Examples:
        >>> add(1, 2)
        3
        >>> add(1.5, 2.5)
        4.0
    """
    if not isinstance(a, (int, float)):
        raise TypeError(
            f"Expected number for a, got {type(a).__name__}"
        )
    if not isinstance(b, (int, float)):
        raise TypeError(
            f"Expected number for b, got {type(b).__name__}"
        )
    try:
        result = a + b
    except OverflowError:
        raise OverflowError(
            "Result exceeds maximum float value"
        )
    if isinstance(result, float) and (
        math.isinf(result) or math.isnan(result)
    ):
        raise OverflowError(
            "Result is infinity or NaN"
        )
    return result
With Moyu (8 lines)
def add(a, b):
    """Add two numbers."""
    return a + b


def test_add():
    assert add(1, 2) == 3
    assert add(-1, 1) == 0

The Prompt Ecosystem Triangle

Three complementary strategies covering different AI coding failure modes.

Moyu
Keeps AI from overdoing it
PUA
Keeps AI from giving up
NoPUA
Keeps AI from giving up, with respect

"They keep AI from giving up. We keep AI from overdoing it."

Three Iron Rules

The core philosophy of Moyu, simple enough to fit on a sticky note.

01

Only Change What Was Asked

Do not proactively "improve", "optimize", or refactor unrelated code. Do what was asked, nothing more.

02

Use the Simplest Solution

If 3 lines work, don't write 30. Avoid unnecessary abstractions, design patterns, and type gymnastics.

03

When Unsure, Ask

Instead of guessing requirements and over-implementing, just ask. Communication beats assumption.
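The three rules are small enough to embed directly as a system prompt. Below is an illustrative sketch; the exact wording shipped by `npx moyu@latest init` is not reproduced on this page, so treat the phrasing as a placeholder.

```python
# Illustrative only: a Moyu-style system prompt assembled from the
# three rules above. The actual Moyu prompt text may differ.
MOYU_RULES = [
    "Only change what was asked. Do not refactor, 'improve', or "
    "optimize unrelated code.",
    "Use the simplest solution. If 3 lines work, do not write 30; "
    "avoid unnecessary abstractions, design patterns, and type gymnastics.",
    "When unsure, ask. Never guess requirements and over-implement.",
]


def moyu_system_prompt() -> str:
    """Render the rules as a numbered system prompt."""
    lines = [f"{i}. {rule}" for i, rule in enumerate(MOYU_RULES, start=1)]
    return "You are a coding assistant.\n" + "\n".join(lines)


print(moyu_system_prompt())
```

Keeping the rules as data rather than a hard-coded string makes it easy to produce the Lite/Standard/Strict variants described later by adding or tightening individual rules.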

Methodology

3 Models × 5 Conditions × 12 Scenarios × 3 Trials = 540 Experiments
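The full factorial design can be enumerated directly. The names below mirror the condition and scenario tables that follow:

```python
from itertools import product

models = ["Claude Sonnet 4", "GPT-4o", "Gemini 2.5 Pro"]
conditions = ["Control", "Baseline-Concise", "Moyu-Lite",
              "Moyu-Standard", "Moyu-Strict"]
scenarios = [f"S{i}" for i in range(1, 13)]
trials = range(1, 4)

# Every (model, condition, scenario, trial) combination is one run.
runs = list(product(models, conditions, scenarios, trials))
print(len(runs))  # 3 * 5 * 12 * 3 = 540
```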

Experimental Conditions

  Condition         Description
  Control           No system prompt
  Baseline-Concise  "Be concise"
  Moyu-Lite         Moyu Lite variant
  Moyu-Standard     Moyu Standard variant
  Moyu-Strict       Moyu Strict variant
Test Scenarios (S1-S12)

  ID   Type  Description
  S1   A     Fix complete_task null pointer bug
  S2   A     Add list_tasks_sorted function
  S3   A     Add status param to search
  S4   A     Add export_csv function
  S5   A     Add assignee filter to list_tasks
  S6   A     Add bulk_complete function
  S7   A     Fix delete_task return value
  S8   A     Add get_tasks_by_assignee
  S9   B     Refactor to context managers
  S10  B     Add docstrings
  S11  B     Write unit tests
  S12  C     Fix bug + new feature

A = over-engineering hotspots (should reduce); B = explicit user requests (should NOT suppress); C = mixed
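To make the Type A expectation concrete, here is a hypothetical solution for S6 (`bulk_complete`) written against an assumed dict-based task store. The benchmark's actual task-manager code is not shown on this page, so both the data shape and the function body are illustrative of the minimal output Moyu should elicit, not the real harness.

```python
# Hypothetical task store: {task_id: {"title": str, "done": bool}}.
# The benchmark's real data model is an assumption here.
def bulk_complete(tasks, task_ids):
    """Mark the given task ids as done; silently skip unknown ids."""
    for task_id in task_ids:
        if task_id in tasks:
            tasks[task_id]["done"] = True
    return tasks


tasks = {
    1: {"title": "write report", "done": False},
    2: {"title": "review PR", "done": False},
}
bulk_complete(tasks, [1, 3])  # id 3 does not exist and is skipped
```

Note what is absent: no custom exception hierarchy, no input-type validation, no logging. That is exactly the reduction the Type A scenarios are designed to measure.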

Models

  Model            Provider
  Claude Sonnet 4  Anthropic
  GPT-4o           OpenAI
  Gemini 2.5 Pro   Google
Metrics

  Metric       Description
  LOC          Lines of code (excluding blanks and comments)
  OE Score     Over-engineering score (0-10)
  Correctness  Functional correctness (test pass rate)
  OE Signals   Over-engineering signal decomposition
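The LOC metric is simple to compute. A sketch of one plausible counting convention, treating only full-line `#` comments as comments (the page does not specify how docstrings or trailing comments are handled, so this is an assumption):

```python
def count_loc(source: str) -> int:
    """Count non-blank, non-comment lines of Python source.

    Assumption: only full-line '#' comments are excluded; docstrings
    still count as code. The study's exact counting rules are not
    specified on this page.
    """
    count = 0
    for line in source.splitlines():
        stripped = line.strip()
        if stripped and not stripped.startswith("#"):
            count += 1
    return count


snippet = """
def add(a, b):
    # no validation needed
    return a + b
"""
print(count_loc(snippet))  # 2
```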
Statistical Methods

  • One-way ANOVA for comparing differences across conditions
  • Tukey HSD post-hoc tests for pairwise comparisons
  • Two-way ANOVA for model × condition interaction effects
  • Cohen's d for effect size calculations
  • Bootstrap confidence intervals (n=1000)
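Two of these statistics are easy to reproduce from scratch. A stdlib-only sketch of Cohen's d (pooled-SD form for equal group sizes) and a percentile bootstrap CI; the ANOVA and Tukey HSD steps would normally come from scipy/statsmodels and are omitted. The LOC samples are illustrative, not the study's data.

```python
import random
from statistics import mean, stdev


def cohens_d(a, b):
    """Cohen's d with pooled sample SD (equal-n form)."""
    pooled = ((stdev(a) ** 2 + stdev(b) ** 2) / 2) ** 0.5
    return (mean(a) - mean(b)) / pooled


def bootstrap_ci(data, stat=mean, n=1000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for `stat` over `data` (n resamples)."""
    rng = random.Random(seed)
    reps = sorted(stat(rng.choices(data, k=len(data))) for _ in range(n))
    lo = reps[int(alpha / 2 * n)]
    hi = reps[int((1 - alpha / 2) * n) - 1]
    return lo, hi


control_loc = [43, 38, 51, 40, 44]  # made-up example values
moyu_loc = [8, 9, 7, 10, 8]
print(round(cohens_d(control_loc, moyu_loc), 2))
print(bootstrap_ci(control_loc))
```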

Results

Moyu-Standard reduced LOC on Type A scenarios by ~—%, cut over-engineering signals by —%, and maintained —% functional correctness.

Lines of Code by Condition

OE Signal Decomposition

Correctness

Model × Condition Interaction

Ablation Study

"Be concise" vs the Moyu strategy / Lite vs Standard vs Strict / B-type task results

"Be Concise" vs Moyu

A simple "be concise" prompt does reduce code volume, but far less effectively than Moyu. The Baseline-Concise condition cut LOC by roughly 15-20%, while Moyu-Standard achieved a significantly larger reduction. More importantly, the "concise" prompt mainly trimmed comments and documentation rather than the over-engineering behavior itself.

Lite vs Standard vs Strict

Moyu-Lite provides basic over-engineering suppression; Standard strikes the best balance between LOC reduction and correctness; Strict compresses further but occasionally drops required functionality in complex (Type C) scenarios.

B-Type Task Results

Discussion & Conclusion

Key Findings

  • Unconstrained LLMs exhibit a systematic over-engineering tendency, most pronounced on simple tasks
  • Moyu-Standard achieved significant LOC reduction across all models and scenario types while maintaining or improving correctness
  • Simple "concise" prompts are insufficient to address over-engineering; a structured strategy is necessary
  • Models respond to Moyu with varying effectiveness, revealing inter-model differences in over-engineering behavior

Limitations

The study's scenarios lean toward isolated code-modification tasks; long-conversation scenarios in large projects require further investigation. Moreover, the definition of over-engineering is inherently subjective, and different teams and projects may apply different standards.

Conclusion

Moyu demonstrates that a simple, rule-based prompt strategy can effectively suppress over-engineering in AI coding assistants. It requires no model fine-tuning, depends on no specific toolchain, and integrates seamlessly into any development workflow. Less is more: sometimes the best code is the code that was never written.
