UI DESIGN LAB
  • 首页
  • 社区
  • 资源库
  • 知识库
  • 文档
EN登录

文档文章

平台与维护
网站概述功能开发与版本更新记录关于协作
Template 库
Prompt 看板Style 看板
Agent 工作流
Agent 深度解析项目创作页与 Agent 工作流Agent 输出链路Prompt Agent EvalsDesign System PanelsKnowledge Base GuideDeepSeek Agent 故障复盘
UI DESIGN LAB文档中心

公开阅读 Markdown 文档,管理员可进入编辑页维护。

Prompt Agent Evals

Prompt Agent Evals

Prompt Intelligence Agent 的 fixture 评测集、评分维度和运行方式。

返回文档中心

章节预览

FixturesScoringRun

Prompt Agent Evals

This suite validates the Prompt Intelligence Agent before it calls a model for code.

Fixtures

The fixtures live in src/lib/generation/evals/prompt-agent-cases.json and cover:

  • ambiguous briefs
  • conflicting style requests
  • long production prompts
  • reference links
  • revision requests
  • Chinese creative direction
  • dashboard interfaces
  • marketing pages
  • strict brand constraints
  • missing design-system input

Scoring

Each fixture checks the intended structure rather than visual taste:

  • intent profile is present
  • decision board captures tradeoffs and conflicts
  • at least 6 design-system panels are expected
  • prompt template payload is present
  • prompt trace is present when evidence matters
  • Prompt QA can block code generation

Run

npm run eval:prompt-agent

--live is reserved for a later model-backed runner. The default runner is fixture-only and does not call an LLM.