UI DESIGN LAB
  • Home
  • Community
  • Library
  • Knowledge
  • Docs
ZHLogin

Docs

Platform
Site OverviewDevelopment LogCollaboration Guide
Template Library
Prompt BoardStyle Board
Agent Workflow
Agent Deep DiveProject Workflow AgentAgent Output PathPrompt Agent EvalsDesign System PanelsKnowledge Base GuideDeepSeek Agent Incident Notes
UI DESIGN LABDocumentation

Read public Markdown docs; admins can maintain them in editor pages.

Prompt Agent Evals

Prompt Agent Evals

Fixture suite, scoring dimensions, and run instructions for the Prompt Intelligence Agent.

Back to docs

Chapter preview

FixturesScoringRun

Prompt Agent Evals

This suite validates the Prompt Intelligence Agent before it calls a model for code.

Fixtures

The fixtures live in src/lib/generation/evals/prompt-agent-cases.json and cover:

  • ambiguous briefs
  • conflicting style requests
  • long production prompts
  • reference links
  • revision requests
  • Chinese creative direction
  • dashboard interfaces
  • marketing pages
  • strict brand constraints
  • missing design-system input

Scoring

Each fixture checks the intended structure rather than visual taste:

  • intent profile is present
  • decision board captures tradeoffs and conflicts
  • at least 6 design-system panels are expected
  • prompt template payload is present
  • prompt trace is present when evidence matters
  • Prompt QA can block code generation

Run

npm run eval:prompt-agent

--live is reserved for a later model-backed runner. The default runner is fixture-only and does not call an LLM.