Solid
Hermes-Agent
67
/100
ClawScore
Solid
Promising orchestration bones, but the receipt stack is narrower than the product ambition.
Compare
Scores, criteria, facts, and verdicts aligned on one evidence-bound review surface.
Solid
67
/100
ClawScore
Solid
Promising orchestration bones, but the receipt stack is narrower than the product ambition.
Solid
61
/100
ClawScore
Solid
Interesting orchestration ideas, but the current evidence base makes this a cautious draft.
Capability
76
/100
Score
Pending
Hermes-Agent is rated on capability from currently bound launch evidence. Unsupported details remain Analysis until receipts are attached.
68
/100
Score
Pending
Paperclip is rated on capability from currently bound launch evidence. Unsupported details remain Analysis until receipts are attached.
Reliability
62
/100
Score
Pending
The draft withholds enthusiasm until there are stronger receipts for retries, state handling, and long-running workflows.
54
/100
Score
Pending
The evidence set is too thin to reward production reliability without a ClawLab pass.
Setup & DX
68
/100
Score
Pending
Hermes-Agent is rated on setup & dx from currently bound launch evidence. Unsupported details remain Analysis until receipts are attached.
62
/100
Score
Pending
Paperclip is rated on setup & dx from currently bound launch evidence. Unsupported details remain Analysis until receipts are attached.
Safety & Control
64
/100
Score
Pending
Hermes-Agent is rated on safety & control from currently bound launch evidence. Unsupported details remain Analysis until receipts are attached.
58
/100
Score
Pending
Paperclip is rated on safety & control from currently bound launch evidence. Unsupported details remain Analysis until receipts are attached.
Cost Efficiency
74
/100
Score
Pending
Hermes-Agent is rated on cost efficiency from currently bound launch evidence. Unsupported details remain Analysis until receipts are attached.
70
/100
Score
Pending
Paperclip is rated on cost efficiency from currently bound launch evidence. Unsupported details remain Analysis until receipts are attached.
Docs & Support
58
/100
Score
Pending
Documentation coverage appears narrower than the ambition of the orchestration model.
52
/100
Score
Pending
Documentation needs operator verification before this can publish confidently.
Momentum
66
/100
Score
Pending
Hermes-Agent is rated on momentum from currently bound launch evidence. Unsupported details remain Analysis until receipts are attached.
60
/100
Score
Pending
Paperclip is rated on momentum from currently bound launch evidence. Unsupported details remain Analysis until receipts are attached.