Pre-install eval

Nutshell eval report.

A machine-readable install decision for agents: task fit, Trust Score, Audit Score, install safety, permission surface, and a concrete validation plan before this skill touches a workspace.

Needs reviewMEDIUM RISKREVIEW POLICY

Eval

Trust

Audit

Safety

manual review

Review the audit page, then allow agent install in a sandboxed workflow.

Required gates

Checks an agent must pass before install

Open JSON

Task fit

pass

Task wording matches this skill metadata.

Evaluate Nutshell before installing it in an agent workflow
legal-compliance
Legal and compliance workflows; Claude Code teams; builders willing to evaluate younger projects

Install path

pass

Install handoff is available.

npx skills add cashubtc/nutshell

Install command safety

pass

standard package or runtime install path

npx skills add cashubtc/nutshell

Trust score

warn

Good trust signals with a few areas worth checking before rollout.

Strong shortlist
485 GitHub stars
MIT

Audit score

pass

Safe to try

Quality score needs review

Agent safety gate

warn

Good audit and safety signals with no high-risk permission hints in public metadata.

Review the audit page, then allow agent install in a sandboxed workflow.
Safe-to-try audit

License clarity

pass

MIT

Permission surface

warn

filesystem or document access, network or browser access

Network access: medium
Filesystem access: medium

Validation plan

What the agent should do next

1Inspect repository, README/SKILL.md, license, and recent commits before production use.
2Install in an isolated workspace or sandbox with no production secrets available.
3Run the smallest representative task and record files touched, commands run, network access, and outputs.
4Compare the selected skill against at least one alternative when the eval status is review or failed.
5Promote only after the agent reports a successful verification result and unresolved warnings are accepted.

Do not use when

Conditions that require another skill

teams that need a vendor-supported SLA
high-compliance environments without internal security review
No OpenAgentSkill engagement data yet
Quality score needs review
Production credentials, payments, or irreversible account changes without explicit human review
Sensitive private data before reviewing repository code, license, and permission surface

Supporting checks

Trust signals behind the decision

README/SKILL.md completeness

warn

Public metadata needs stronger README/SKILL.md context

Recent maintenance

pass

100

2d since push

Alternatives available

pass

Alternative skills are available for comparison.