Pre-install eval

MedAgents eval report.

A machine-readable install decision for agents: task fit, Trust Score, Audit Score, install safety, permission surface, and a concrete validation plan before this skill touches a workspace.

Needs reviewMEDIUM RISKREVIEW POLICY

Eval

Trust

Audit

Safety

manual review

Test manually in an isolated workspace and compare against safer alternatives.

Required gates

Checks an agent must pass before install

Open JSON

Task fit

pass

Task wording matches this skill metadata.

Evaluate MedAgents before installing it in an agent workflow
agent-frameworks
Coding agents workflows; Claude Code teams; builders willing to evaluate younger projects

Install path

pass

Install handoff is available.

npx skills add gersteinlab/MedAgents

Install command safety

pass

standard package or runtime install path

npx skills add gersteinlab/MedAgents

Trust score

warn

Potentially useful, but at least one trust signal needs human inspection.

Manual review
350 GitHub stars
Unknown

Audit score

warn

Needs review

License is unclear

Agent safety gate

warn

Sparse or mixed signals. Useful for discovery, but not for autonomous installation.

Test manually in an isolated workspace and compare against safer alternatives.
License is unclear

License clarity

warn

Unknown

Unknown

Permission surface

pass

filesystem or document access

Network access: medium
Filesystem access: medium

Validation plan

What the agent should do next

1Inspect repository, README/SKILL.md, license, and recent commits before production use.
2Install in an isolated workspace or sandbox with no production secrets available.
3Run the smallest representative task and record files touched, commands run, network access, and outputs.
4Compare the selected skill against at least one alternative when the eval status is review or failed.
5Promote only after the agent reports a successful verification result and unresolved warnings are accepted.

Do not use when

Conditions that require another skill

teams that require actively maintained dependencies
production agents without a repository review
Repository looks stale
License is unclear
Repository appears stale
Quality score needs review

Supporting checks

Trust signals behind the decision

README/SKILL.md completeness

pass

Metadata includes enough usage and workflow context

Recent maintenance

fail

2y since push

Alternatives available

pass

Alternative skills are available for comparison.