🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
$ npx skills add huggingface/transformersScenario
Multimodal media
I need my agent to process images, video, or audio and extract useful information.
Agents
Claude Code + CLI + Codex
4 install targets
Maintenance
fresh
Pushed today
Risk
Safe to try
No major risk signals from available metadata
Gate
allow
Verified