🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and traini…
$ npx skills add huggingface/transformersScenario Multimodal media · I need my agent to process images, video, or audio and extract useful information.
Claude Code + CLI · 4 targets