[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
$ npx skills add FoundationVision/InfinityAlternatives
Compare similar skills by workflow fit, trust score, quality, GitHub adoption, maintenance, and install readiness.
Current skill
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
$ npx skills add FoundationVision/Infinity[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
$ npx skills add FoundationVision/VAR✨ Reverse-engineered Python API for Google Gemini web app
$ npx skills add HanaokaYuzu/Gemini-API🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
$ npx skills add huggingface/diffusersStable Diffusion web UI
$ npx skills add AUTOMATIC1111/stable-diffusion-webui🚀 AI 全自动短视频引擎 | AI Fully Automated Short Video Engine
$ npx skills add AIDC-AI/Pixelle-Video[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
$ npx skills add bytedance/UNOImage-to-Image Translation in PyTorch
$ npx skills add junyanz/pytorch-CycleGAN-and-pix2pixA framework for efficient model inference with omni-modality models
$ npx skills add vllm-project/vllm-omniA powerful tool that translates ComfyUI workflows into executable Python code.
$ npx skills add pydn/ComfyUI-to-Python-ExtensionCogView4, CogView3-Plus and CogView3(ECCV 2024)
$ npx skills add zai-org/CogView4This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation (CVPR2026 Highlight)''
$ npx skills add JIA-Lab-research/DreamOmni2Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
$ npx skills add invoke-ai/InvokeAIA 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.
$ npx skills add bytedance/Lance🚀 AI 全自动化视频生成员工 | Your First AIGC Coworker. Chat an Idea. Get a Film. 🦞
$ npx skills add HITsz-TMG/VideoClawOpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
$ npx skills add open-mmlab/mmagicHow to choose
Use an alternative when it has a clearer install path, higher trust score, fresher maintenance, or better platform fit for your current agent stack. Keep UniTok if it already passes your workflow test and repository review.
Next step
Open the compare page, test the install commands in a sandbox, and check each repository before using a skill in production.