算力平台:
管道
管道提供了一种简单的方法,通过将所有必要的组件(多个独立训练的模型、调度器和处理器)捆绑到一个端到端的类中,来在推理过程中运行最先进的扩散模型。管道是灵活的,可以适应使用不同的调度器甚至模型组件。
所有管道都基于 [DiffusionPipeline] 类构建,该类提供了加载、下载和保存所有组件的基本功能。特定的管道类型(例如 [StableDiffusionPipeline])使用 [~DiffusionPipeline.from_pretrained] 加载时,会自动检测并加载管道组件,并传递给管道的 __init__ 函数。
下表列出了 🤗 Diffusers 中目前可用的所有管道及其支持的任务。点击一个管道以查看其摘要和已发表的论文。
| Pipeline | Tasks |
|---|---|
| aMUSEd | text2image |
| AnimateDiff | text2video |
| Attend-and-Excite | text2image |
| AudioLDM | text2audio |
| AudioLDM2 | text2audio |
| AuraFlow | text2image |
| BLIP Diffusion | text2image |
| CogVideoX | text2video |
| Consistency Models | unconditional image generation |
| ControlNet | text2image, image2image, inpainting |
| ControlNet with Flux.1 | text2image |
| ControlNet with Hunyuan-DiT | text2image |
| ControlNet with Stable Diffusion 3 | text2image |
| ControlNet with Stable Diffusion XL | text2image |
| ControlNet-XS | text2image |
| ControlNet-XS with Stable Diffusion XL | text2image |
| Dance Diffusion | unconditional audio generation |
| DDIM | unconditional image generation |
| DDPM | unconditional image generation |
| DeepFloyd IF | text2image, image2image, inpainting, super-resolution |
| DiffEdit | inpainting |
| DiT | text2image |
| Flux | text2image |
| Hunyuan-DiT | text2image |
| I2VGen-XL | text2video |
| InstructPix2Pix | image editing |
| Kandinsky 2.1 | text2image, image2image, inpainting, interpolation |
| Kandinsky 2.2 | text2image, image2image, inpainting |
| Kandinsky 3 | text2image, image2image |
| Kolors | text2image |
| Latent Consistency Models | text2image |
| Latent Diffusion | text2image, super-resolution |
| Latte | text2image |
| LEDITS++ | image editing |
| Lumina-T2X | text2image |
| Marigold | depth |
| MultiDiffusion | text2image |
| MusicLDM | text2audio |
| PAG | text2image |
| Paint by Example | inpainting |
| PIA | image2video |
| PixArt-α | text2image |
| PixArt-Σ | text2image |
| Self-Attention Guidance | text2image |
| Semantic Guidance | text2image |
| Shap-E | text-to-3D, image-to-3D |
| Stable Audio | text2audio |
| Stable Cascade | text2image |
| Stable Diffusion | text2image, image2image, depth2image, inpainting, image variation, latent upscaler, super-resolution |
| Stable Diffusion XL | text2image, image2image, inpainting |
| Stable Diffusion XL Turbo | text2image, image2image, inpainting |
| Stable unCLIP | text2image, image variation |
| T2I-Adapter | text2image |
| Text2Video | text2video, video2video |
| Text2Video-Zero | text2video |
| unCLIP | text2image, image variation |
| UniDiffuser | text2image, image2text, image variation, text variation, unconditional image generation, unconditional audio generation |
| Value-guided planning | value guided sampling |
| Wuerstchen | text2image |
DiffusionPipeline
[[autodoc]] DiffusionPipeline - all - call - device - to - components
[[autodoc]] pipelines.StableDiffusionMixin.enable_freeu
[[autodoc]] pipelines.StableDiffusionMixin.disable_freeu
FlaxDiffusionPipeline
[[autodoc]] pipelines.pipeline_flax_utils.FlaxDiffusionPipeline
PushToHubMixin
[[autodoc]] utils.PushToHubMixin
