1
0
Fork 0
onnx-web/docs/chain-pipelines.md

210 lines
5.1 KiB
Markdown
Raw Normal View History

# Chain Pipelines
Chain pipelines are a new feature in v0.6 that allows you to run any combination of models on images
of almost any size, by automatically splitting them into smaller tiles as needed. Individual models
are run on each tile, then the results are recombined and passed on to the next stage.
## Contents
- [Chain Pipelines](#chain-pipelines)
- [Contents](#contents)
- [Overview](#overview)
- [Format](#format)
- [Stages](#stages)
- [Blending Stages](#blending-stages)
2023-12-16 23:03:25 +00:00
- [Blend: Denoise](#blend-denoise)
- [Blend: Grid](#blend-grid)
- [Blend: Img2img](#blend-img2img)
2023-12-16 23:03:25 +00:00
- [Blend: Linear](#blend-linear)
- [Blend: Mask](#blend-mask)
- [Correction Stages](#correction-stages)
- [Correct: CodeFormer](#correct-codeformer)
- [Correct: GFPGAN](#correct-gfpgan)
2023-12-16 23:03:25 +00:00
- [Compound Stages](#compound-stages)
- [Highres Stage](#highres-stage)
- [Upscale Stage](#upscale-stage)
- [Persistence Stages](#persistence-stages)
- [Persist: Disk](#persist-disk)
- [Persist: S3](#persist-s3)
- [Reduction Stages](#reduction-stages)
- [Reduce: Crop](#reduce-crop)
- [Reduce: Thumbnail](#reduce-thumbnail)
- [Source Stages](#source-stages)
- [Source: Noise](#source-noise)
2023-12-16 23:03:25 +00:00
- [Source: S3](#source-s3)
- [Source: Txt2img](#source-txt2img)
2023-12-16 23:03:25 +00:00
- [Source: URL](#source-url)
- [Upscaling Stages](#upscaling-stages)
2023-12-16 23:03:25 +00:00
- [Upscale: BSRGAN](#upscale-bsrgan)
- [Upscale: Highres](#upscale-highres)
- [Upscale: Outpaint](#upscale-outpaint)
- [Upscale: Real ESRGAN](#upscale-real-esrgan)
2023-12-16 23:03:25 +00:00
- [Upscale: Simple](#upscale-simple)
- [Upscale: Stable Diffusion](#upscale-stable-diffusion)
2023-12-16 23:03:25 +00:00
- [Upscale: SwinIR](#upscale-swinir)
## Overview
### Format
The `/api/chain` endpoint accepts a chain pipeline in JSON format and adds it to the queue of background jobs.
Pipelines are defined mostly through their `stages`, where each stage specifies a function to be run and the
parameters for that function, including the name of the model to be used.
The output of the pipeline _will not_ automatically be saved to disk, which is the case for the single-stage
endpoints. You must use at least one `persist-*` stage. Persist stages can be placed anywhere in the pipeline
and can also save intermediate output, such as the result of a `source-txt2img` stage before upscaling it.
```json
{
"stages": [
{
"name": "start",
"type": "source-txt2img",
"params": {
"prompt": "a magical wizard"
}
},
{
"name": "expand",
"type": "upscale-outpaint",
"params": {
"border": 256,
"prompt": "a magical wizard in a robe fighting a dragon"
}
},
{
"name": "save-local",
"type": "persist-disk",
"params": {
2023-09-13 22:28:38 +00:00
"tiles": "hd8k"
}
}
]
}
```
The complete schema can be found in [`api/schema.yaml`](../api/schema.yaml) and some example pipelines are available
in [`common/pipelines`](../common/pipelines).
## Stages
### Blending Stages
2023-12-16 23:03:25 +00:00
#### Blend: Denoise
Run [fast non-local means denoising](https://docs.opencv.org/4.8.0/d5/d69/tutorial_py_non_local_means.html) using `cv2`.
2023-12-16 23:03:25 +00:00
#### Blend: Grid
Combine the source images into a grid.
2023-12-16 23:03:25 +00:00
#### Blend: Img2img
Run an img2img pipeline.
2023-12-16 23:03:25 +00:00
#### Blend: Linear
Blend two images using linear interpolation (0.0 is the first image, 1.0 is the second).
2023-12-16 23:03:25 +00:00
#### Blend: Mask
Blend two images using a mask.
### Correction Stages
#### Correct: CodeFormer
Run correction using CodeFormer.
2023-12-16 23:03:25 +00:00
#### Correct: GFPGAN
Run correction using GFPGAN.
2023-12-16 23:03:25 +00:00
### Compound Stages
Not currently available through JSON API.
#### Highres Stage
2023-12-17 04:39:50 +00:00
Prep one or more highres iterations. Each iteration is an upscale stage followed by img2img.
2023-12-16 23:03:25 +00:00
#### Upscale Stage
2023-12-17 04:39:50 +00:00
Prep upscale and correction stages.
2023-12-16 23:03:25 +00:00
### Persistence Stages
#### Persist: Disk
Save all of the sources to disk.
2023-12-16 23:03:25 +00:00
#### Persist: S3
Save all of the sources to an S3 bucket.
2023-12-16 23:03:25 +00:00
### Reduction Stages
#### Reduce: Crop
2023-12-17 04:39:50 +00:00
Crop a section out of each source.
2023-12-16 23:03:25 +00:00
#### Reduce: Thumbnail
2023-12-17 04:39:50 +00:00
Downscale each image into a thumbnail of itself.
2023-12-16 23:03:25 +00:00
### Source Stages
#### Source: Noise
Create a new source using a noise generator.
2023-12-16 23:03:25 +00:00
#### Source: S3
Load a new source from an S3 bucket.
2023-12-16 23:03:25 +00:00
#### Source: Txt2img
Run a txt2img pipeline.
2023-12-16 23:03:25 +00:00
#### Source: URL
Load a new source from a URL.
2023-12-16 23:03:25 +00:00
### Upscaling Stages
2023-12-16 23:03:25 +00:00
#### Upscale: BSRGAN
Upscaling stage using BSRGAN.
2023-12-16 23:03:25 +00:00
#### Upscale: Highres
Upscaling stage using highres.
2023-12-16 23:03:25 +00:00
#### Upscale: Outpaint
Upscaling stage using outpainting. This adds empty borders to the source image, optionally fills them with noise, and
then runs inpainting on those areas.
#### Upscale: Real ESRGAN
Upscaling stage using the Real ESRGAN upscaling models, available in x2 and x4 versions:
- https://github.com/xinntao/Real-ESRGAN/releases
2023-12-16 23:03:25 +00:00
#### Upscale: Simple
Upscaling stage using bilinear or Lanczos upscaling.
2023-12-16 23:03:25 +00:00
#### Upscale: Stable Diffusion
Upscaling stage using the Stable Diffusion x4 upscaling model:
- https://huggingface.co/stabilityai/stable-diffusion-x4-upscaler
- https://huggingface.co/ssube/stable-diffusion-x4-upscaler-onnx
2023-12-16 23:03:25 +00:00
#### Upscale: SwinIR
Upscaling stage using SwinIR.