Why sample-then-blend in `tiled_encode`?

Hello, thanks for the great work, I learned a lot from the paper!

I have a question about the `tiled_encode` implementation in the VAE encoder.

In [tile_parallel.py](https://github.com/SandAI-org/MAGI-1/blob/6ff822e74ded50611e81e1d0e115146b5c4dd2a5/inference/infra/parallelism/tile_parallel.py#L303), each tile is encoded separately using `self.encode_fn` with its default parameters. From the defaults [here](https://github.com/SandAI-org/MAGI-1/blob/6ff822e74ded50611e81e1d0e115146b5c4dd2a5/inference/model/vae/vae_model.py#L259), `sample_posterior=True`, which means the encoding step [produces the distribution parameters](https://github.com/SandAI-org/MAGI-1/blob/6ff822e74ded50611e81e1d0e115146b5c4dd2a5/inference/model/vae/vae_model.py#L272) and then [samples from it](https://github.com/SandAI-org/MAGI-1/blob/6ff822e74ded50611e81e1d0e115146b5c4dd2a5/inference/model/vae/vae_model.py#L275).

Later, in [tile_parallel.py](https://github.com/SandAI-org/MAGI-1/blob/6ff822e74ded50611e81e1d0e115146b5c4dd2a5/inference/infra/parallelism/tile_parallel.py#L314), these sampled latents are blended across tiles. This is a sample-then-blend approach, whereas many other VAE tiling implementations follow a blend-then-sample pattern (blending means/variances first, then sampling once from the blended distribution).

Is there a specific reason you chose sample-then-blend instead of blend-then-sample? I’m curious if it was for performance, simplicity, or a particular modeling choice.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Why sample-then-blend in `tiled_encode`? #103

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Why sample-then-blend in tiled_encode? #103

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

Why sample-then-blend in `tiled_encode`? #103