This is an in-progress project for a custom generative image diffusion model (images of my dog 🐶) using the UNet architecture.

blayyyyyk/momo-net

Overview

This is a custom image generation model built from scratch using PyTorch. The architecture is based on the foundational paper behind Stable Diffusion, which can be found here.

Architecture Planning

  1. Variational Autoencoder (VAE)
    • Encoder
    • Decoder
  2. Diffusion UNet
    • FiLM (optional)
    • Time Embeddings
  3. CLIP Text Encoder
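
As a rough illustration of the "Time Embeddings" item, here is a minimal sketch of a sinusoidal timestep embedding, the form commonly used with diffusion UNets. The README does not say which embedding the project uses, so the sinusoidal form, the function name `timestep_embedding`, and the `max_period` default are all assumptions. It is written in plain Python so it runs without dependencies; in the model itself this would likely be a few PyTorch tensor operations.

```python
import math

def timestep_embedding(t, dim, max_period=10000.0):
    """Map a scalar timestep t to a list of `dim` floats (hypothetical helper)."""
    if dim % 2 != 0:
        raise ValueError("dim must be even")
    half = dim // 2
    # Geometrically spaced frequencies, starting at 1 and decaying toward 1/max_period.
    freqs = [math.exp(-math.log(max_period) * i / half) for i in range(half)]
    args = [t * f for f in freqs]
    # First half of the vector holds sines, second half holds cosines.
    return [math.sin(a) for a in args] + [math.cos(a) for a in args]

emb = timestep_embedding(t=10, dim=8)
print(len(emb))
```

The resulting vector is typically passed through a small MLP and then injected into each UNet block, for example by addition or through FiLM-style scale and shift.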

Training Planning

  1. Train using DDPM
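
To make the DDPM step concrete, the sketch below shows the forward (noising) process and the noise-prediction loss that DDPM training is built on. The linear beta schedule, `T=1000`, and the `1e-4` to `0.02` range are the commonly cited DDPM defaults and are assumptions here, not settings taken from this repository; all function names are hypothetical. Plain Python lists stand in for image or latent tensors.

```python
import math
import random

def linear_beta_schedule(T=1000, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances beta_1..beta_T (assumed defaults)."""
    step = (beta_end - beta_start) / (T - 1)
    return [beta_start + step * i for i in range(T)]

def alpha_bars(betas):
    """Cumulative products of (1 - beta_t), one value per timestep."""
    out, prod = [], 1.0
    for b in betas:
        prod *= 1.0 - b
        out.append(prod)
    return out

def q_sample(x0, t, abar, rng):
    """Draw x_t ~ q(x_t | x_0) and return it with the noise that was used."""
    eps = [rng.gauss(0.0, 1.0) for _ in x0]
    a = math.sqrt(abar[t])
    s = math.sqrt(1.0 - abar[t])
    x_t = [a * x + s * e for x, e in zip(x0, eps)]
    return x_t, eps

def mse(pred, target):
    """Mean squared error between predicted and true noise."""
    return sum((p - q) ** 2 for p, q in zip(pred, target)) / len(target)

rng = random.Random(0)
abar = alpha_bars(linear_beta_schedule())
x0 = [0.5, -0.5, 0.25]              # stand-in for a flattened image or latent
t = rng.randrange(len(abar))        # random timestep for this training example
x_t, eps = q_sample(x0, t, abar, rng)
# A real training step would compute eps_hat = model(x_t, t) and minimize mse(eps_hat, eps).
```

In the planned setup, `x0` would presumably be a VAE latent and the model the diffusion UNet, with the loss backpropagated through the UNet only.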
