A collection of resources and papers on Diffusion Models
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
HunyuanVideo: A Systematic Framework For Large Video Generation Model Training
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Diffusion model papers, survey, and taxonomy
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
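PyTorch Practical Tutorial (2nd Edition): whether you are a complete beginner, applying PyTorch to CV, NLP, or LLM projects, or moving on to production engineering and deployment, it is all covered here. With the help of this book, readers should be able to pick up PyTorch with ease and become excellent deep learning engineers.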
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Lumina-T2X is a unified framework for Text to Any Modality Generation
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
A general fine-tuning kit geared toward diffusion models.