Tools and Links for AI Art

Looking to get started with AI art? A good place to start is one of the popular apps like DreamStudio, midjourney, Wombo, or NightCafe. You can get a quick sense of how words and phrases guide image generation. Read up on prompt engineering to improve your results. Then you may want to move on to the Google Colab notebooks linked below, like Deforum. If you have a good NVIDIA GPU of your own, you can also use NMKD Stable Diffusion GUI or Visions of Chaos to run the most popular notebooks locally. If you want to train your own AI models, check out the AI art model training page; for animations, check Stable Diffusion animations.

Text to Image

There are a TON of shared Google Colab notebooks floating around for doing text to image with pre-trained GAN and diffusion models. I’ve been compiling the ones I come across, try out, and find interesting. Please hit me up on Twitter (@pharmapsychotic) if you know a cool notebook that I am missing! Stable Diffusion is the most popular right now.

Stable Diffusion WebUI by automatic1111 – run SD locally with lots of features and extensions
StableStudio – local webui using Stability API for inference
Deforum Stable Diffusion 0.7 – group effort for ultimate SD notebook (discord) (youtube tutorial) (guide)
Disco Diffusion v5.6 by Somnai, gandamu, zippy721 (guide) (new guide) (youtube tutorial)
Huemin Jax Diffusion 2.7 by nshepperd, huemin_art (guide) (stitching guide)
pytti-tools v0.10 by DigThatData and sportsracer
VQGAN+CLIP by remi_durant

[2023/09/15] llamas.ipynb by @osanseviero – QR ControlNet + SD1.5 for optical illusions (tweet)
[2023/04/28] DeepFloyd IF (huggingface) (github)
[2023/04/05] Kandinsky 2.1 Batching+Dynamic prompting Colab by @jrobocat
[2023/04/03] Kandinsky 2.1 (huggingface) (site)
[2023/03/23] Image-to-text-to-image Colab by @jrobocat – batch CLIP Interrogator + SD generations
[2023/03/20] ModelScope text-to-video Colab by @camenduru (youtube) (github)
[2023/03/18] ModelScope text-to-video huggingface space
[2023/03/14] Unidiffuser – unified diffusion framework (github)
[2023/02/20] Stable Diffusion Auto Stitching by @oleg_ai_art (guide)
[2023/02/15] ControlNet – control Stable Diffusion with extra conditioning (youtube) (huggingface) (github) (models)
[2023/02/14] Pix2Pix video with coherence by @johnowhitaker – stylize video inputs!
[2023/01/30] Tune-a-Video – create short text2video sequences (github) (paper)
[2023/01/21] KLMC2 Animation – @DigThatData’s fork with lots of additions
[2023/01/20] InstructPix2Pix – use text instructions to modify images (huggingface)
[2023/01/19] Image Mixer by @Buntworthy – mix up to 5 images together with SD
[2023/01/14] Latent Blending by @j_stelzer – smooth transition between SD latents (github)
[2023/01/10] Custom Diffusion – fast SD finetune with multiple concepts (github)
[2022/12/22] Karlo – unCLIP architecture like DALLE-2 (huggingface) (github)
[2022/12/08] Stable Diffusion KLMC2 Animation by @RiversHaveWings
[2022/11/30] BAOAB-limit sampler – new SD sampler that can also make anims hella fast (paper)
[2022/11/25] Stable Diffusion 2.0 Web UI – by @anzorq (run SD 2.0 in colab using Diffusers)
[2022/11/24] Stable Diffusion 2.0 w Diffusers – by @amrrs (youtube)
[2022/11/08] Midjourney v4 Style – (dreambooth SD finetune on midjourney v4 outputs)
[2022/11/03] All-in-one Private Diffusions Colab – fork and upgrades to WD notebook (website)
[2022/10/25] Fast Dreambooth by TheLastBen (easy fast finetune of stable diffusion in colab)
[2022/10/08] Stable Worlds by @NaxAlpha (create panoramas with SD!)
[2022/09/29] MathRockDiffusion by ethansmith2000 (mods and improvements on Disco) (guide) (cuts)
[2022/09/29] robo_diffusion_v1 by @nousr (a DreamBooth fine tune of stable diffusion)
[2022/09/27] Video Killed The Radio Star Diffusion by @DigThatData (transform music videos from YouTube)
[2022/09/25] fast-stable-diffusion – automatic1111 UI, hlky UI, github (+25% speed and low VRAM)
[2022/09/18] Doohickey Diffusion by aicrumb (stable diffusion with CLIP guidance, perlin init, lots more)
[2022/09/18] optimized colab by neonsecret (stable diffusion with nice gradio gui in colab)
[2022/09/13] Stable Diffusion Batch by visoutre (includes tiled upscaling!) (tutorial)
[2022/09/11] Easy Diffusion by WASasquatch and NOP (stable diffusion with lots of still image features)
[2022/09/07] NMKD Stable Diffusion GUI (nice easy Windows GUI for stable by Noomkrad)
[2022/08/30] Simple Stable Diffusion by @ai_curio (supports prompt weighting)
[2022/08/29] Stable Diffusion WebUi by @altryne (fancy Gradio UI for stable diffusion)
[2022/08/28] Prompt Parrot v2.0 by @KyrickYoung (train gpt2 on prompt list then generate with stable-diff)
[2022/08/23] Stable Diffusion Interpolation by @ygantigravity (animate from own prompt to another!)
[2022/08/23] Deforum Stable Diffusion (discord link) 🔥
[2022/08/23] FunkyHorses Stable Diffusion by Coskaiy/Corran (has neat import from spreadsheet)
[2022/08/23] NOP’s Stable Diffusion Colab v0.19 by NOP#1337
[2022/08/23] Stable Diffusion Lite by @future__art (prompt queueing and seed mining)
[2022/08/23] Interactive notebook for Stable Diffusion
[2022/08/22] Stable Diffusion HuggingFace space by stabilityai
[2022/08/22] Stable Diffusion notebook by @pharmapsychotic 🔥 (easy to use and batch to gdrive) (tutorial)
[2022/08/22] Official Stable Diffusion notebook – requires hugging face account
[2022/08/22] DiscoStream v1.1 by @WASasquatch
[2022/08/20] Disco Diffusion v5.6 with Inpainting by @cut_pow
[2022/08/18] DiscoArt [w/ Batch Prompts + GPT3 generator] by Skquark
[2022/08/16] WAS’s Disco Diffusion v5.6-9 Portrait Generator Playground by WASasquatch
[2022/08/08] Paint Pour Diffusion by @EclecticBeams (diffusion trained on paint pour art)
[2022/07/31] Huemin Jax Diffusion 2.7 August 2022 by @huemin_art
[2022/07/30] CLIP Prior + VQGAN by @RiversHaveWings and @jd_pressman (a new VQGAN notebook 😮)
[2022/07/23] Textile Diffusion by @KaliYuga (diffusion trained on textiles)
[2022/07/21] Floral Diffusion by @jags111 (fine tunes for floral)
[2022/07/18] Liminal Diffusion v1 by @BrainArtLabs (diffusion trained on liminal photographs)
[2022/07/18] DifNESfusion 1.35 by @LufiQ (fork of PixelArtDiffusion with NES dataset)
[2022/07/18] Medieval Diffusion by @KaliYuga (diffusion trained on medieval art)
[2022/07/17] FeiArt_Handpainted CG Diffusion by @FeiArt_AiArt
[2022/07/17] Fantasy Diffusion by @LaVista (diffusion trained on fantasy art)
[2022/07/15] Ukiyo-e Portrait Diffusion by @avantcontra
[2022/07/15] Lithography Diffusion by @KaliYuga (diffusion trained on lithographic landscapes and portraits)
[2022/07/06] Disco v5.2 Dynamic Prompting (dynamic prompt variations – tutorial video)
[2022/07/06] Watercolor Diffusion by @KaliYuga (diffusion trained on watercolor paintings)
[2022/07/05] EnzymeZoo edits to Huemin Jax Diffusion by @EnzymeZoo (brought over masking from Majesty)
see older notebooks in the archive

Upscaling / Super-resolution

Check out the Upscaling Guide

Gigapixel AI by Topaz Labs (costs $99) <- voted #1
Real-ESRGAN (github) <- voted #2
Real-ESRGAN Sber – a nice fine tuned ESRGAN model
chaiNNer – node-based tool that can batch process ESRGAN upscales and more
Cupscale – Windows GUI for ESRGAN
Latent-SR – Nightmare Ai latent diffusion super resolution (slow but nice!)
PASD image super resolution – (github) pixel aware Stable Diffusion
Neural Love – credit based system for diffusion upscaling
Stable Diffusion Upscaler – latest and greatest 🔥
SuperRes Diffusion – Batch upscaling and super resolution with latent-diffusion
SwinIR – Hugging Face space
Upscale Model Database – big set of pretrained models for upscaling different types of content
Waifu2x (github) – designed for anime / manga
WaifuXL – newer and beats Waifu2x in quality
LetsEnhance.io – credit based web service for image super resolution

StyleGAN

[2022/08/23] Painting with StyleGAN by @jmoso13 (tutorial) – use VAE to navigate and animate!
[2022/04/25] StyleGAN-Humans + CLIP modified by Diego Porres to use StyleGAN3
StyleGAN2-ADA – train your own StyleGAN2 model from an image set you create
StyleCLIP – text-driven manipulation of StyleGAN imagery
Structured Dreaming – Styledreams With helpers
Structured Dreaming (CLIP+StyleGAN) by @ArYoMo (tweet)
StyleGAN 2 pretrained models – can use these with Structured Dreaming
StyleGAN 2 awesome pretrained models – BIG collection of models
StyleGAN 3 training – train a StyleGAN and do interpolation video by @dvsch (currently busted)
StyleGAN 3 music video generation – (tweet)
StyleGAN 3 + CLIP by Annas
StyleGAN3 + CLIP by @nshepperd1 and @RiversHaveWings
StyleGANXL + CLIP by Eugenio Herrera and Rodrigo Mello
Lucid Sonic Dreams – animate path through StyleGAN latent space with music (github)

Text

GPT4All Chat – run a local Windows/Linux/Mac app like ChatGPT
oobabooga text-generation-webui – it’s like auto1111 sd web ui but for text models
StableLM space – huggingface space for language model from Stability AI

Goose.ai Playground – can use their playground to generate text with GPT-Neo
GPT Neo Colab notebook – use GPT-neo 1.3B and 2.7B from Google colab
GPT Neo HuggingFace – run GPT-neo 2.7B on HuggingFace
Neuralism Generative Art Prompt Generator – generate prompts to use for text to image
OpenAI GPT3 Playground – generate text with GPT-3 (requires free account)
Textsynth Playground – text completion using large language models

Dalle-2 Prompt Generator – nice site that lets you generate interesting text prompts
Prompt Parrot by @KyrickYoung – train GPT2 on a list of your prompts
MadLib Prompt Generator – makes interesting prompts for you, by @remi_durant
Noodle Soup Prompts v2.1 by WASasquatch
Neuralism Prompt Generator – generative art prompt generator

Video

Image to video

[2023/11/22] Stable Video Colab by @mkshing

Text to video

camenduru text-to-video Colabs – great collection of Zeroscope, potat1, modelscope notebooks
AnimateDiff (colab) (github) – short video clips with your own LoRA
ModelScope (colab) (huggingface) – super fun but prominent shutterstock watermarks
Text2Video-zero (colab) (github) (huggingface) (webui ext) – zero shot video from Stable Diffusion

Interpolation

Video Enhance AI by Topaz Labs – commercial upscaling and frame interpolation <- excellent
AnimationKit AI – video upscaling and interpolation tool <- great
FILM colab – by @KyrickYoung has pause, loops, reverse <- my fave FILM
3D Ken Burns Effect from single image – animated video from 2D image
3D Photo Inpainting – cool 3D effects for 2D images
Animating Pictures with Eulerian Motion Fields – code not out yet, looks like it’ll be awesome
DAIN colab – depth aware interpolation
EbSynth – stylize video by giving it ai or hand painted key frames from video
ESRGAN 4 Video – increase resolution of video with ESRGAN
FILM: Frame Interpolation for Large Motion – (replicate link) smooth interpolation/morphing
Flowframes – free Windows tool with patreon option, uses RIFE and other models
PyTTI-Tools: FILM – @DigThatData’s version of FILM for video frames
RIFE – smooth interpolation of video to increase frame rate
Sequence Frame Interpolation – batch version of FILM
Super Slomo – another way to increase frame rate of video
Video Art and Styling Tools – by @Coskaiy (style transfer, interpolation, superres, and more)

Animation

[2022/11/03] FrameSync.xyz – Automate Deforum Keyframe animations with waveforms
[2022/10/26] Tulpa Prompter by @dreamingtulpa – helper to build animation prompts (tweet)
[2022/08/15] AnimationPreview by @pharmapsychotic – quickly preview Deforum camera animations
[2022/08/04] DALL-E 2.5D Depth Warped Zoom by @deKxi
[2022/03/31] PyDub Audio to Disco Diffusion Keyframe Generator v0.1 by austinhquinn
[2022/02/26] Wiggle animation key frame generator by @zippy731
[2022/02/23] audio-reactive-video – by @vsewall2motion, skip video frames based on volume
Keyframe string generator for AI animation notebooks
Audio to keyframe string generator for AI animation notebooks
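
The keyframe generators above produce schedule strings that animation notebooks read as parameter values over time. Here is a minimal Python sketch of the idea, assuming the `frame:(value)` comma-separated format used by notebooks such as Deforum; the function names and the sine-wave schedule are hypothetical illustrations, so check your notebook's documentation for the exact syntax it accepts.

```python
import math


def keyframe_string(values_by_frame):
    """Format a {frame: value} mapping as 'frame:(value), frame:(value), ...'.

    The 'frame:(value)' layout is an assumption based on Deforum-style
    notebooks; other notebooks may expect a different syntax.
    """
    parts = [
        f"{frame}:({value:.2f})"
        for frame, value in sorted(values_by_frame.items())
    ]
    return ", ".join(parts)


def sine_keyframes(total_frames, step, base, amplitude, period):
    """Sample base + amplitude * sin(2*pi*frame/period) every `step` frames."""
    return {
        frame: base + amplitude * math.sin(2 * math.pi * frame / period)
        for frame in range(0, total_frames, step)
    }


if __name__ == "__main__":
    # A gentle zoom that breathes in and out once over 120 frames.
    zoom = sine_keyframes(total_frames=120, step=30, base=1.0,
                          amplitude=0.05, period=120)
    print(keyframe_string(zoom))
    # prints: 0:(1.00), 30:(1.05), 60:(1.00), 90:(0.95)
```

The printed string can be pasted into a notebook's animation setting (for example a zoom or rotation field) if that notebook uses this format.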

Prompt Engineering

To get good results with CLIP guided diffusion and VQGAN+CLIP you need to find the right words and phrases that will direct the neural network to the content and style you are looking for.
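
One simple way to explore which words and phrases work is to render the same subject with every combination of a few style and modifier phrases and compare the results. The sketch below is only an illustration in Python: `build_prompts` and the example phrases are hypothetical, not taken from any of the tools listed here.

```python
from itertools import product


def build_prompts(subject, styles, modifiers):
    """Return one prompt per (style, modifier) pair for a fixed subject.

    Prompts are joined with commas, a common convention for text-to-image
    prompts; this is an assumption, not a requirement of any model.
    """
    return [
        f"{subject}, {style}, {modifier}"
        for style, modifier in product(styles, modifiers)
    ]


if __name__ == "__main__":
    prompts = build_prompts(
        subject="a lighthouse at dusk",
        styles=["oil painting", "watercolor"],
        modifiers=["highly detailed", "soft lighting"],
    )
    for prompt in prompts:
        print(prompt)
```

Keeping the seed and other settings fixed while changing only the prompt makes it easier to see what each phrase contributes.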

Image to Text

Antarctic-Captions by @dzryk
BLIP image captioning HuggingFace space
CLIP Interrogator by @pharmapsychotic – image to prompt! (huggingface) (lambda) (replicate) 🔥
CLIP prefix captioning inference notebook (github)
LLaVA: Large Language and Vision Assistant – ask a vision model to describe an image
personality-clip by @dzryk
PEZ: Prompts made EZ – prompt from image or long to short prompt (huggingface) (colab)

Prompt Guides

[2023/08/04] Stable Diffusion XL reference library – great guides for SDXL!!
[2022/11/29] Stable Diffusion V2 CFG Scale Comparison – nice ref of samplers and cfg scale
[2022/09/16] krea.ai search stable diffusion prompts and browse by modifiers
[2022/09/07] libraire.ai search 10 million stable diffusion prompts and images
[2022/08/28] Prompt Parrot v2.0 by @KyrickYoung (train gpt2 on prompt list then generate with stable-diff)
[2022/08/24] Lexica stable diffusion prompt search engine
[2022/08/13] Promptomania by @wszp – cool prompt building tool!
[2022/08/08] Stable Diffusion Artist Studies by @proximasan @EErratica @KyrickYoung @surrailabs
[2022/08/08] Stable Diffusion Modifier Studies by @proximasan +
[2022/07/30] Disco Diffusion Portrait Study by @enviraldesign
[2022/07/13] Dall-e 2 prompt book by @GuyP
[2022/03/25] Disco Diffusion Modifiers Study by @KyrickYoung and @sureailabs
[2022/03/21] DiscoDiffusion Model Comparison Study – by @KaliYuga
[2022/03/05] Midjourney Artist Dump – spreadsheet of artists and example renders
[2022/02/26] Disco Diffusion 70+ Artist Studies
A Guide to Writing Prompts for Text to Image – Google Doc guide and advice
CLIP Retrieval Tool – see what kinds of images match strings for CLIP (wait a long time for it to load)
CLIP Prompt Engineering for Generative Art – nice long guide by Matthew McAteer
CLIP + VQGAN keyword comparison by @kingdomakrillic
Artist Studies by @remi_durant – big collection of results using different artist names
Art Movements and Styles as perceived by VQGAN + Clip (Imagenet 16k, ViT-B/32)
Art Movements and Styles as perceived by VQGAN + Clip (Imagenet 16k, ViT-B/16)
Art Movements and Styles as perceived by VQGAN + Clip (Imagenet 16k, RN50x16)
Art Movements and Styles as perceived by VQGAN + CLIP (Imagenet 16k, RN50x4)

Music

You can generate music with AI using OpenAI’s Jukebox. You can prompt Jukebox with an artist and music genre or with a short audio clip in WAV format. It generates new music for you in phases of increasing quality (level_2, level_1, level_0) and takes about 8 hours on Colab.

aiva – ai composition of soundtracks and music
amper – royalty free ai music creation
AudioLDM – text to audio latent diffusion model (huggingface) (replicate) 🆕 🔥
boomy – lets you create and publish music with Ai but they hold the copyright
D3Net-MSS – colab for splitting music into separate clips for drums, vocals, etc
Dance Diffusion – audio diffusion! (guide)
Dance Diffusion Finetuning – fine tune on your own audio dataset
Easy One Click Jukebox – this is my favorite currently
Jukebox Community Build – download this ipynb and put in Colab Notebooks folder on Google Drive to use
lalal.ai – commercial music to stems service
Moises.ai – ai audio separation
mubert – nft friendly music remixed by Ai
Official OpenAI Jukebox – the official notebook from OpenAI
riffusion – stable diffusion fine tuned on audio spectrograms! (web)
Spleeter colab – split music into stems
Zags Jukebox v3.7 – (youtube tutorial)

Other

sdtools.org – cool wiki covering tools and methods related to Stable Diffusion
JAX CLIP Guided Diffusion 2.7 Guide – Google doc from huemin
Zippy’s Disco Diffusion Cheatsheet – Google Doc guide to Disco and all the parameters
EZ Charts – Google Doc Visual Reference Guides for CLIP-Guided Diffusion (see what all the parameters do!)
Hitchhiker’s Guide To The Latent Space – a guide that’s been put together with lots of colab notebooks too
Resources for GAN Artists – another big Google Doc with notebooks and resources for AI art
Way of the TTI Artist – pytti guide
Guide to install Disco Diffusion 5 on Windows with WSL – haven’t tried this yet; the challenge is pytorch3d
Great explanation of VQGAN+CLIP – https://ljvmiranda921.github.io/notebook/2021/08/08/clip-vqgan/
Nice overview of lots of different optimization algorithms (SGD, Adam, RMSProp, etc.) and their differences (also covered in this lecture)
Stanford’s Convolutional Neural Networks class on YouTube – https://www.youtube.com/playlist?list=PL3FW7Lu3i5JvHM8ljYj-zLfQRF3EO8sYv

ClipMatrix – text controlled 3D mesh deformation and stylization
CLIP-Mesh – text to 3D mesh with texture and normal map (still pretty simple and mixed results)
DreamFields – latest text to 3D (github)
ImageSorter by @pharmapsychotic – sort images by similarity (nice for StyleGAN/FILM animated loops)
PIFuHD Colab – Human photo to 3D mesh of the human
Point-E – OpenAI’s text to 3d point clouds (github)
text2mesh – Kaggle notebook for text to 3D mesh
Watermark images – little notebook to add text watermark to images
Zero-Shot Text-Guided Object Generation with Dream Fields – text to 3D render

AI Art Discord Servers

There are now quite a few Discord servers dedicated to AI artists or to discussing text to image techniques.

Ai NFT Discord – AI NFT Consortium. Has especially useful StyleGAN training resources
Disco Diffusion Discord – chat and tech support for the Disco notebook
EleutherAI Discord – researchers and good art room with more technical discussions
Jukebox Community Discord – server for using OpenAI Jukebox for music generation
LAION Discord – group working on replicating a full DALL-E
NeuralismAI Discord – AI art competitions and knowledge exchange
Prompt Sharing Discord – community for sharing text to image prompts
VQGAN+CLIP Discord – home of Instagram #vqganclipcommunitycolab
Zoetrope Central Spoke Discord – support and discussion of the Looking Glass notebook

Learn to Code Generative Ai

The Illustrated Stable Diffusion – really nice overview of Stable Diffusion and the pieces that make it up
AIAIART – really nice ongoing youtube series and discussion in its Discord
Deep Learning for Art, Aesthetics, and Creativity – MIT course available on youtube
Dive into Deep Learning (online, free, interactive)
Deep Learning Foundations to Stable Diffusion – 4 videos from the fast.ai class
Generative Deep Learning: Teaching Machines to Paint, Write, Compose, and Play by David Foster [2019]
Really enjoyed this and it’s a great book! It’s from 2019 so doesn’t cover the very latest like VQGAN, CLIP, guided diffusion though.
HuggingFace Diffusion Models Class – nice coverage of the diffusers library and Stable Diffusion
The Artist in the Machine: The world of AI-powered creativity by Arthur I. Miller [2020]
Not very technical but engaging and inspiring view of many Ai art projects so far.
ml4a.net – online textbook, classes, and learning resources

Cool Apps

No Code AI Art tools

Artbreeder – StyleGAN model with “genes” (directions in latent space) for editing
Artbreeder Collage – CLIP guided diffusion on top of simple collages
Astria.ai – nice and easy Dreambooth training – upload images, and get finetuned SD model
BlueWillow – text to image Discord like MidJourney (appears to use Stable Diffusion finetunes)
CogView – text to image, Chinese model like DALL-E (interview)
conjure.art – new text to image site currently in beta
craiyon – formerly known as dall-e mini, free and makes quick grids of 9 outputs
Dall-e 2 – OpenAI’s text to image
DeepDreamGenerator – deep style, thin style, deep dream
DreamStudio – easy to use text to image from creators of Stable Diffusion 🔥🔥🔥
Genmo – short animations (looks like KLMC2) 🎥
Kaiber – create short animations (looks like Deforum) 🎥
midjourney – text to image via discord bot 🔥🔥🔥
murf.ai – text to speech with Ai voices
neural.love image-upscale – credit based image upscaling service
NightCafe – style transfer, VQGAN, diffusion image generation
Ostagram – style transfer
Playform – style transfer, train StyleGANs, image morphs
pollinations.ai – run lots of popular notebooks
ProsePainter – interactive tool to “paint with words”
runwayml – video editing powered by AI 🎥
snowpixel – text to image and variations
StarryAI – text to image with easy selection of styles
synth.run – text to image app for iOS, Android, and web
tokkingheads – animate portraits with Ai
uberduck.ai – text to speech with lots of different voices
Visions of Chaos – run the popular AI notebooks locally on Windows (see the Machine Learning setup steps)
Wombo – Super fast and free
wzrd.ai – give it a music file and produce animation from big set of pretrained GANs

Create Game Assets

Layer – 2D assets and variations for games
Luma AI Imagine 3D – alpha test of text to 3D models
MirageML – 3D assets and prototyping
Scenario.gg – AI-generated game assets
withpoly – AI-generated textures and materials

Online Galleries to Showcase Art

OnCyber art galleries – https://oncyber.io – Cool 3D art gallery to showcase your art with links to NFT market
Spatial – https://spatial.io

Reposted from: https://pharmapsychotic.com/tools.html

