Awesome Generative AI

A curated list of tools, models, datasets, papers, and resources related to generative artificial intelligence.

Generative AI refers to algorithms (like GANs, VAEs, and LLMs) that can generate new content—text, images, music, code, video, and more. This list includes tools for researchers, developers, artists, and entrepreneurs exploring this transformative field.

Foundations & Concepts

Introduction to Generative AI (Google) – Overview of what Generative AI is and how it works.
GANs in 50 Lines of Code – Introductory example for Generative Adversarial Networks.
The Illustrated Transformer – Visual explanation of the Transformer model, core to modern generative AI.

Text Generation

GPT-4 – OpenAI’s latest large language model for text generation.
Claude – Constitutional AI-powered chatbot from Anthropic.
Bard – Google’s conversational generative AI model.
LLaMA – Meta’s open large language model.
Mistral – Lightweight open-source LLMs optimized for performance and scale.

Image Generation

Stable Diffusion – Open-source image synthesis model.
DALL·E 3 – Text-to-image model by OpenAI.
Midjourney – High-quality AI art generator from text prompts.
Deep Dream Generator – Tool for generating dream-like AI art using convolutional neural nets.

Music & Audio Generation

Riffusion – Real-time music generation using spectrograms and stable diffusion.
Jukebox (OpenAI) – AI model for generating music with vocals.
Soundraw – AI-powered music generator for creators.
Magenta – Research project by Google exploring music and art generation with ML.

Video Generation

Runway ML – Platform for video creation using generative models like Gen-2.
Pika Labs – AI-generated short-form video from text prompts.
Synthesia – Create AI-generated videos with avatars.
Kaiber – Turn audio and prompts into stylized video.

Code Generation

GitHub Copilot – AI pair programmer powered by OpenAI Codex.
Code Llama – Meta’s code generation model.
Replit Ghostwriter – AI coding assistant for Replit users.
Amazon CodeWhisperer – AI coding companion from AWS.

Multimodal Models

Gemini – Google’s multimodal foundation model.
GPT-4V (Vision) – GPT-4 with visual input capabilities.
CLIP – Connects vision (images) and language (text) via contrastive learning.
BLIP – Bootstrapped Language-Image Pretraining from Salesforce.

Frameworks & Libraries

Hugging Face Transformers – State-of-the-art models and tools for text, vision, and audio.
Diffusers – Library for diffusion models from Hugging Face.
LangChain – Framework for building LLM applications with chaining and memory.
Transformers.js – Run Hugging Face models in the browser with JavaScript.

Datasets

LAION-5B – Open large-scale dataset for training image-text models.
Common Crawl – Petabyte-scale web crawl data for training LLMs.
C4 (Colossal Clean Crawled Corpus) – Text corpus for language model training.
LibriSpeech – ASR dataset with thousands of hours of English speech.

Learning Resources

Awesome Prompt Engineering – Prompts, tools, and learning resources.
Full Stack Deep Learning – Practical course for building and scaling LLM applications.
DeepLearning.AI Courses – AI/ML courses including prompt engineering and diffusion models.
OpenAI Cookbook – Recipes and examples for using OpenAI’s APIs.

Communities

Hugging Face Forums – Community for open LLMs and model development.
r/GenerativeAI – Reddit community on generative models and applications.
Papers with Code – Research papers linked with code, datasets, and benchmarks.

Related Awesome Lists

Contribute

Contributions are welcome!

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.editorconfig		.editorconfig
.gitattributes		.gitattributes
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md
check_readme_links.py		check_readme_links.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Repository files navigation

Awesome Generative AI

Contents

Foundations & Concepts

Text Generation

Image Generation

Music & Audio Generation

Video Generation

Code Generation

Multimodal Models

Frameworks & Libraries

Datasets

Learning Resources

Communities

Related Awesome Lists

Contribute

License

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Languages

Uh oh!

awesomelistsio/awesome-generative-ai

Folders and files

Latest commit

History

Repository files navigation

Awesome Generative AI

Contents

Foundations & Concepts

Text Generation

Image Generation

Music & Audio Generation

Video Generation

Code Generation

Multimodal Models

Frameworks & Libraries

Datasets

Learning Resources

Communities

Related Awesome Lists

Contribute

License

About

Topics

Resources

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Languages

Packages