{"product_id":"build-a-text-to-image-generator-from-scratch-with-transformers-and-diffusions-paperback","title":"Build a Text-To-Image Generator (from Scratch): With Transformers and Diffusions - Paperback","description":"\u003cdiv\u003e\u003cp style=\"text-align: right;\"\u003e\u003ca href=\"https:\/\/reportcopyrightinfringement.com\/\" target=\"_blank\" rel=\"nofollow\"\u003e\u003cb\u003eReport copyright infringement\u003c\/b\u003e\u003c\/a\u003e\u003c\/p\u003e\u003c\/div\u003e\u003cp\u003eby \u003cb\u003eMark Liu\u003c\/b\u003e (Author)\u003c\/p\u003e\u003cp\u003e\u003cb\u003eGet a free eBook (PDF or ePub) from Manning as well as access to the online liveBook format (and its AI assistant that will answer your questions in any language) when you purchase the print book.\u003c\/b\u003e \u003c\/p\u003e\u003cp\u003e\u003c\/p\u003eThis book takes you step-by-step through creating your own AI models that can generate images from text. You'll explore two methods of image generation--vision transformers and diffusion models--and learn vital AI development techniques as you go. \u003cp\u003e\u003c\/p\u003e Dive into the powerful models behind AI image generators. The best way to learn is to build something from scratch, and in this book you'll build your very own diffusion model and vision transformer. As you work through each stage of development, you'll develop an understanding of how these models can be customized, applied, and integrated for impressive multimodal AI. \u003cp\u003e\u003c\/p\u003e\u003ci\u003eBuild a Text-to-Image Generator (from Scratch)\u003c\/i\u003e teaches you how to: \u003cp\u003e\u003c\/p\u003e - Build and train models to generate high resolution images based on text descriptions\u003cbr\u003e - Edit an existing image based on text prompts\u003cbr\u003e - Build and train a model to add captions to images\u003cbr\u003e - Build and train a vision transformer to classify images\u003cbr\u003e - Fine-tune LLMs for downstream tasks such as classification, text or image generation\u003cbr\u003e - Better differentiate real images from deepfakes \u003cp\u003e\u003c\/p\u003e \u003cb\u003eAbout the technology\u003c\/b\u003e \u003cp\u003e\u003c\/p\u003e AI-generated images appear everywhere from high-end advertising to casual social media feeds. Text-to-image tools like Dall-e, Midjourney, and Flux make it easy to create AI art, but how do they work? In this book, you'll find out by building your own text-to-image generator! \u003cp\u003e\u003c\/p\u003e \u003cb\u003eAbout the book\u003c\/b\u003e \u003cp\u003e\u003c\/p\u003e \u003ci\u003eBuild a Text-to-Image Generator (from Scratch) \u003c\/i\u003eexplores both transformer-based image generation and diffusion models. You'll work hands-on to build a pair of simple generation models that can classify images, automatically add captions, reconstruct images, and enhance existing graphics. Author \u003cb\u003eMark Liu\u003c\/b\u003e guides you every step of the way with clear explanations, informative diagrams, and eye-opening examples you can build on your own laptop. \u003cp\u003e\u003c\/p\u003e \u003cb\u003eWhat's inside\u003c\/b\u003e \u003cp\u003e\u003c\/p\u003e - Build a vision transformer to classify images\u003cbr\u003e - Edit images using text prompts\u003cbr\u003e - Fine-tune image models \u003cp\u003e\u003c\/p\u003e\u003cb\u003eAbout the reader\u003c\/b\u003e \u003cp\u003e\u003c\/p\u003e Requires basic knowledge of generative AI models and intermediate Python skills. \u003cp\u003e\u003c\/p\u003e \u003cb\u003eAbout the author\u003c\/b\u003e \u003cp\u003e\u003c\/p\u003e \u003cb\u003eMark Liu\u003c\/b\u003e is the founding director of the Master of Science in Finance program at the University of Kentucky. He is also the author of \u003ci\u003eLearn Generative AI with PyTorch\u003c\/i\u003e. \u003cp\u003e\u003c\/p\u003e \u003cb\u003eTable of Contents\u003c\/b\u003e \u003cp\u003e\u003c\/p\u003e Part 1\u003cbr\u003e 1 A tale of two models: Transformers and diffusions\u003cbr\u003e 2 Build a transformer\u003cbr\u003e 3 Classify images with a vision transformer\u003cbr\u003e 4 Add captions to images\u003cbr\u003e Part 2\u003cbr\u003e 5 Generate images with diffusion models\u003cbr\u003e 6 Control what images to generate in diffusion models\u003cbr\u003e 7 Generate high-resolution images with diffusion models\u003cbr\u003e Part 3\u003cbr\u003e 8 CLIP: A model to measure the similarity between image and text\u003cbr\u003e 9 Text-to-image generation with latent diffusion\u003cbr\u003e 10 A deep dive into Stable Diffusion\u003cbr\u003e Part 4\u003cbr\u003e 11 VQGAN: Convert images into sequences of integers\u003cbr\u003e 12 A minimal implementation of DALL-E\u003cbr\u003e Part 5\u003cbr\u003e 13 New developments and challenges in text-to-image generation\u003cbr\u003e A Installing PyTorch and enabling GPU training locally and in Colab\u003ch3\u003eAuthor Biography\u003c\/h3\u003e\u003cp\u003eDr. \u003cb\u003eMark Liu\u003c\/b\u003e is a tenured finance professor and the founding director of the Master of Science in Finance program at the University of Kentucky. He has more than 20 years of coding experience, a Ph.D. in finance from Boston College.\u003c\/p\u003e\n            \u003cdiv\u003e\n\u003cstrong\u003eNumber of Pages:\u003c\/strong\u003e 360\u003c\/div\u003e\n            \u003cdiv\u003e\n\u003cstrong\u003eDimensions:\u003c\/strong\u003e 1.11 x 9.22 x 7.33 IN\u003c\/div\u003e\n            \u003cdiv\u003e\n\u003cstrong\u003ePublication Date:\u003c\/strong\u003e December 30, 2025\u003c\/div\u003e\n            ","brand":"BooksCloud","offers":[{"title":"Default Title","offer_id":53376883392819,"sku":"9781633435421","price":93.38,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0300\/5595\/6612\/files\/UjLHhUsCEM9781633435421.webp?v=1779310451","url":"https:\/\/www.vysn.com\/products\/build-a-text-to-image-generator-from-scratch-with-transformers-and-diffusions-paperback","provider":"VYSN","version":"1.0","type":"link"}