DOI QR코드

DOI QR Code

Research on AI Painting Generation Technology Based on the [Stable Diffusion]

  • Chenghao Wang (Dept. of Multimedia, Graduate School of Digital Image and Contents Dongguk University) ;
  • Jeanhun Chung (Dept. of Multimedia, Graduate School of Digital Image and Contents Dongguk University)
  • 투고 : 2023.04.10
  • 심사 : 2023.04.20
  • 발행 : 2023.06.30

초록

With the rapid development of deep learning and artificial intelligence, generative models have achieved remarkable success in the field of image generation. By combining the stable diffusion method with Web UI technology, a novel solution is provided for the application of AI painting generation. The application prospects of this technology are very broad and can be applied to multiple fields, such as digital art, concept design, game development, and more. Furthermore, the platform based on Web UI facilitates user operations, making the technology more easily applicable to practical scenarios. This paper introduces the basic principles of Stable Diffusion Web UI technology. This technique utilizes the stability of diffusion processes to improve the output quality of generative models. By gradually introducing noise during the generation process, the model can generate smoother and more coherent images. Additionally, the analysis of different model types and applications within Stable Diffusion Web UI provides creators with a more comprehensive understanding, offering valuable insights for fields such as artistic creation and design.

키워드

참고문헌

  1. Maerten, Anne-Sofie, and Derya Soydaner. "From paintbrush to pixel: A review of deep neural networks in AI-generated art." arXiv preprint arXiv:2302.10913 (2023). DOI: https://doi.org/10.48550/arXiv.2302.10913
  2. Hutson, James, and Morgan Harper-Nichols. "Generative AI and Algorithmic Art: Disrupting the Framing of Meaning and Rethinking the Subject-Object Dilemma." Global Journal of Computer Science and Technology: D, 23.1
  3. Deckers, Niklas, et al. "The Infinite Index: Information Retrieval on Generative Text-To-Image Models." Proceedings of the 2023 Conference on Human Information Interaction and Retrieval. 2023. DOI: https://doi.org/10.1145/3576840.3578327
  4. Stable Diffusion Art, How does Stable Diffusion work? https://stable-diffusion-art.com/how-stable-diffusion-work/#Diffusion_model
  5. Medium, Generate High-Quality Image Using Stable Diffusion Web UI https://betterprogramming.pub/generate-high-quality-image-using-stable-diffusion-webui-de96d6947d85
  6. Hu, Edward J., et al. "Lora: Low-rank adaptation of large language models." arXiv preprint arXiv:2106.09685 (2021). DOI: https://doi.org/10.48550/arXiv.2106.09685
  7. Rombach, Robin, et al. "High-resolution image synthesis with latent diffusion models." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022.
  8. Mazzone, Marian, and Ahmed Elgammal. "Art, creativity, and the potential of artificial intelligence." Arts. Vol. 8. No. 1. MDPI, 2019. DOI: https://doi.org/10.3390/arts8010026
  9. Zhang, Lvmin, and Maneesh Agrawala. "Adding conditional control to text-to-image diffusion models." arXiv preprint arXiv:2302.05543 (2023). DOI: https://doi.org/10.48550/arXiv.2302.05543
  10. Lee, Seongmin, et al. "Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion." arXiv preprint arXiv:2305.03509 (2023). DOI: https://doi.org/10.48550/arXiv.2305.03509