Description: DESCRIPTION META TAG
keywords should be placed here (40)
The most advanced text-to-image (T2I) models require significant training costs (e.g., millions of GPU hours), seriously hindering the fundamental innovation for the AIGC community while increasing CO 2 emissions. This paper introduces PIXART-α, a Transformer-based T2I diffusion model whose image generation quality is competitive with state-of-the-art image generators (e.g., Imagen, SDXL, and even Midjourney), reaching near-commercial application standards. Additionally, it supports high-resolution image sy
Comparisons of CO 2 emissions and training cost among T2I generators. PIXART-α achieves an exceptionally low training cost of $26,000. Compared to RAPHAEL, our CO 2 emissions and training costs are merely 1.1% and 0.85%, respectively.
Total clicks: