⚔️ IDEA-Bench ⚔️ : How Far are Generative Models from Professional Designing?

| GitHub | Paper | Dataset |

📜 Rules

  • Choose the better one from two anonymous models.
  • Click "New Round" to start a new round.
  • After you vote, the model names are revealed and your selection can no longer be changed.

⚠️ Data Collection Consent

  • Your votes will be collected for research purposes only.
  • By using this service, you agree to the collection of your votes for research purposes.
  • Your data will be anonymized and will not be used for commercial purposes.

🏆 Arena Elo

Find out which model is best at professional-level image processing tasks! You are welcome to upload your own model's generation results.
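For readers curious how pairwise votes can turn into a ranking, here is a minimal sketch of a standard Elo update applied to one vote. The page does not state how the arena actually computes its scores, so the K-factor of 32, the starting rating of 1000, the 400-point scale, and the names `expected_score` and `update_elo` are all assumptions made for illustration.

```python
def expected_score(rating_a, rating_b, scale=400.0):
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))


def update_elo(ratings, winner, loser, k=32.0, initial=1000.0):
    """Apply a single vote (winner preferred over loser) to a ratings dict.

    k and initial are assumed values, not taken from the arena.
    """
    r_win = ratings.get(winner, initial)
    r_lose = ratings.get(loser, initial)
    # The winner gains in proportion to how unexpected the win was;
    # the loser gives up the same number of points.
    delta = k * (1.0 - expected_score(r_win, r_lose))
    ratings[winner] = r_win + delta
    ratings[loser] = r_lose - delta
    return ratings


# Hypothetical model names; one vote between two unrated models.
ratings = update_elo({}, "model_a", "model_b")
print(ratings)
```

With two equally rated models the expected score is 0.5, so a single vote moves each rating by half the K-factor in opposite directions.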

👇 Vote now!

| Name | Description | Creator | Upload time |
| ChatDiT | A Training-Free Baseline for Task-Agnostic Free-Form Chatting with Diffusion Transformers. | Tongyi Lab | 2024-12-23 15:49 |
| GPT-4o + FLUX.1 [dev] | A new open-source image generation model developed by Black Forest Labs. Use GPT-4o for prompt rephrasing. | Black Forest Labs | 2024-12-23 15:50 |
| GPT-4o + Stable Diffusion 3 Medium | A Multimodal Diffusion Transformer (MMDiT) text-to-image model that features greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency. Use GPT-4o for prompt rephrasing. | Stability AI | 2024-12-24 15:39 |
| GPT-4o + PixArt-Sigma | PixArt-Sigma consists of pure transformer blocks for latent diffusion: It can directly generate 1024px, 2K and 4K images from text prompts within a single sampling process. Use GPT-4o for prompt rephrasing. | Huawei Noah's Ark Lab | 2024-12-24 15:39 |
| GPT-4o + DALLE-3 | DALL-E 3 is the newest text-to-image generation model from OpenAI. Use GPT-4o for prompt rephrasing. | OpenAI | 2024-12-24 15:39 |
| GPT-4o + Emu2 | A generative multimodal model with 37 billion parameters, trained on large-scale multimodal sequences with a unified autoregressive objective. Use GPT-4o for prompt rephrasing. | BAAI | 2024-12-24 15:39 |
| GPT-4o + OmniGen | OmniGen is a unified image generation model that you can use to perform various tasks, including but not limited to text-to-image generation, subject-driven generation, Identity-Preserving Generation, and image-conditioned generation. Use GPT-4o for prompt rephrasing. | BAAI | 2024-12-24 15:39 |