diagrammergpt.github.io - DiagrammerGPT (2023)

Description: DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning


Example domain paragraphs

Text-to-image (T2I) generation has seen significant growth over the past few years. Despite this, there has been little work on generating diagrams with T2I models. A diagram is a symbolic/schematic representation that explains information using structurally rich and spatially complex visualizations (e.g., a dense combination of related objects, text labels, directional arrows, connection lines, etc.). Existing state-of-the-art T2I models often fail at diagram generation because they lack fine-grained objec

To address this gap, we present DiagrammerGPT, a novel two-stage text-to-diagram generation framework that leverages the layout guidance capabilities of LLMs (e.g., GPT-4) to generate more accurate open-domain, open-platform diagrams. In the first stage, we use LLMs to generate and iteratively refine 'diagram plans' (in a planner-auditor feedback loop) which describe all the entities (objects and text labels), their relationships (arrows or lines), and their bounding box layouts. In the second stage, we us
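
To make the first stage more concrete, here is a minimal Python sketch of what a 'diagram plan' and a planner-auditor loop could look like. It is an illustration under stated assumptions, not the authors' implementation: the class names, the normalized `(x, y, width, height)` box convention, the `audit` and `refine` helpers, and the `toy_planner` stub are all hypothetical. In particular, the auditor here is a simple rule-based check, whereas the framework described above uses an LLM for both roles.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Assumed convention: (x, y, width, height), normalized to a 0..1 canvas.
BBox = Tuple[float, float, float, float]


@dataclass
class Entity:
    name: str
    kind: str  # "object" or "text_label"
    bbox: BBox


@dataclass
class Relationship:
    source: str
    target: str
    kind: str  # "arrow" or "line"


@dataclass
class DiagramPlan:
    entities: List[Entity] = field(default_factory=list)
    relationships: List[Relationship] = field(default_factory=list)


def audit(plan: DiagramPlan) -> List[str]:
    """Hypothetical rule-based auditor: returns a list of problems found."""
    problems = []
    names = {e.name for e in plan.entities}
    for e in plan.entities:
        x, y, w, h = e.bbox
        if w <= 0 or h <= 0 or x < 0 or y < 0 or x + w > 1 or y + h > 1:
            problems.append(f"{e.name}: bounding box {e.bbox} is outside the canvas")
    for r in plan.relationships:
        for end in (r.source, r.target):
            if end not in names:
                problems.append(f"{r.kind} references unknown entity '{end}'")
    return problems


Planner = Callable[[str, List[str]], DiagramPlan]


def refine(planner: Planner, prompt: str, max_rounds: int = 3) -> DiagramPlan:
    """Ask the planner for a plan, then re-plan while the auditor objects."""
    plan = planner(prompt, [])
    for _ in range(max_rounds):
        feedback = audit(plan)
        if not feedback:
            break
        plan = planner(prompt, feedback)
    return plan


def toy_planner(prompt: str, feedback: List[str]) -> DiagramPlan:
    """Stand-in for an LLM planner: fixes its layout once it gets feedback."""
    earth_box = (0.6, 0.4, 0.2, 0.2) if feedback else (0.9, 0.4, 0.2, 0.2)
    return DiagramPlan(
        entities=[
            Entity("sun", "object", (0.1, 0.1, 0.2, 0.2)),
            Entity("earth", "object", earth_box),
        ],
        relationships=[Relationship("sun", "earth", "arrow")],
    )


final_plan = refine(toy_planner, "The sun shines on the earth.")
```

In this toy run the first plan places one box partly off the canvas, the auditor reports it, and the second plan passes. Keeping the plan as plain structured data is what would let a later stage, or a different rendering platform, consume the same layout.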

We hope that our work can inspire further research on the diagram generation capabilities of T2I models and LLMs.
