Description: AppAgent: Multimodal Agents as Smartphone Users
22024.2.8 : Added qwen-vl-max (通义千问-VL) as an alternative multi-modal model. The model is currently free to use!
22024.1.31 : Evaluation benchmark used in AppAgent is released on Github.
22024.1.2 : 🔥 Added an optional method for the agent to bring up a grid overlay on the screen to tap/swipe anywhere on the screen.