lixirong.net - Xirong Li – multimedia intelligence

Example domain paragraphs

multimedia intelligence

Our ICMR’20 paper on interactive image captioning is online .

In this paper we study a brand new topic of interactive image captioning with human in the loop. Different from automated image captioning where a given test image is the sole input in the inference stage, we have access to both the test image and a sequence of (incomplete) user-input sentences in the interactive scenario. We formulate the problem as Visually Conditioned Sentence Completion (VCSC). For VCSC, we propose ABD-Cap, asynchronous bidirectional decoding for image caption completion. With ABD-Cap a

Links to lixirong.net (4)