논문명 | Cross Encoder-Decoder Transformer with Global-Local Visual Extractor for Medical Image Captioning |
---|---|
게재일 | 20220213 |
학술지명 | Sensors |
책임교수 | |
논문종류 | 01 SCI |
제1저자 | 이호준 |
교신저자 | 김지희 |
공동저자 | 최현준, 박지은, 채진영, 김지희 |
Impact Factor | 3.84700 |
Keyword | |
In this paper, we propose the Global-Local Visual Extractor (GLVE) and the Cross Encoder-Decoder Transformer (CEDT). The GLVE captures local features as well as global features in images. By extracting both global and local features with the GLVE, our model learns the organ size, skeletal structure, and irregular lesion area. |