赵哲焕
开通时间:..
最后更新时间:..
点击次数:
论文类型:期刊论文
发表时间:2017-04-21
发表刊物:Information (Switzerland)
收录刊物:EI
卷号:8
期号:2
摘要:Compound figure detection on figures and associated captions is the first step to making medical figures from biomedical literature available for further analysis. The performance of traditional methods is limited to the choice of hand-engineering features and prior domain knowledge. We train multiple convolutional neural networks (CNNs), long short-term memory (LSTM) networks, and gated recurrent unit (GRU) networks on top of pre-trained word vectors to learn textual features from captions and employ deep CNNs to learn visual features from figures. We then identify compound figures by combining textual and visual prediction. Our proposed architecture obtains remarkable performance in three run types-textual, visual and mixed-and achieves better performance in ImageCLEF2015 and ImageCLEF2016. © 2017 by the authors.