    Research on Causal Image-Text Retrieval Embedded with Consensus Knowledge[J]. Chinese Journal of Engineering. doi: 10.13374/j.issn2095-9389.2023.05.28.001

    Research on Causal Image-Text Retrieval Embedded with Consensus Knowledge

    doi: 10.13374/j.issn2095-9389.2023.05.28.001
    • Available Online: 2023-08-16
    • Cross-modal image-text retrieval is the task of retrieving the corresponding images or texts given a query in the other modality (a text or an image). Traditional retrieval paradigms rely on deep learning to extract feature representations from images and texts and map them to a common semantic space for semantic matching. However, this approach relies more on correlations in the data than on the real causal relationships behind the data, and it faces challenges in the representation and interpretability of high-level semantic information. We therefore introduce causal inference and consensus knowledge on top of deep learning and propose a causal image-text retrieval method embedded with consensus knowledge. Specifically, we incorporate causal interventions into the visual feature extraction module, replacing correlational relationships with causal relationships to learn causal visual features that capture underlying knowledge. These causal visual features are then concatenated with the original visual features to obtain the final visual feature representation. To address the insufficient text feature representation of this method, we adopt a more powerful text feature extraction model, BERT, and embed consensus knowledge shared between the two modalities for consensus-level representation learning of image-text features. Experimental results on the MS-COCO dataset demonstrate that our approach achieves consistent improvements in Recall@k and mR on bidirectional image-text retrieval tasks.
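
    The reported metrics can be illustrated with a minimal sketch. It is not the authors' evaluation code. It assumes a square similarity matrix in which query i has exactly one correct candidate, at index i (MS-COCO actually pairs each image with several captions, so this is a simplification), and it assumes that mR is the mean of the Recall@K values over both retrieval directions. The function names `recall_at_k` and `mean_recall` are hypothetical.

```python
import numpy as np

def recall_at_k(sim, ks=(1, 5, 10)):
    """Recall@K in percent for a query-by-candidate similarity matrix.

    Assumption: the correct candidate for query i is candidate i.
    """
    sim = np.asarray(sim, dtype=float)
    # Score of the correct candidate for each query, as a column vector.
    correct = np.diag(sim)[:, None]
    # Zero-based rank = number of candidates scored strictly higher.
    ranks = (sim > correct).sum(axis=1)
    return {k: 100.0 * float(np.mean(ranks < k)) for k in ks}

def mean_recall(sim, ks=(1, 5, 10)):
    """Assumed definition of mR: average Recall@K over both directions."""
    sim = np.asarray(sim, dtype=float)
    i2t = recall_at_k(sim, ks)    # rows as queries (e.g. image to text)
    t2i = recall_at_k(sim.T, ks)  # columns as queries (e.g. text to image)
    values = list(i2t.values()) + list(t2i.values())
    return sum(values) / len(values)

if __name__ == "__main__":
    # Toy similarity matrix for three image-text pairs (made-up numbers).
    toy = np.array([[0.9, 0.1, 0.2],
                    [0.8, 0.3, 0.1],
                    [0.2, 0.1, 0.7]])
    print(recall_at_k(toy, ks=(1, 2)))
    print(mean_recall(toy, ks=(1, 2)))
```

    In the toy matrix, the second row ranks its correct candidate second, so Recall@1 for that direction falls below 100 while Recall@2 does not.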

       


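      Corresponding author: Chen Bin, bchen63@163.com
      • 1. School of Materials Science and Engineering, Shenyang University of Chemical Technology, Shenyang 110142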
