ATTENTION-GUIDED MULTIMODAL GRAPH NETWORKS FOR CROSS-MODAL IMAGE-TEXT MATCHING

Authors

  • KOMURAVELLI MOUNIKA and B.V. RAMNARESH YADAV Author

DOI:

https://doi.org/10.48047/69h5ek37

Keywords:

.

Abstract

The cross-modal retrieval, a technique used to align visual and text features to retrieve similar data from database. This technique always remains a challenging task due to the inherent heterogeneity between visual and text representations.

Downloads

Download data is not yet available.

References

Arora, Nitin, G. Sucharitha, and Subhash C. Sharma. "MVM-LBP: Mean− Variance− Median based LBP for face recognition." International Journal of Information Technology 15.3 (2023): 1231-1242.

Qian, Shengsheng, Tianzhu Zhang, and Changsheng Xu. "Multi-modal multi-view topic-opinion mining for social event analysis." Proceedings of the 24th ACM international conference on Multimedia. 2016.

Downloads

Published

2025-03-10

How to Cite

KOMURAVELLI MOUNIKA and B.V. RAMNARESH YADAV. (2025). ATTENTION-GUIDED MULTIMODAL GRAPH NETWORKS FOR CROSS-MODAL IMAGE-TEXT MATCHING. Cuestiones De Fisioterapia, 54(5), 358-372. https://doi.org/10.48047/69h5ek37