Cross-Modal Retrieval With CNN Visual Features: A New Baseline

IEEE Transactions on Cybernetics
Yunchao WeiShuicheng Yan


Recently, convolutional neural network (CNN) visual features have demonstrated their powerful ability as a universal representation for various recognition tasks. In this paper, cross-modal retrieval with CNN visual features is implemented with several classic methods. Specifically, off-the-shelf CNN visual features are extracted from the CNN model, which is pretrained on ImageNet with more than one million images from 1000 object categories, as a generic image representation to tackle cross-modal retrieval. To further enhance the representational ability of CNN visual features, based on the pretrained CNN model on ImageNet, a fine-tuning step is performed by using the open source Caffe CNN library for each target data set. Besides, we propose a deep semantic matching method to address the cross-modal retrieval problem with respect to samples which are annotated with one or multiple labels. Extensive experiments on five popular publicly available data sets well demonstrate the superiority of CNN visual features for cross-modal retrieval.


Nov 2, 2004·Neural Computation·David R HardoonJohn Shawe-Taylor
Jul 29, 2006·Science·G E Hinton, R R Salakhutdinov
Aug 17, 2011·IEEE Transactions on Pattern Analysis and Machine Intelligence·Yi YangYunhe Pan
Jul 26, 2012·IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society·Meng WangXindong Wu
Nov 5, 2013·IEEE Transactions on Cybernetics·Luming ZhangDeng Cai
Jan 25, 2014·IEEE Transactions on Pattern Analysis and Machine Intelligence·Jose Costa PereiraNuno Vasconcelos
Jun 22, 2014·IEEE Transactions on Cybernetics·Jingkuan SongYang Yang
Sep 24, 2014·IEEE Transactions on Cybernetics·Meng WangShuicheng Yan
Nov 25, 2014·IEEE Transactions on Cybernetics·Xianglong LiuXuelong Li
Oct 16, 2015·IEEE Transactions on Cybernetics·Xueliang LiuXuelong Li


Jul 12, 2018·IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society·Jufeng YangMing-Hsuan Yang
Sep 9, 2017·IEEE Transactions on Cybernetics·Guanqun CaoMoncef Gabbouj
Jul 12, 2018·IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society·Yuxin PengYuxin Yuan
Dec 20, 2016·IEEE Transactions on Cybernetics·Hanli WangSam Kwong
Feb 7, 2017·IEEE Transactions on Cybernetics·Xiao ZengChun Qi
Dec 4, 2016·IEEE Transactions on Cybernetics·Yong ZhangMahardhika Pratama
Aug 15, 2018·IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society·Lingyun SongSamar Abbas

Related Concepts

Neural Networks (Anatomic)
2-(3',4'-dihydroxyphenyl)ethylene sulfate
Biological Neural Networks
Research Study
Neural Network Simulation
Acid-labile subunit, Drosophila

Trending Feeds


Coronaviruses encompass a large family of viruses that cause the common cold as well as more serious diseases, such as the ongoing outbreak of coronavirus disease 2019 (COVID-19; formally known as 2019-nCoV). Coronaviruses can spread from animals to humans; symptoms include fever, cough, shortness of breath, and breathing difficulties; in more severe cases, infection can lead to death. This feed covers recent research on COVID-19.

Synthetic Genetic Array Analysis

Synthetic genetic arrays allow the systematic examination of genetic interactions. Here is the latest research focusing on synthetic genetic arrays and their analyses.

Congenital Hyperinsulinism

Congenital hyperinsulinism is caused by genetic mutations resulting in excess insulin secretion from beta cells of the pancreas. Here is the latest research.

Neural Activity: Imaging

Imaging of neural activity in vivo has developed rapidly recently with the advancement of fluorescence microscopy, including new applications using miniaturized microscopes (miniscopes). This feed follows the progress in this growing field.

Chronic Fatigue Syndrome

Chronic fatigue syndrome is a disease characterized by unexplained disabling fatigue; the pathology of which is incompletely understood. Discover the latest research on chronic fatigue syndrome here.

Epigenetic Memory

Epigenetic memory refers to the heritable genetic changes that are not explained by the DNA sequence. Find the latest research on epigenetic memory here.

Cell Atlas of the Human Eye

Constructing a cell atlas of the human eye will require transcriptomic and histologic analysis over the lifespan. This understanding will aid in the study of development and disease. Find the latest research pertaining to the Cell Atlas of the Human Eye here.

Femoral Neoplasms

Femoral Neoplasms are bone tumors that arise in the femur. Discover the latest research on femoral neoplasms here.

STING Receptor Agonists

Stimulator of IFN genes (STING) are a group of transmembrane proteins that are involved in the induction of type I interferon that is important in the innate immune response. The stimulation of STING has been an active area of research in the treatment of cancer and infectious diseases. Here is the latest research on STING receptor agonists.

Related Papers

IEEE Transactions on Pattern Analysis and Machine Intelligence
Yunchao WeiShuicheng Yan
IEEE Transactions on Pattern Analysis and Machine Intelligence
Christoph H LampertStefan Harmeling
IEEE Transactions on Neural Networks and Learning Systems
Li NiuDong Xu
© 2021 Meta ULC. All rights reserved