ICMR 2016


Camera Ready Submission
Long Papers
Short Papers
Special Session Papers
Brave New Ideas Papers
Best Paper Candidates
Student Showcase


Camera Ready manuscript preparation instructions

Please follow the guidelines on the Sheridan Publication page to prepare the Camera Ready version of your accepted manuscript.

The hard deadline for submitting the Camera Ready version of your manuscript is April 26th, 2016.


8 GPU-FV: Realtime Fisher Vector and Its Applications in Video Monitoring

12 Diverse Yet Efficient Retrieval using Locality Sensitive Hashing

25 Homemade TS-Net for Automatic Face Recognition

29 Correlation Autoencoder Hashing for Supervised Cross-Modal Search

38 Action Recognition by Learning Deep Multi-Granular Spatio-Temporal Video Representation

45 Matching User Photos to Online Products with Robust Deep Features

49 Regional Subspace Projection Coding for Image Retrieval

56 The LFM-1b Dataset for Music Retrieval and Recommendation

61 Mouse Activity as an Indicator of Interestingness in Video

72 Video Emotion Recognition with Transferred Deep Feature Encodings

73 Pooling Objects for Recognizing Scenes without Examples

101 Foreground Object Sensing for Saliency Detection

102 Constrained Local Enhancement of Semantic Features by Content-Based Sparsity

109 Scaling Group Testing Similarity Search

132 Automatic Identification of Sports Video Highlights using Viewer Interest Features

137 Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features

148 Diverse Concept-Level Features for Multi-Object Classification

151 ACD: Action Concept Discovery from Image-Sentence Corpora

167 Event Detection with Zero Example: Select the Right and Suppress the Wrong Concepts

193 Multilingual Visual Sentiment Concept Matching


19 Vinereactor: Crowdsourced Spontaneous Facial Expression Data

23 Mirroring Facial Expressions: Evidence from Visual Analysis of Dyadic Interactions

31 Cross-Task Study on Music Information Retrieval Recent Results

42 Sequential Correspondence Hierarchical Dirichlet Processes for Video Data Analysis

46 A Computational Approach to Finding Facial Patterns of a Babyface

51 Video Description Generation Using Audio and Visual Cues

52 Contextual Media Retrieval Using Natural Language Queries

60 Learning Music Embedding with Metadata for Context Aware Recommendation

62 Region Trajectories for Video Semantic Concept Detection

63 Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph

68 Recurrent Support Vector Machines for Audio-Based Multimedia Event Detection

76 Adding Chinese Captions to Images

85 Emotion Recognition from EEG Signals Enhanced by User's Profile

88 Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition

92 Large-Scale E-Commerce Image Retrieval with Top-Weighted Convolutional Neural Networks

94 Web Video Popularity Prediction using Sentiment and Content Visual Features

111 Accurate Aggregation of Local Features by using K-sparse Autoencoder for 3D Model Retrieval

114 Image Annotation using Multi-scale Hypergraph Heat Diffusion Framework

118 Discrete Cross-modal Hashing

123 CNN-based Style Vector for Style Image Retrieval

128 MVC: A Dataset for View-Invariant Clothing Retrieval and Attribute Prediction

135 A Quality Adaptive Multimodal Affect Recognition System for User-Centric Multimedia Indexing

140 Rank Diffusion for Context-Based Image Retrieval

153 Bags of Local Convolutional Features for Scalable Instance Search

158 Interactive Multimodal Learning on 100 Million Images

162 Combining Holistic and Part-based Deep Representations for Computational Painting Categorization

173 Bidirectional Joint Representation Learning with Symmetrical Deep Neural Networks for Multimodal and Crossmodal Applications

176 SSD Technology Enables Dynamic Maintenance of Persistent High-Dimensional Indexes

177 Item-Based Video Recommendation: an Hybrid Approach considering Human Factors

178 Human’s Scene Sketch Understanding

182 Retrieval of Multimedia objects by Fusing Multiple Modalities

189 Incremental Learning for Fine-Grained Image Recognition

194 Spatially Localized Visual Dictionary Learning

201 Semantic Binary Codes

209 On the Effects of Spam Filtering and Incremental Learning for Web-Supervised Visual Concept Classification

220 Semi-supervised Identification of Rarely Appearing Persons in Video by Correcting Weak Labels


80 A Short Survey of Recent Advances in Graph Matching

107 The ImageNet Shuffle: Reorganized Pre-training for Video Event Detection

198 Learning for Traffic State Estimation on Large Scale of Incomplete Data


47 Personalized Privacy-aware Image Classification

222 The science and detection of tilting

224 Using Photos as Micro-Reports of Events

227 Searching for Audio by Sketching Mental Images of Sound – A Brave New Idea for Audio Retrieval in Creative Music Production


25 Homemade TS-Net for Automatic Face Recognition

38 Action Recognition by Learning Deep Multi-Granular Spatio-Temporal Video Representation

73 Pooling Objects for Recognizing Scenes without Examples

193 Multilingual Visual Sentiment Concept Matching


221 Multimodal Analysis of User-Generated Content in Support of Social Media Applications

228 Multimodal Visual Pattern Mining with Convolutional Neural Network

229 Facial Landmark Detection and Tracking for Facial Behavior Analysis


32 SentiCart: Cartography and Geo-contextualization for Multilingual Visual Sentiment

74 Personalized Retrieval and Browsing of Classical Music and Supporting Multimedia Material

90 The Social Picture

108 Watching What and How Politicians Discuss Various Topics - A Large-Scale Video Analytics UI

127 A Multimedia Big Data Retrieval Framework to Detect Dyslexia among Children

142 Multimodal Event Detection and Summarization in Large Scale Image Collections

143 Object-aware Deep Network for Commodity Image Retrieval

152 An Automated End-To-End Pipeline for Fine-Grained Video Annotation using Deep Neural Networks

175 Serendipity-driven Celebrity Video Hyperlinking

207 Complura: Exploring and Leveraging a Large-scale Multilingual Visual Sentiment Ontology
