MINI-Net: Multiple Instance Ranking Network for Video Highlight Detection

Fa-Ting Hong      Xuanteng Huang      Wei-Hong Li      Wei-Shi Zheng      
School of Computer Science and Engineering, Sun Yat-sen University


We address the weakly supervised video highlight detectionproblem for learning to detect the segments that are more attractivein training videos given their video event label but without expensivesupervision of manually annotating highlight segments. While averting lo-calizing highlight segments manually, such a weakly supervised modellingis challenging because a video in our daily life could contain highlightsegments with multiple event types,e.g., skiing and surfing. In this work,we propose to cast such weakly supervised video highlight detectionmodelling for a given specific event as a multiple instance ranking net-work (MINI-Net) learning. We consider each video as a bag of segments,and therefore the proposed MINI-Net learns to enforce higher highlightscore for a positive bag that contains highlight segments of a specificevent than the ones for negative bags that are irrelevant. In particular,we form a max-max ranking loss in order to acquire a reliable relativecomparison between the most likely positive segment instance and themost hard negative segment instance. With such max-max ranking loss,our MINI-Net leverages all segment information effectively to acquirea more distinct video feature representation for localizing the highlightsegments of a specific event in a video. The extensive experimental resultson three challenging public benchmarks clearly validate the efficacy ofour multiple instance ranking approach for solving the problem.





            title={MINI-Net: Multiple Instance Ranking Networkfor Video Highlight Detection}, 
            author={Fa-Ting Hong, Xuanteng Huang, Wei-Hong Li, and Wei-Shi Zheng},


This project page is learned from the GFP-GAN, thanks to Xintao Wang.


If you have any question, please contact Fa-Ting Hong at