TY - GEN
T1 - Extracting key frames from consumer videos using bi-layer group sparsity
AU - Wang, Zheshen
AU - Kumar, Mrityunjay
AU - Luo, Jiebo
AU - Li, Baoxin
PY - 2011
Y1 - 2011
N2 - Compared to well-edited videos with predefined structures (e.g., news or sports videos), extracting key frames from unconstrained consumer videos remains a much more challenging problem due to their extremely diverse contents (no pre-imposed structure) and uncontrolled video quality (e.g., due to poor lighting or camera shake). In order to exploit spatio-temporal correlation present in the video for key frame extraction, we propose a bilayer group sparse representation in which the input video frames are first segmented into homogeneous patches and group sparsity is imposed at two levels simultaneously: (i) patch-to-frame, and (ii) frame-to-sequence. The grouped sparse coefficients are further combined with frame quality scores to generate key frames. Extensive experiments are performed on videos from actual end users. Results obtained by the proposed approach compare favorably with existing methods to confirm its effectiveness.
AB - Compared to well-edited videos with predefined structures (e.g., news or sports videos), extracting key frames from unconstrained consumer videos remains a much more challenging problem due to their extremely diverse contents (no pre-imposed structure) and uncontrolled video quality (e.g., due to poor lighting or camera shake). In order to exploit spatio-temporal correlation present in the video for key frame extraction, we propose a bilayer group sparse representation in which the input video frames are first segmented into homogeneous patches and group sparsity is imposed at two levels simultaneously: (i) patch-to-frame, and (ii) frame-to-sequence. The grouped sparse coefficients are further combined with frame quality scores to generate key frames. Extensive experiments are performed on videos from actual end users. Results obtained by the proposed approach compare favorably with existing methods to confirm its effectiveness.
KW - Consumer video
KW - Group sparsity
KW - Key frame extraction
UR - http://www.scopus.com/inward/record.url?scp=84455201798&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84455201798&partnerID=8YFLogxK
U2 - 10.1145/2072298.2072051
DO - 10.1145/2072298.2072051
M3 - Conference contribution
AN - SCOPUS:84455201798
SN - 9781450306164
T3 - MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops
SP - 1505
EP - 1508
BT - MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops
T2 - 19th ACM International Conference on Multimedia ACM Multimedia 2011, MM'11
Y2 - 28 November 2011 through 1 December 2011
ER -