TY - GEN
T1 - Video-based motion expertise analysis in simulation-based surgical training using Hierarchical Dirichlet Process Hidden Markov Model
AU - Zhang, Qiang
AU - Li, Baoxin
PY - 2011
Y1 - 2011
N2 - In simulation-based surgical training, a key task is to rate the performance of the operator, which is done currently by senior surgeons. This is a costly practice and objectively quantifiable assessment metrics are often missing. Researchers have been working towards building automated systems to achieve computational understanding of surgical skills, largely through analysis of motion data captured by video or other sensors. In this paper, we extend the Hierarchical Dirichlet Process Hidden Markov Model (HDP-HMM) for this purpose. We start with detecting spatial temporal interest points from the video capturing the tool motion of an operator, and then generate visual words from the descriptors of those interest points. For each frame, we construct a histogram with the associated interest points, i.e. the "bag of words", and then every video is represented by a sequence of those histograms. For sequences of each motion expertise level, we infer an HDP-HMM model. Finally, the classification of the motion expertise level for a testing sequence is based on choosing a model that maximizes the likelihood of the given sequence. Compared with the other action recognition algorithms, such as kernel SVM, our method leads to a better result. Further, the proposed approach also provides some important cues on the patterns of motion for each expertise level.
AB - In simulation-based surgical training, a key task is to rate the performance of the operator, which is done currently by senior surgeons. This is a costly practice and objectively quantifiable assessment metrics are often missing. Researchers have been working towards building automated systems to achieve computational understanding of surgical skills, largely through analysis of motion data captured by video or other sensors. In this paper, we extend the Hierarchical Dirichlet Process Hidden Markov Model (HDP-HMM) for this purpose. We start with detecting spatial temporal interest points from the video capturing the tool motion of an operator, and then generate visual words from the descriptors of those interest points. For each frame, we construct a histogram with the associated interest points, i.e. the "bag of words", and then every video is represented by a sequence of those histograms. For sequences of each motion expertise level, we infer an HDP-HMM model. Finally, the classification of the motion expertise level for a testing sequence is based on choosing a model that maximizes the likelihood of the given sequence. Compared with the other action recognition algorithms, such as kernel SVM, our method leads to a better result. Further, the proposed approach also provides some important cues on the patterns of motion for each expertise level.
KW - Dirichlet
KW - HDP-HMM
KW - Motion expertise
KW - Surgery simulation
KW - Video analysis
UR - http://www.scopus.com/inward/record.url?scp=84555196181&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84555196181&partnerID=8YFLogxK
U2 - 10.1145/2072545.2072550
DO - 10.1145/2072545.2072550
M3 - Conference contribution
AN - SCOPUS:84555196181
SN - 9781450309912
T3 - MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops - 2011 ACM International Workshop on Medical Multimedia Analysis and Retrieval, MMAR'11
SP - 19
EP - 24
BT - MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops - 2011 ACM International Workshop on Medical Multimedia Analysis and Retrieval, MMAR'11
T2 - 2011 ACM Multimedia Conference, MM'11 and Co-Located Workshops - 2011 ACM International Workshop on Medical Multimedia Analysis and Retrieval, MMAR'11
Y2 - 28 November 2011 through 1 December 2011
ER -