Polygon-based bounding volume as a spatio-temporal data model for video content access

J. J. Song; Y. C. Park; P. K. Kim; K. S. Kim; F. Golshani; Sethuraman Panchanathan

Computer Science and Engineering

Research output: Contribution to journal › Conference article › peer-review

Abstract

Indexing, retrieving, and delivering the visual and spatio-temporal properties of video objects requires efficient data models and sound operations on those models. However, most object-based video data models address only a single aspect of those properties. In this paper, we present an efficient video object representation method that captures the visual, spatial, and temporal properties of objects in a video in the form of a unified abstract data type. The proposed data type is a polygon mesh, named the video object mesh, which is defined in a spatio-temporal domain. Depending on the application's needs, the contour of an object is modeled as a polygon. Using the contour and color information of the object, content-based triangulation is performed. A video object in a frame is thus modeled as a two-dimensional polygon mesh, and color information is embedded at each vertex of the mesh for later use. Using motion analysis, the corresponding vertex in the adjacent frame is identified and connected to the vertex being analyzed. These steps are repeated until the video object disappears. The result is a three-dimensional polygon mesh that models both location-variant motion and the location-invariant motion that traditional trajectory-based motion models cannot capture. The proposed model is also useful for camera motion analysis, since the surface shape of a video object mesh carries partial information about camera motion.
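
The abstract gives no implementation details, so the following is only a minimal sketch of how a spatio-temporal mesh of this kind might be stored. All names (`Vertex`, `VideoObjectMesh`, `add_frame`, `prev_match`, `displacements`) are hypothetical and do not come from the paper. Contour extraction, triangulation, and motion analysis are assumed to happen elsewhere; the sketch only records their results as per-frame triangles plus edges that link corresponding vertices in adjacent frames.

```python
from dataclasses import dataclass, field


@dataclass
class Vertex:
    frame: int    # temporal coordinate (frame index)
    x: float      # spatial coordinates within the frame
    y: float
    color: tuple  # (r, g, b) sample stored at the vertex


@dataclass
class VideoObjectMesh:
    """Hypothetical container: 2D meshes per frame, linked across frames."""
    vertices: list = field(default_factory=list)
    frame_start: dict = field(default_factory=dict)      # frame -> index of its first vertex
    frame_triangles: dict = field(default_factory=dict)  # frame -> triangles as global vertex indices
    temporal_edges: list = field(default_factory=list)   # (earlier vertex, later vertex) index pairs

    def add_frame(self, frame, points, colors, triangles, prev_match=None):
        """Append one frame's 2D mesh.

        prev_match[i] is the local index, in frame - 1, of the vertex that
        corresponds to local vertex i of this frame. It is assumed to be the
        output of a separate motion-analysis step.
        """
        base = len(self.vertices)
        self.frame_start[frame] = base
        for (x, y), color in zip(points, colors):
            self.vertices.append(Vertex(frame, x, y, color))
        self.frame_triangles[frame] = [
            (base + a, base + b, base + c) for a, b, c in triangles
        ]
        if prev_match is not None:
            prev_base = self.frame_start[frame - 1]
            for i, j in enumerate(prev_match):
                self.temporal_edges.append((prev_base + j, base + i))

    def displacements(self):
        """Return the (dx, dy) displacement along every temporal edge."""
        out = []
        for a, b in self.temporal_edges:
            va, vb = self.vertices[a], self.vertices[b]
            out.append((vb.x - va.x, vb.y - va.y))
        return out


# Usage: one triangle that shifts two pixels to the right between frames.
mesh = VideoObjectMesh()
red = (255, 0, 0)
mesh.add_frame(0, [(0.0, 0.0), (4.0, 0.0), (0.0, 3.0)], [red] * 3, [(0, 1, 2)])
mesh.add_frame(1, [(2.0, 0.0), (6.0, 0.0), (2.0, 3.0)], [red] * 3, [(0, 1, 2)],
               prev_match=[0, 1, 2])
```

Storing the temporal links as explicit edges keeps the per-frame meshes intact while still letting per-vertex motion be read directly from the structure, which is one plausible way to expose motion that a single object trajectory would not show.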

Original language	English (US)
Pages (from-to)	171-182
Number of pages	12
Journal	Proceedings of SPIE - The International Society for Optical Engineering
Volume	4210
State	Published - 2000
Event	Internet Multimedia Management Systems - Boston, MA, USA
Duration	Nov 6 2000 → Nov 7 2000

ASJC Scopus subject areas

Electronic, Optical and Magnetic Materials
Condensed Matter Physics
Computer Science Applications
Applied Mathematics
Electrical and Electronic Engineering

Cite this

@article{e293120ca57445349f29b8afba6fb175,

title = "Polygon-based bounding volume as a spatio-temporal data model for video content access",

abstract = "Indexing, retrieving, and delivering the visual and spatio-temporal properties of video objects requires efficient data models and sound operations on those models. However, most object-based video data models address only a single aspect of those properties. In this paper, we present an efficient video object representation method that captures the visual, spatial, and temporal properties of objects in a video in the form of a unified abstract data type. The proposed data type is a polygon mesh, named the video object mesh, which is defined in a spatio-temporal domain. Depending on the application's needs, the contour of an object is modeled as a polygon. Using the contour and color information of the object, content-based triangulation is performed. A video object in a frame is thus modeled as a two-dimensional polygon mesh, and color information is embedded at each vertex of the mesh for later use. Using motion analysis, the corresponding vertex in the adjacent frame is identified and connected to the vertex being analyzed. These steps are repeated until the video object disappears. The result is a three-dimensional polygon mesh that models both location-variant motion and the location-invariant motion that traditional trajectory-based motion models cannot capture. The proposed model is also useful for camera motion analysis, since the surface shape of a video object mesh carries partial information about camera motion.",

author = "Song, {J. J.} and Park, {Y. C.} and Kim, {P. K.} and Kim, {K. S.} and F. Golshani and Sethuraman Panchanathan",

year = "2000",

language = "English (US)",

volume = "4210",

pages = "171--182",

journal = "Proceedings of SPIE - The International Society for Optical Engineering",

issn = "0277-786X",

publisher = "SPIE",

note = "Internet Multimedia Management Systems ; Conference date: 06-11-2000 Through 07-11-2000",

}

TY - JOUR

T1 - Polygon-based bounding volume as a spatio-temporal data model for video content access

AU - Song, J. J.

AU - Park, Y. C.

AU - Kim, P. K.

AU - Kim, K. S.

AU - Golshani, F.

AU - Panchanathan, Sethuraman

PY - 2000

Y1 - 2000

N2 - Indexing, retrieving, and delivering the visual and spatio-temporal properties of video objects requires efficient data models and sound operations on those models. However, most object-based video data models address only a single aspect of those properties. In this paper, we present an efficient video object representation method that captures the visual, spatial, and temporal properties of objects in a video in the form of a unified abstract data type. The proposed data type is a polygon mesh, named the video object mesh, which is defined in a spatio-temporal domain. Depending on the application's needs, the contour of an object is modeled as a polygon. Using the contour and color information of the object, content-based triangulation is performed. A video object in a frame is thus modeled as a two-dimensional polygon mesh, and color information is embedded at each vertex of the mesh for later use. Using motion analysis, the corresponding vertex in the adjacent frame is identified and connected to the vertex being analyzed. These steps are repeated until the video object disappears. The result is a three-dimensional polygon mesh that models both location-variant motion and the location-invariant motion that traditional trajectory-based motion models cannot capture. The proposed model is also useful for camera motion analysis, since the surface shape of a video object mesh carries partial information about camera motion.

UR - http://www.scopus.com/inward/record.url?scp=0034429941&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034429941&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:0034429941

SN - 0277-786X

VL - 4210

SP - 171

EP - 182

JO - Proceedings of SPIE - The International Society for Optical Engineering

JF - Proceedings of SPIE - The International Society for Optical Engineering

T2 - Internet Multimedia Management Systems

Y2 - 6 November 2000 through 7 November 2000

ER -
