東華大學圖書館 |

語系: 繁體中文

說明(常見問題)

回圖書館首頁

手機版館藏查詢

登入

回首頁

切換: 標籤 | MARC模式 | ISBD

Exploring image and video by classif...

Lu, Le.

FindBook

Google Book

Amazon

博客來

Exploring image and video by classification and clustering on global and local visual features.

紀錄類型:	書目-電子資源 : Monograph/item
正題名/作者:	Exploring image and video by classification and clustering on global and local visual features./
作者:	Lu, Le.
面頁冊數:	157 p.
附註:	Source: Dissertation Abstracts International, Volume: 68-04, Section: B, page: 2437.
Contained By:	Dissertation Abstracts International68-04B.
標題:	Artificial Intelligence. -
電子資源:	http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3262467

Exploring image and video by classification and clustering on global and local visual features.
Lu, Le.

Exploring image and video by classification and clustering on global and local visual features. - 157 p.

Source: Dissertation Abstracts International, Volume: 68-04, Section: B, page: 2437.

Thesis (Ph.D.)--The Johns Hopkins University, 2007.

Images and Videos are complex 2-dimensional spatially correlated data patterns or 3-dimensional spatial-temporally correlated data volumes. Associating the correlations between visual data signals (acquired by imaging sensors) and high-level semantic human knowledge is the core challenging problem of supervised pattern recognition and computer vision. Finding the underlying correlations among large amounts of image or video data themselves is another unsupervised data self-structuring issue. From the previous literature and our own research work using computing machines as tools, there are a lot of efforts trying to address these two tasks statistically, by making good use of recently developed supervised (a.k.a. Classification) and Unsupervised (a.k.a. Clustering) statistical machine learning paradigms.Subjects--Topical Terms:

769149
Artificial Intelligence.

Exploring image and video by classification and clustering on global and local visual features.
LDR:04737nmm 2200277 4500 001 1833383
005 20071004071624.5
008 130610s2007 eng d
035 $a (UMI)AAI3262467
035 $a AAI3262467
040 $a UMI $c UMI
100 1 $a Lu, Le. $3 1922087
245 1 0 $a Exploring image and video by classification and clustering on global and local visual features.
300 $a 157 p.
500 $a Source: Dissertation Abstracts International, Volume: 68-04, Section: B, page: 2437.
500 $a Adviser: Gregory D. Hager.
502 $a Thesis (Ph.D.)--The Johns Hopkins University, 2007.
520 $a Images and Videos are complex 2-dimensional spatially correlated data patterns or 3-dimensional spatial-temporally correlated data volumes. Associating the correlations between visual data signals (acquired by imaging sensors) and high-level semantic human knowledge is the core challenging problem of supervised pattern recognition and computer vision. Finding the underlying correlations among large amounts of image or video data themselves is another unsupervised data self-structuring issue. From the previous literature and our own research work using computing machines as tools, there are a lot of efforts trying to address these two tasks statistically, by making good use of recently developed supervised (a.k.a. Classification) and Unsupervised (a.k.a. Clustering) statistical machine learning paradigms.
520 $a In this dissertation, we are interested on studying four specific computer vision problems involving unsupervised visual data partitioning, discriminative multiple-class classification and online adaptive appearance learning, using statistical machine learning techniques. Our four tasks are based on extracting both global and local visual appearance patterns in general image and video domains. First, we develop a new clustering algorithm to exploit temporal video structures into piecewise elements (a.k.a. video shot segmentation) by combining central and subspace constraints for a unified solution. The proposed algorithm is also demonstrated its applicability to illumination-invariant face clustering. Second, we detect and recognize the spatial-temporal video subvolumes as action units using a trained 3D-surface action model via multi-scale temporal searching, The dynamic 3D-surface based action model is built up as an empirical distribution over the basic static posture elements in the spirit of texton representation. Thus the action matching process is based on the similarity measurement among histograms. The basic posture units are considered as intermediate visual representations learned by a three-staged clustering algorithm figure-segmented image sequences. Third, we train a discriminative-probabilistic multi-modal density classifier to evaluate the responses of 20 semantic material classes from a large collection of challenging home photos. Then the task of learning photo categories is based on the global image features extracted from the material class-specific density response maps over spatial domain. We adopt the classifier combination technique of a set of random weak discriminators to handle the complex multi-modal photo-feature distributions in high dimensional parameter space. Fourth, we propose a unified nonparametric approach for three applications: location based dynamic template video tracking in low to medium resolution, segmentation based object-level image matching across viewpoints, and binary foreground/background segmentation tracking. The main contributions exist in three areas: (1) we demonstrate that an online classification framework allows very flexible image density matching function constructions to address the general data-driven classification problem; (2) we devise an effective dynamic appearance modeling algorithm requiring only simple nonparametric computations (mean, median, standard deviation) for easy implementation; (3) we present a random patch based computational representation for classifying image segments of object-specific matching and tracking which is highly descriptive and discriminative compared with general image segment descriptors. This proposed approach has been extensively demonstrated of being able to maintain an effective object-level appearance models quite robustly over time under a variety of challenging conditions, such as severe changing, occluding and deformable appearance templates and moving cameras.
590 $a School code: 0098.
650 4 $a Artificial Intelligence. $3 769149
650 4 $a Computer Science. $3 626642
690 $a 0800
690 $a 0984
710 2 0 $a The Johns Hopkins University. $3 1017431
773 0 $t Dissertation Abstracts International $g 68-04B.
790 1 0 $a Hager, Gregory D., $e advisor
790 $a 0098
791 $a Ph.D.
792 $a 2007
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3262467