語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
到查詢結果
[ null ]
切換:
標籤
|
MARC模式
|
ISBD
Discovering interesting patterns and...
~
The University of Oklahoma., School of Computer Science.
FindBook
Google Book
Amazon
博客來
Discovering interesting patterns and associations in data streams.
紀錄類型:
書目-語言資料,印刷品 : Monograph/item
正題名/作者:
Discovering interesting patterns and associations in data streams./
作者:
Jiang, Nan.
面頁冊數:
167 p.
附註:
Adviser: Lee Williams.
Contained By:
Dissertation Abstracts International70-04B.
標題:
Computer Science. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3354711
ISBN:
9781109118810
Discovering interesting patterns and associations in data streams.
Jiang, Nan.
Discovering interesting patterns and associations in data streams.
- 167 p.
Adviser: Lee Williams.
Thesis (Ph.D.)--The University of Oklahoma, 2009.
A data stream is a sequence of items that arrive in a timely order. Different from data in traditional static databases, data streams are continuous, unbounded, usually come with high speed, and have a data value distribution that often changes with time (Guha, 2001). As more applications such as web transactions, telephone records, and network flows generate a large number of data streams every day, efficient knowledge discovery of data streams is an active and growing research area in data mining with broad applications. Traditional data mining algorithms are developed to work on a complete static dataset and, thus, cannot be applied directly in data stream applications.
ISBN: 9781109118810Subjects--Topical Terms:
626642
Computer Science.
Discovering interesting patterns and associations in data streams.
LDR
:05586nam 2200373 a 45
001
854411
005
20100702
008
100702s2009 ||||||||||||||||| ||eng d
020
$a
9781109118810
035
$a
(UMI)AAI3354711
035
$a
AAI3354711
040
$a
UMI
$c
UMI
100
1
$a
Jiang, Nan.
$3
1020778
245
1 0
$a
Discovering interesting patterns and associations in data streams.
300
$a
167 p.
500
$a
Adviser: Lee Williams.
500
$a
Source: Dissertation Abstracts International, Volume: 70-04, Section: B, page: 2390.
502
$a
Thesis (Ph.D.)--The University of Oklahoma, 2009.
520
$a
A data stream is a sequence of items that arrive in a timely order. Different from data in traditional static databases, data streams are continuous, unbounded, usually come with high speed, and have a data value distribution that often changes with time (Guha, 2001). As more applications such as web transactions, telephone records, and network flows generate a large number of data streams every day, efficient knowledge discovery of data streams is an active and growing research area in data mining with broad applications. Traditional data mining algorithms are developed to work on a complete static dataset and, thus, cannot be applied directly in data stream applications.
520
$a
One area of data mining research is to mine association relationship in a data set. Most of association mining techniques for data streams can be categorized into two types: those developed based on frequent patterns and those developed based on closed patterns. Due to the number of frequent patterns are often huge and redundant, non-informative patterns are contained in frequent patterns. An alternative way is to develop the association mining approaches for data streaming applications based on closed patterns, which generally represent a small subset of all frequent patterns, but provide complete and condensed information. In these researches, the closed pattern mining is the prerequisite condition for non-redundant and informative association mining.
520
$a
In this dissertation, a sliding window technique for dynamic mining of closed patterns in data streams is proposed, and an approach of mining non-redundant and informative associations based on the discovered closed patterns is developed. The closed pattern and relevant association mining techniques are selected research area in this dissertation. First, the closed patterns for a given collection of data are currently the most compact data knowledge that can provide complete support information for all data patterns. Compared with other techniques, the proposed closed pattern mining technique has potential to largely decrease the number of subsequent combinatorial calculations performed on the data patterns. Second, the memory requirement to store the closed patterns and relevant associations is generally lower than the corresponding frequent patterns and associations. In some data streaming applications, memory usage is an important measurement, because in these applications memory usage is the bottleneck for knowledge discovery. Third, the associations generated for data streams are the knowledge used to identify the relations within the data. The discovered relations can find their wide applications in many data streaming environments.
520
$a
Different from the closed pattern mining techniques on traditional databases, which require multiple scans of the entire database, the proposed technique determines the closed patterns with a single scan. It is an incremental mining process; as the sliding window advances, new data transactions enter and old data transactions exit the window. But instead of regenerating closed patterns from the entire window, the proposed technique updates the old set of closed patterns whenever a new transaction arrives and/or an old transaction leaves the sliding window to obtain the current set of closed patterns. This incremental feature allows the user to get the most recent updated closed patterns without rescanning the entire updated database, which saves not only the computation time, but more importantly, the I/O operating time to load and write data from database to memory. Third, the proposed sliding window technique can handle both the insertion and deletion operations independently, which allows the user to adjust the sliding window size in different application environments. Furthermore, the proposed interesting patterns and association mining framework can handle different users' requests at the same time at their specified support and confidence thresholds, and interested input and output patterns.
520
$a
The research includes both theoretical proofs of correctness for the proposed algorithms and simulation experiments to compare the proposed techniques with those existing in the literature using synthetic and real datasets. The utility of the proposed technique is applied to sensor network databases of a traffic management and an environmental monitoring site for missing data estimation purpose.
590
$a
School code: 0169.
650
4
$a
Computer Science.
$3
626642
690
$a
0984
710
2
$a
The University of Oklahoma.
$b
School of Computer Science.
$3
1020777
773
0
$t
Dissertation Abstracts International
$g
70-04B.
790
$a
0169
790
1 0
$a
Antonio, John
$e
committee member
790
1 0
$a
Atiquzzaman, Mohammed
$e
committee member
790
1 0
$a
Dong, Yifei
$e
committee member
790
1 0
$a
Kim, Changwook
$e
committee member
790
1 0
$a
Schwarzkopf, Albert
$e
committee member
790
1 0
$a
Williams, Lee,
$e
advisor
791
$a
Ph.D.
792
$a
2009
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3354711
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9070331
電子資源
11.線上閱覽_V
電子書
EB W9070331
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入
(1)帳號:一般為「身分證號」;外籍生或交換生則為「學號」。 (2)密碼:預設為帳號末四碼。
帳號
.
密碼
.
請在此電腦上記得個人資料
取消
忘記密碼? (請注意!您必須已在系統登記E-mail信箱方能使用。)