Language:
English
繁體中文
Help
回圖書館首頁
手機版館藏查詢
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Discovering interesting patterns and...
~
The University of Oklahoma., School of Computer Science.
Linked to FindBook
Google Book
Amazon
博客來
Discovering interesting patterns and associations in data streams.
Record Type:
Language materials, printed : Monograph/item
Title/Author:
Discovering interesting patterns and associations in data streams./
Author:
Jiang, Nan.
Description:
167 p.
Notes:
Adviser: Lee Williams.
Contained By:
Dissertation Abstracts International70-04B.
Subject:
Computer Science. -
Online resource:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3354711
ISBN:
9781109118810
Discovering interesting patterns and associations in data streams.
Jiang, Nan.
Discovering interesting patterns and associations in data streams.
- 167 p.
Adviser: Lee Williams.
Thesis (Ph.D.)--The University of Oklahoma, 2009.
A data stream is a sequence of items that arrive in a timely order. Different from data in traditional static databases, data streams are continuous, unbounded, usually come with high speed, and have a data value distribution that often changes with time (Guha, 2001). As more applications such as web transactions, telephone records, and network flows generate a large number of data streams every day, efficient knowledge discovery of data streams is an active and growing research area in data mining with broad applications. Traditional data mining algorithms are developed to work on a complete static dataset and, thus, cannot be applied directly in data stream applications.
ISBN: 9781109118810Subjects--Topical Terms:
626642
Computer Science.
Discovering interesting patterns and associations in data streams.
LDR
:05586nam 2200373 a 45
001
854411
005
20100702
008
100702s2009 ||||||||||||||||| ||eng d
020
$a
9781109118810
035
$a
(UMI)AAI3354711
035
$a
AAI3354711
040
$a
UMI
$c
UMI
100
1
$a
Jiang, Nan.
$3
1020778
245
1 0
$a
Discovering interesting patterns and associations in data streams.
300
$a
167 p.
500
$a
Adviser: Lee Williams.
500
$a
Source: Dissertation Abstracts International, Volume: 70-04, Section: B, page: 2390.
502
$a
Thesis (Ph.D.)--The University of Oklahoma, 2009.
520
$a
A data stream is a sequence of items that arrive in a timely order. Different from data in traditional static databases, data streams are continuous, unbounded, usually come with high speed, and have a data value distribution that often changes with time (Guha, 2001). As more applications such as web transactions, telephone records, and network flows generate a large number of data streams every day, efficient knowledge discovery of data streams is an active and growing research area in data mining with broad applications. Traditional data mining algorithms are developed to work on a complete static dataset and, thus, cannot be applied directly in data stream applications.
520
$a
One area of data mining research is to mine association relationship in a data set. Most of association mining techniques for data streams can be categorized into two types: those developed based on frequent patterns and those developed based on closed patterns. Due to the number of frequent patterns are often huge and redundant, non-informative patterns are contained in frequent patterns. An alternative way is to develop the association mining approaches for data streaming applications based on closed patterns, which generally represent a small subset of all frequent patterns, but provide complete and condensed information. In these researches, the closed pattern mining is the prerequisite condition for non-redundant and informative association mining.
520
$a
In this dissertation, a sliding window technique for dynamic mining of closed patterns in data streams is proposed, and an approach of mining non-redundant and informative associations based on the discovered closed patterns is developed. The closed pattern and relevant association mining techniques are selected research area in this dissertation. First, the closed patterns for a given collection of data are currently the most compact data knowledge that can provide complete support information for all data patterns. Compared with other techniques, the proposed closed pattern mining technique has potential to largely decrease the number of subsequent combinatorial calculations performed on the data patterns. Second, the memory requirement to store the closed patterns and relevant associations is generally lower than the corresponding frequent patterns and associations. In some data streaming applications, memory usage is an important measurement, because in these applications memory usage is the bottleneck for knowledge discovery. Third, the associations generated for data streams are the knowledge used to identify the relations within the data. The discovered relations can find their wide applications in many data streaming environments.
520
$a
Different from the closed pattern mining techniques on traditional databases, which require multiple scans of the entire database, the proposed technique determines the closed patterns with a single scan. It is an incremental mining process; as the sliding window advances, new data transactions enter and old data transactions exit the window. But instead of regenerating closed patterns from the entire window, the proposed technique updates the old set of closed patterns whenever a new transaction arrives and/or an old transaction leaves the sliding window to obtain the current set of closed patterns. This incremental feature allows the user to get the most recent updated closed patterns without rescanning the entire updated database, which saves not only the computation time, but more importantly, the I/O operating time to load and write data from database to memory. Third, the proposed sliding window technique can handle both the insertion and deletion operations independently, which allows the user to adjust the sliding window size in different application environments. Furthermore, the proposed interesting patterns and association mining framework can handle different users' requests at the same time at their specified support and confidence thresholds, and interested input and output patterns.
520
$a
The research includes both theoretical proofs of correctness for the proposed algorithms and simulation experiments to compare the proposed techniques with those existing in the literature using synthetic and real datasets. The utility of the proposed technique is applied to sensor network databases of a traffic management and an environmental monitoring site for missing data estimation purpose.
590
$a
School code: 0169.
650
4
$a
Computer Science.
$3
626642
690
$a
0984
710
2
$a
The University of Oklahoma.
$b
School of Computer Science.
$3
1020777
773
0
$t
Dissertation Abstracts International
$g
70-04B.
790
$a
0169
790
1 0
$a
Antonio, John
$e
committee member
790
1 0
$a
Atiquzzaman, Mohammed
$e
committee member
790
1 0
$a
Dong, Yifei
$e
committee member
790
1 0
$a
Kim, Changwook
$e
committee member
790
1 0
$a
Schwarzkopf, Albert
$e
committee member
790
1 0
$a
Williams, Lee,
$e
advisor
791
$a
Ph.D.
792
$a
2009
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3354711
based on 0 review(s)
Location:
ALL
電子資源
Year:
Volume Number:
Items
1 records • Pages 1 •
1
Inventory Number
Location Name
Item Class
Material type
Call number
Usage Class
Loan Status
No. of reservations
Opac note
Attachments
W9070331
電子資源
11.線上閱覽_V
電子書
EB W9070331
一般使用(Normal)
On shelf
0
1 records • Pages 1 •
1
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login