語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
到查詢結果
[ null ]
切換:
標籤
|
MARC模式
|
ISBD
Downloading data from textual deep W...
~
Yuan, Xiaolei.
FindBook
Google Book
Amazon
博客來
Downloading data from textual deep Web using clustering.
紀錄類型:
書目-語言資料,印刷品 : Monograph/item
正題名/作者:
Downloading data from textual deep Web using clustering./
作者:
Yuan, Xiaolei.
面頁冊數:
69 p.
附註:
Source: Masters Abstracts International, Volume: 46-04, page: 2175.
Contained By:
Masters Abstracts International46-04.
標題:
Computer Science. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=MR34985
ISBN:
9780494349854
Downloading data from textual deep Web using clustering.
Yuan, Xiaolei.
Downloading data from textual deep Web using clustering.
- 69 p.
Source: Masters Abstracts International, Volume: 46-04, page: 2175.
Thesis (M.Sc.)--University of Windsor (Canada), 2007.
Keywords: deep web, hidden web data discovery, data mining, clustering, information retrieval.
ISBN: 9780494349854Subjects--Topical Terms:
626642
Computer Science.
Downloading data from textual deep Web using clustering.
LDR
:01709nam 2200253 a 45
001
963696
005
20110831
008
110831s2007 ||||||||||||||||| ||eng d
020
$a
9780494349854
035
$a
(UMI)AAIMR34985
035
$a
AAIMR34985
040
$a
UMI
$c
UMI
100
1
$a
Yuan, Xiaolei.
$3
1286759
245
1 0
$a
Downloading data from textual deep Web using clustering.
300
$a
69 p.
500
$a
Source: Masters Abstracts International, Volume: 46-04, page: 2175.
502
$a
Thesis (M.Sc.)--University of Windsor (Canada), 2007.
520
$a
Keywords: deep web, hidden web data discovery, data mining, clustering, information retrieval.
520
$a
Deep web is the web that is dynamically generated from data sources such as databases or file systems. Crawling deep web is the process of collecting hidden data by issuing appropriate queries in order to download most of the data. Our main challenge is to select appropriate queries in order to obtain most of the data from a data source. A naive solution, which selects the queries that return most results, is problematic because (1) the results may not cover the data source, and more importantly, (2) the results suffer from high overlap, which makes the acquisition of new data items almost impossible after certain steps. The thesis experiments with four different algorithms to select the queries that minimize the overlap rate: (1) greedy algorithm based on set packing; (2)cluster-based algorithm to remove the queries that result in similar returns.
590
$a
School code: 0115.
650
4
$a
Computer Science.
$3
626642
690
$a
0984
710
2
$a
University of Windsor (Canada).
$3
1018526
773
0
$t
Masters Abstracts International
$g
46-04.
790
$a
0115
791
$a
M.Sc.
792
$a
2007
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=MR34985
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9124037
電子資源
11.線上閱覽_V
電子書
EB W9124037
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入
(1)帳號:一般為「身分證號」;外籍生或交換生則為「學號」。 (2)密碼:預設為帳號末四碼。
帳號
.
密碼
.
請在此電腦上記得個人資料
取消
忘記密碼? (請注意!您必須已在系統登記E-mail信箱方能使用。)