語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
A novel hybrid focused crawling algo...
~
Chen, Yuxin.
FindBook
Google Book
Amazon
博客來
A novel hybrid focused crawling algorithm to build domain-specific collections.
紀錄類型:
書目-語言資料,印刷品 : Monograph/item
正題名/作者:
A novel hybrid focused crawling algorithm to build domain-specific collections./
作者:
Chen, Yuxin.
面頁冊數:
85 p.
附註:
Adviser: Edward A. Fox.
Contained By:
Dissertation Abstracts International68-03B.
標題:
Computer Science. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3256116
A novel hybrid focused crawling algorithm to build domain-specific collections.
Chen, Yuxin.
A novel hybrid focused crawling algorithm to build domain-specific collections.
- 85 p.
Adviser: Edward A. Fox.
Thesis (Ph.D.)--Virginia Polytechnic Institute and State University, 2007.
The Web, containing a large amount of useful information and resources, is expanding rapidly. Collecting domain-specific documents/information from the Web is one of the most important methods to build digital libraries for the scientific community. Focused Crawlers can selectively retrieve Web documents relevant to a specific domain to build collections for domain-specific search engines or digital libraries. Traditional focused crawlers normally adopting the simple Vector Space Model and local Web search algorithms typically only find relevant Web pages with low precision. Recall also often is low, since they explore a limited sub-graph of the Web that surrounds the starting URL set, and will ignore relevant pages outside this sub-graph. In this work, we investigated how to apply an inductive machine learning algorithm and meta-search technique, to the traditional focused crawling process, to overcome the above mentioned problems and to improve performance. We proposed a novel hybrid focused crawling framework based on Genetic Programming (GP) and meta-search. We showed that our novel hybrid framework can be applied to traditional focused crawlers to accurately find more relevant Web documents for the use of digital libraries and domain-specific search engines. The framework is validated through experiments performed on test documents from the Open Directory Project [22]. Our studies have shown that improvement can be achieved relative to the traditional focused crawler if genetic programming and meta-search methods are introduced into the focused crawling process.Subjects--Topical Terms:
626642
Computer Science.
A novel hybrid focused crawling algorithm to build domain-specific collections.
LDR
:02450nam 2200253 a 45
001
969742
005
20110920
008
110921s2007 eng d
035
$a
(UMI)AAI3256116
035
$a
AAI3256116
040
$a
UMI
$c
UMI
100
1
$a
Chen, Yuxin.
$3
1293800
245
1 2
$a
A novel hybrid focused crawling algorithm to build domain-specific collections.
300
$a
85 p.
500
$a
Adviser: Edward A. Fox.
500
$a
Source: Dissertation Abstracts International, Volume: 68-03, Section: B, page: 1717.
502
$a
Thesis (Ph.D.)--Virginia Polytechnic Institute and State University, 2007.
520
$a
The Web, containing a large amount of useful information and resources, is expanding rapidly. Collecting domain-specific documents/information from the Web is one of the most important methods to build digital libraries for the scientific community. Focused Crawlers can selectively retrieve Web documents relevant to a specific domain to build collections for domain-specific search engines or digital libraries. Traditional focused crawlers normally adopting the simple Vector Space Model and local Web search algorithms typically only find relevant Web pages with low precision. Recall also often is low, since they explore a limited sub-graph of the Web that surrounds the starting URL set, and will ignore relevant pages outside this sub-graph. In this work, we investigated how to apply an inductive machine learning algorithm and meta-search technique, to the traditional focused crawling process, to overcome the above mentioned problems and to improve performance. We proposed a novel hybrid focused crawling framework based on Genetic Programming (GP) and meta-search. We showed that our novel hybrid framework can be applied to traditional focused crawlers to accurately find more relevant Web documents for the use of digital libraries and domain-specific search engines. The framework is validated through experiments performed on test documents from the Open Directory Project [22]. Our studies have shown that improvement can be achieved relative to the traditional focused crawler if genetic programming and meta-search methods are introduced into the focused crawling process.
590
$a
School code: 0247.
650
4
$a
Computer Science.
$3
626642
690
$a
0984
710
2 0
$a
Virginia Polytechnic Institute and State University.
$3
1017496
773
0
$t
Dissertation Abstracts International
$g
68-03B.
790
$a
0247
790
1 0
$a
Fox, Edward A.,
$e
advisor
791
$a
Ph.D.
792
$a
2007
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3256116
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9128230
電子資源
11.線上閱覽_V
電子書
EB W9128230
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入