Language:
English
繁體中文
Help
回圖書館首頁
手機版館藏查詢
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Text mining biomedical literature fo...
~
The University of Texas at Dallas.
Linked to FindBook
Google Book
Amazon
博客來
Text mining biomedical literature for improving MEDLINE retrieval.
Record Type:
Language materials, printed : Monograph/item
Title/Author:
Text mining biomedical literature for improving MEDLINE retrieval./
Author:
Lin, Yongjing.
Description:
179 p.
Notes:
Adviser: Ying Liu.
Contained By:
Dissertation Abstracts International69-03B.
Subject:
Computer Science. -
Online resource:
http://pqdd.sinica.edu.tw/twdaoeng/servlet/advanced?query=3305851
ISBN:
9780549527039
Text mining biomedical literature for improving MEDLINE retrieval.
Lin, Yongjing.
Text mining biomedical literature for improving MEDLINE retrieval.
- 179 p.
Adviser: Ying Liu.
Thesis (Ph.D.)--The University of Texas at Dallas, 2008.
A major problem faced in biomedical informatics involves how best to present information retrieval results. This dissertation developed an approach that present users with reduced sets of relevant citations together with topic label. A text mining system is designed to group the retrieved citations, rank the citations in each cluster, and generate a set of keywords and MeSH terms to describe the common theme of each cluster.
ISBN: 9780549527039Subjects--Topical Terms:
626642
Computer Science.
Text mining biomedical literature for improving MEDLINE retrieval.
LDR
:03297nam 2200313 a 45
001
856938
005
20100709
008
100709s2008 ||||||||||||||||| ||eng d
020
$a
9780549527039
035
$a
(UMI)AAI3305851
035
$a
AAI3305851
040
$a
UMI
$c
UMI
100
1
$a
Lin, Yongjing.
$3
1023827
245
1 0
$a
Text mining biomedical literature for improving MEDLINE retrieval.
300
$a
179 p.
500
$a
Adviser: Ying Liu.
500
$a
Source: Dissertation Abstracts International, Volume: 69-03, Section: B, page: 1742.
502
$a
Thesis (Ph.D.)--The University of Texas at Dallas, 2008.
520
$a
A major problem faced in biomedical informatics involves how best to present information retrieval results. This dissertation developed an approach that present users with reduced sets of relevant citations together with topic label. A text mining system is designed to group the retrieved citations, rank the citations in each cluster, and generate a set of keywords and MeSH terms to describe the common theme of each cluster.
520
$a
A series of follow-up researches were conducted for better performance of the system. A spectral analysis clustering method was proposed based on the content similarity network techniques for information retrieval systems. The new approach organizes all these search results into categories intelligently. Our experimental results demonstrated that the presented method performs well in real world applications.
520
$a
Automated concept recognition for each cluster is one of the important tasks in our text mining system. The system can perform keyword, key MeSH term and key noun-phrase extraction. Within each cluster, the extraction of keyword and key MeSH term is based on modeling the document-term-matrix as a weighted bipartite graph. A mutual reinforcement principle is used to rank the terms. Our new key noun-phrase extraction method is based on the context-free grammatical rules extracted from the input documents. An existing algorithm called Sequitur is used for constructing the context-free grammar rules that re-represent a sequence as a hierarchical structure. Noun-phrases are extracted from the grammatical rules. The key noun-phrases were identified from top frequency rules without extracting all the grammatical rules. The experimental results showed that our key noun-phrase extraction method is effective in identifying key concepts from documents, and outperforms current widely-used methods.
520
$a
We also explored to rank MEDLINE citations using an existing web ranking algorithm, HITS (Hyperlink-Induced Topic Search) algorithm. We further extended HITS to supervised HITS to rank citations. Our results showed that supervised HITS algorithm significantly outperforms HITS algorithm (p<0.01). Compared with HITS, supervised HITS can improve citation ranking from 15% to more than 59% in almost all the cases we tested. Furthermore, MeSH terms outperforms text words in ranking citations, especially when HITS was applied (p<0.01).
590
$a
School code: 0382.
650
4
$a
Computer Science.
$3
626642
650
4
$a
Engineering, Biomedical.
$3
1017684
690
$a
0541
690
$a
0984
710
2
$a
The University of Texas at Dallas.
$3
1018411
773
0
$t
Dissertation Abstracts International
$g
69-03B.
790
$a
0382
790
1 0
$a
Liu, Ying,
$e
advisor
791
$a
Ph.D.
792
$a
2008
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoeng/servlet/advanced?query=3305851
based on 0 review(s)
Location:
ALL
電子資源
Year:
Volume Number:
Items
1 records • Pages 1 •
1
Inventory Number
Location Name
Item Class
Material type
Call number
Usage Class
Loan Status
No. of reservations
Opac note
Attachments
W9072100
電子資源
11.線上閱覽_V
電子書
EB W9072100
一般使用(Normal)
On shelf
0
1 records • Pages 1 •
1
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login