Annotating a corpus of biomedical research texts: Two models of rhetorical analysis.
Record Type: Language materials, printed : Monograph/item
Title/Author: Annotating a corpus of biomedical research texts: Two models of rhetorical analysis.
Author: White, Barbara Ellen.
Description: 268 p.
Notes: Source: Dissertation Abstracts International, Volume: 72-07, Section: A, page: .
Contained By: Dissertation Abstracts International 72-07A.
Subject: Language, Rhetoric and Composition.
Online resource: http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=NR73559
ISBN: 9780494735596
Thesis (Ph.D.)--The University of Western Ontario (Canada), 2010.
Recent advances in the biomedical sciences have led to an enormous increase in the amount of research literature being published, most of it in electronic form; researchers are finding it difficult to keep up-to-date on all of the new developments in their fields. As a result there is a need to develop automated Text Mining tools to filter and organize data in a way which is useful to researchers. Human-annotated data are often used as the 'gold standard' to train such systems via machine learning methods.
LDR      03233nam 2200301 4500
001      1402078
005      20111019131512.5
008      130515s2010 ||||||||||||||||| ||eng d
020      $a 9780494735596
035      $a (UMI)AAINR73559
035      $a AAINR73559
040      $a UMI $c UMI
100 1    $a White, Barbara Ellen. $3 1681233
245 1 0  $a Annotating a corpus of biomedical research texts: Two models of rhetorical analysis.
300      $a 268 p.
500      $a Source: Dissertation Abstracts International, Volume: 72-07, Section: A, page: .
502      $a Thesis (Ph.D.)--The University of Western Ontario (Canada), 2010.
520      $a Recent advances in the biomedical sciences have led to an enormous increase in the amount of research literature being published, most of it in electronic form; researchers are finding it difficult to keep up-to-date on all of the new developments in their fields. As a result there is a need to develop automated Text Mining tools to filter and organize data in a way which is useful to researchers. Human-annotated data are often used as the 'gold standard' to train such systems via machine learning methods.
520      $a This thesis reports on a project where three annotators applied two Models of rhetoric (argument) to a corpus of on-line biomedical research texts. How authors structure their argumentation and which rhetorical strategies they employ are key to how researchers present their experimental results; thus rhetorical analysis of a text could allow for the extraction of information which is pertinent for a particular researcher's purpose. The first Model stems from previous work in Computational Linguistics; it focuses on differentiating 'new' from 'old' information, and results from analysis of results. The second Model is based on Toulmin's argument structure (1958/2003); its main focus is to identify 'Claims' being made by the authors, but it also differentiates between internal and external evidence, as well as categories of explanation and implications of the current experiment.
520      $a In order to properly train automated systems, and as a gauge of the shared understanding of the argument scheme being applied, inter-annotator agreement should be relatively high. The results of this study show complete (three-way) inter-annotator agreement on an average of 60.5% of the 400 sentences in the final corpus under Model 1, and 39.3% under Model 2. Analyses of the inter-annotator variation are done in order to examine in detail all of the factors involved; these include particular Model categories, individual annotator preferences, errors, and the corpus data itself. In order to reduce this inter-annotator variation, revisions to both Models are suggested; also it is recommended that in the future biomedical domain experts, possibly in tandem with experts in rhetoric, be used as annotators.
520      $a KEY WORDS: annotation, argument, biomedical text, computational linguistics, information extraction, rhetoric, text mining
590      $a School code: 0784.
650 4    $a Language, Rhetoric and Composition. $3 1019205
650 4    $a Biology, Bioinformatics. $3 1018415
650 4    $a Information Science. $3 1017528
690      $a 0681
690      $a 0715
690      $a 0723
710 2    $a The University of Western Ontario (Canada). $3 1017622
773 0    $t Dissertation Abstracts International $g 72-07A.
790      $a 0784
791      $a Ph.D.
792      $a 2010
856 4 0  $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=NR73559
Items
Inventory Number: W9165217
Location Name: Electronic resources
Item Class: 11. Online viewing_V
Material type: E-book
Call number: EB
Usage Class: General use (Normal)
Loan Status: On shelf
No. of reservations: 0