Language:
English
繁體中文
Help
回圖書館首頁
手機版館藏查詢
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Imbalanced Binary Classification On ...
~
Zhang, Hui.
Linked to FindBook
Google Book
Amazon
博客來
Imbalanced Binary Classification On Hospital Readmission Data With Missing Values.
Record Type:
Electronic resources : Monograph/item
Title/Author:
Imbalanced Binary Classification On Hospital Readmission Data With Missing Values./
Author:
Zhang, Hui.
Published:
Ann Arbor : ProQuest Dissertations & Theses, : 2018,
Description:
42 p.
Notes:
Source: Masters Abstracts International, Volume: 80-04.
Contained By:
Masters Abstracts International80-04.
Subject:
Statistics. -
Online resource:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10937360
ISBN:
9780438461406
Imbalanced Binary Classification On Hospital Readmission Data With Missing Values.
Zhang, Hui.
Imbalanced Binary Classification On Hospital Readmission Data With Missing Values.
- Ann Arbor : ProQuest Dissertations & Theses, 2018 - 42 p.
Source: Masters Abstracts International, Volume: 80-04.
Thesis (M.S.)--University of California, Los Angeles, 2018.
This item must not be sold to any third party vendors.
Hospital readmission is a costly, undesirable, and often preventable patient outcome of inpatient care. Early readmission prediction can effectively prevent life-threatening events and reduce healthcare costs. However, imbalanced class distribution and high missing value rates are usually associated with readmission data and need to be handled carefully before building classification models. In this paper, we investigate the prediction of hospital readmission on a dataset with high percentage of missing values and class imbalance problem. Different methods are applied to impute missing values in the categorical variables and numerical variables. In addition, SMOTE (Synthetic Minority Over-sampling Technique) and cost-sensitive learning are combined with different classification methods (LASSO logistic regression, random forest, and gradient boosting) to explore which one will yield the best classification performance on the readmission data. Total misclassification cost and area under ROC curve are used as evaluation metrics for model comparison. Our results show that the SMOTE method causes overfitting on our readmission data and cost-sensitive learning outperforms SMOTE in terms of total misclassification cost.
ISBN: 9780438461406Subjects--Topical Terms:
517247
Statistics.
Imbalanced Binary Classification On Hospital Readmission Data With Missing Values.
LDR
:02289nmm a2200325 4500
001
2208475
005
20191021073445.5
008
201008s2018 ||||||||||||||||| ||eng d
020
$a
9780438461406
035
$a
(MiAaPQ)AAI10937360
035
$a
(MiAaPQ)ucla:17314
035
$a
AAI10937360
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Zhang, Hui.
$3
1019075
245
1 0
$a
Imbalanced Binary Classification On Hospital Readmission Data With Missing Values.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2018
300
$a
42 p.
500
$a
Source: Masters Abstracts International, Volume: 80-04.
500
$a
Publisher info.: Dissertation/Thesis.
500
$a
Advisor: Wu, Yingnian.
502
$a
Thesis (M.S.)--University of California, Los Angeles, 2018.
506
$a
This item must not be sold to any third party vendors.
520
$a
Hospital readmission is a costly, undesirable, and often preventable patient outcome of inpatient care. Early readmission prediction can effectively prevent life-threatening events and reduce healthcare costs. However, imbalanced class distribution and high missing value rates are usually associated with readmission data and need to be handled carefully before building classification models. In this paper, we investigate the prediction of hospital readmission on a dataset with high percentage of missing values and class imbalance problem. Different methods are applied to impute missing values in the categorical variables and numerical variables. In addition, SMOTE (Synthetic Minority Over-sampling Technique) and cost-sensitive learning are combined with different classification methods (LASSO logistic regression, random forest, and gradient boosting) to explore which one will yield the best classification performance on the readmission data. Total misclassification cost and area under ROC curve are used as evaluation metrics for model comparison. Our results show that the SMOTE method causes overfitting on our readmission data and cost-sensitive learning outperforms SMOTE in terms of total misclassification cost.
590
$a
School code: 0031.
650
4
$a
Statistics.
$3
517247
650
4
$a
Bioinformatics.
$3
553671
690
$a
0463
690
$a
0715
710
2
$a
University of California, Los Angeles.
$b
Statistics.
$3
2104005
773
0
$t
Masters Abstracts International
$g
80-04.
790
$a
0031
791
$a
M.S.
792
$a
2018
793
$a
English
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=10937360
based on 0 review(s)
Location:
ALL
電子資源
Year:
Volume Number:
Items
1 records • Pages 1 •
1
Inventory Number
Location Name
Item Class
Material type
Call number
Usage Class
Loan Status
No. of reservations
Opac note
Attachments
W9385024
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
On shelf
0
1 records • Pages 1 •
1
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login