Language:
English
繁體中文
Help
回圖書館首頁
手機版館藏查詢
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
An Experimental Study of Supervised ...
~
Alfarwan, Abdullah Mana.
Linked to FindBook
Google Book
Amazon
博客來
An Experimental Study of Supervised Machine Learning Techniques for Minor Class Prediction Utilizing Kernel Density Estimation: Factors Impacting Model Performance.
Record Type:
Electronic resources : Monograph/item
Title/Author:
An Experimental Study of Supervised Machine Learning Techniques for Minor Class Prediction Utilizing Kernel Density Estimation: Factors Impacting Model Performance./
Author:
Alfarwan, Abdullah Mana.
Published:
Ann Arbor : ProQuest Dissertations & Theses, : 2024,
Description:
446 p.
Notes:
Source: Dissertations Abstracts International, Volume: 86-01, Section: B.
Contained By:
Dissertations Abstracts International86-01B.
Subject:
Educational tests & measurements. -
Online resource:
https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=31301194
ISBN:
9798383205914
An Experimental Study of Supervised Machine Learning Techniques for Minor Class Prediction Utilizing Kernel Density Estimation: Factors Impacting Model Performance.
Alfarwan, Abdullah Mana.
An Experimental Study of Supervised Machine Learning Techniques for Minor Class Prediction Utilizing Kernel Density Estimation: Factors Impacting Model Performance.
- Ann Arbor : ProQuest Dissertations & Theses, 2024 - 446 p.
Source: Dissertations Abstracts International, Volume: 86-01, Section: B.
Thesis (Ph.D.)--Western Michigan University, 2024.
This dissertation examined classification outcome differences among four popular individual supervised machine learning (ISML) models (logistic regression, decision tree, support vector machine, and multilayer perceptron) when predicting minor class membership within imbalanced datasets. The study context and the theoretical population sampled focus on one aspect of the larger problem of student retention and dropout prediction in higher education (HE): identification.This study differs from current literature by implementing an experimental design approach with simulated student data that closely mirrors HE situational and student data. Specifically, this study tested the predictive ability of the four ISML classification models (CLS) under experimentally manipulated conditions. These included total sample size (TS), minor class proportion (MCP), training-to-testing sample size ratios (TTSS), and the application of bagging techniques during model training (BAG). Using this 4-between, 1-within mixed design, five different outcome measures (precision, recall/sensitivity, specificity, F1-score and AUC) were examined and analyzed individually.For each outcome measure, findings revealed multiple statistically significant interactions among classifier models and design variables. Simple effect analyses of these interactions highlighted how TS, MCP, TTSS, and BAG differentially affect different measures of classification performance such as precision, recall/sensitivity, specificity, F1-score, and AUC. For instance, the presence of interactions involving MCP underscores the importance of informed modeling of class distribution for enhancing overall model predictive capability and performance.Such insights regarding how the experimental variables can critically affect different measures of classification success advances our understanding of how these four ISML models might be optimized for the prediction of student-at-risk status within imbalanced datasets. This dissertation provides a framework for using these or similar ISML models more effectively in HE. It points toward the development of predictive modeling methods that are more useful and perhaps equitable by demonstrating empirically the impact of one of the most challenging aspects of implementing machine learning in HE: maximizing the accurate identification of the minority class. This work contributes to the use of machine learning in HE and will help inform its use in smaller and larger educational research communities by providing strategies for improving the prediction of student dropout.
ISBN: 9798383205914Subjects--Topical Terms:
3168483
Educational tests & measurements.
Subjects--Index Terms:
Individual supervised machine learning
An Experimental Study of Supervised Machine Learning Techniques for Minor Class Prediction Utilizing Kernel Density Estimation: Factors Impacting Model Performance.
LDR
:03871nmm a2200397 4500
001
2402259
005
20241028051510.5
006
m o d
007
cr#unu||||||||
008
251215s2024 ||||||||||||||||| ||eng d
020
$a
9798383205914
035
$a
(MiAaPQ)AAI31301194
035
$a
AAI31301194
035
$a
2402259
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Alfarwan, Abdullah Mana.
$3
3772480
245
1 3
$a
An Experimental Study of Supervised Machine Learning Techniques for Minor Class Prediction Utilizing Kernel Density Estimation: Factors Impacting Model Performance.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2024
300
$a
446 p.
500
$a
Source: Dissertations Abstracts International, Volume: 86-01, Section: B.
500
$a
Advisor: Applegate, Brooks.
502
$a
Thesis (Ph.D.)--Western Michigan University, 2024.
520
$a
This dissertation examined classification outcome differences among four popular individual supervised machine learning (ISML) models (logistic regression, decision tree, support vector machine, and multilayer perceptron) when predicting minor class membership within imbalanced datasets. The study context and the theoretical population sampled focus on one aspect of the larger problem of student retention and dropout prediction in higher education (HE): identification.This study differs from current literature by implementing an experimental design approach with simulated student data that closely mirrors HE situational and student data. Specifically, this study tested the predictive ability of the four ISML classification models (CLS) under experimentally manipulated conditions. These included total sample size (TS), minor class proportion (MCP), training-to-testing sample size ratios (TTSS), and the application of bagging techniques during model training (BAG). Using this 4-between, 1-within mixed design, five different outcome measures (precision, recall/sensitivity, specificity, F1-score and AUC) were examined and analyzed individually.For each outcome measure, findings revealed multiple statistically significant interactions among classifier models and design variables. Simple effect analyses of these interactions highlighted how TS, MCP, TTSS, and BAG differentially affect different measures of classification performance such as precision, recall/sensitivity, specificity, F1-score, and AUC. For instance, the presence of interactions involving MCP underscores the importance of informed modeling of class distribution for enhancing overall model predictive capability and performance.Such insights regarding how the experimental variables can critically affect different measures of classification success advances our understanding of how these four ISML models might be optimized for the prediction of student-at-risk status within imbalanced datasets. This dissertation provides a framework for using these or similar ISML models more effectively in HE. It points toward the development of predictive modeling methods that are more useful and perhaps equitable by demonstrating empirically the impact of one of the most challenging aspects of implementing machine learning in HE: maximizing the accurate identification of the minority class. This work contributes to the use of machine learning in HE and will help inform its use in smaller and larger educational research communities by providing strategies for improving the prediction of student dropout.
590
$a
School code: 0257.
650
4
$a
Educational tests & measurements.
$3
3168483
650
4
$a
Statistics.
$3
517247
650
4
$a
Higher education.
$3
641065
650
4
$a
Educational evaluation.
$3
526425
653
$a
Individual supervised machine learning
653
$a
Classification models
653
$a
Logistic regression
653
$a
Dropout
690
$a
0288
690
$a
0463
690
$a
0443
690
$a
0745
710
2
$a
Western Michigan University.
$b
Educational Leadership, Research, & Technology.
$3
3701509
773
0
$t
Dissertations Abstracts International
$g
86-01B.
790
$a
0257
791
$a
Ph.D.
792
$a
2024
793
$a
English
856
4 0
$u
https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=31301194
based on 0 review(s)
Location:
ALL
電子資源
Year:
Volume Number:
Items
1 records • Pages 1 •
1
Inventory Number
Location Name
Item Class
Material type
Call number
Usage Class
Loan Status
No. of reservations
Opac note
Attachments
W9510579
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
On shelf
0
1 records • Pages 1 •
1
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login