Parallel Computing Framework and GPU Performance Modeling.
Record type:
Bibliographic record, electronic resource: Monograph/item
Title/Author:
Parallel Computing Framework and GPU Performance Modeling.
Author:
Xu, Wenjing.
Physical description:
1 online resource (124 pages)
Notes:
Source: Dissertations Abstracts International, Volume: 84-01, Section: B.
Contained By:
Dissertations Abstracts International, 84-01B.
Subject:
Computer science.
Electronic resource:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=29258433 (click for full text, PQDT)
ISBN:
9798835504060
Thesis (Ph.D.)--Louisiana Tech University, 2022.
Includes bibliographical references
Over the past decades, high-performance computing (HPC) has been widely adopted across industries. In particular, the exponential growth of the GPU (graphics processing unit) is a key development that has helped promote artificial intelligence in real-world use cases. When GPUs are used to accelerate parallel applications, programmability, resource management, and scheduling are all non-trivial to get right for optimized performance. How to exploit GPU resources effectively and improve program performance has therefore become an active research topic.
Benchmarks do not always give a good picture of the performance and details of parallel applications. The variety of hardware devices and the constantly updated parallel programs make performance analysis and modeling even more difficult.
This dissertation makes four main contributions. First, we study a GPU analytical performance model that aims to estimate a suitable number of threads per block for improved performance.
Second, we propose a novel method to alleviate the limitations of the GPU. This method offers a new way to optimize GPU performance at the block-scheduling level.
Third, we propose two abstract models of parallel computing, a computational model and a programming model, which represent various computing paradigms based on Flynn's taxonomy and simplify the characterization of workload distribution. This framework provides a general way to create an analytical performance model.
Finally, we validate the proposed abstract models and demonstrate their usefulness with real-world AI (artificial intelligence) applications on a distributed GPU system. The analytical performance model for a CNN (convolutional neural network) application analyzes performance characteristics on multiple GPUs, enabling users to evaluate their techniques before running applications on target machines.
Electronic reproduction. Ann Arbor, Mich. : ProQuest, 2023
Mode of access: World Wide Web
ISBN: 9798835504060
Subjects--Topical Terms:
Computer science.
Subjects--Index Terms:
Distributed system
Index Terms--Genre/Form:
Electronic books.
LDR 03310nmm a2200409K 4500
001 2360017
005 20230925052759.5
006 m o d
007 cr mn ---uuuuu
008 241011s2022 xx obm 000 0 eng d
020 $a 9798835504060
035 $a (MiAaPQ)AAI29258433
035 $a AAI29258433
040 $a MiAaPQ $b eng $c MiAaPQ $d NTU
100 1 $a Xu, Wenjing. $3 3193066
245 1 0 $a Parallel Computing Framework and GPU Performance Modeling.
264 0 $c 2022
300 $a 1 online resource (124 pages)
336 $a text $b txt $2 rdacontent
337 $a computer $b c $2 rdamedia
338 $a online resource $b cr $2 rdacarrier
500 $a Source: Dissertations Abstracts International, Volume: 84-01, Section: B.
500 $a Advisor: Leangsuksun, Box.
502 $a Thesis (Ph.D.)--Louisiana Tech University, 2022.
504 $a Includes bibliographical references
533 $a Electronic reproduction. $b Ann Arbor, Mich. : $c ProQuest, $d 2023
538 $a Mode of access: World Wide Web
650 4 $a Computer science. $3 523869
650 4 $a Computer engineering. $3 621879
650 4 $a Artificial intelligence. $3 516317
653 $a Distributed system
653 $a GPU
653 $a HPC
653 $a Parallel computing
653 $a Performance modeling
653 $a Performance optimization
655 7 $a Electronic books. $2 lcsh $3 542853
690 $a 0984
690 $a 0464
690 $a 0800
710 2 $a ProQuest Information and Learning Co. $3 783688
710 2 $a Louisiana Tech University. $b Computational Analysis and Modeling. $3 3700630
773 0 $t Dissertations Abstracts International $g 84-01B.
856 4 0 $u http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=29258433 $z click for full text (PQDT)
Holdings:
Barcode: W9482373
Location: Electronic resources
Circulation category: 11. Online viewing_V
Material type: E-book
Call number: EB
Use type: Normal
Loan status: On shelf
Hold status: 0