Compiling Deep Learning Kernels to Locality-Aware Dataflow.
Record type: Bibliographic - electronic resource : Monograph/item
Title proper/Author: Compiling Deep Learning Kernels to Locality-Aware Dataflow.
Author: Zhao, Tian.
Publisher: Ann Arbor : ProQuest Dissertations & Theses, 2023
Extent: 110 p.
Note: Source: Dissertations Abstracts International, Volume: 84-12, Section: B.
Contained in: Dissertations Abstracts International, 84-12B.
Subject: Programming languages.
Electronic resource: https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=30462707
ISBN: 9798379651602
Thesis (Ph.D.)--Stanford University, 2023.
Emerging deep learning applications require unprecedented computation and memory capacity. To accelerate these applications, novel processing systems such as dataflow accelerators strive to exploit multiple dimensions of parallelism within deep learning models, e.g., tensor and pipeline parallelism. Although these systems provide ultrahigh performance when fully utilized, compiling deep learning applications to harness their computation capability remains a challenging problem. With recent advances in domain-specific programming languages, accelerator design, and machine learning, we now have the potential to better serve the needs of training and evaluating large deep learning applications on dataflow accelerators through algorithm, software, and hardware co-design.

In this dissertation, I present the design and development of efficient deep learning optimizations and programming frameworks. I present two frameworks: SpatialRNN for accelerating recurrent neural network language models on spatial accelerators and Sigma for expressing and accelerating high-data-reuse deep learning kernels using reconfigurable dataflow accelerators. Our end-to-end evaluation using Sigma demonstrates a 5.4x speedup on kernels encompassing financial applications, traditional machine learning, language modeling and computer vision tasks over an Nvidia V100 GPU accelerator.
LDR 02458nmm a2200361 4500
001 2398288
005 20240812064602.5
006 m o d
007 cr#unu||||||||
008 251215s2023 ||||||||||||||||| ||eng d
020 $a 9798379651602
035 $a (MiAaPQ)AAI30462707
035 $a (MiAaPQ)STANFORDhd752ps4385
035 $a AAI30462707
040 $a MiAaPQ $c MiAaPQ
100 1 $a Zhao, Tian. $3 1057526
245 1 0 $a Compiling Deep Learning Kernels to Locality-Aware Dataflow.
260 1 $a Ann Arbor : $b ProQuest Dissertations & Theses, $c 2023
300 $a 110 p.
500 $a Source: Dissertations Abstracts International, Volume: 84-12, Section: B.
500 $a Advisor: Raina, Priyanka; Re, Christopher; Olukotun, Oyekunle.
502 $a Thesis (Ph.D.)--Stanford University, 2023.
590 $a School code: 0212.
650 4 $a Programming languages. $3 3683658
650 4 $a Deep learning. $3 3554982
650 4 $a Bandwidths. $3 3560998
650 4 $a Optimization techniques. $3 3681622
650 4 $a Neural networks. $3 677449
650 4 $a Design. $3 518875
650 4 $a Keyboards. $3 3681868
650 4 $a Linear algebra. $3 2923381
650 4 $a Computer science. $3 523869
650 4 $a Mathematics. $3 515831
690 $a 0389
690 $a 0729
690 $a 0800
690 $a 0984
690 $a 0405
710 2 $a Stanford University. $3 754827
773 0 $t Dissertations Abstracts International $g 84-12B.
790 $a 0212
791 $a Ph.D.
792 $a 2023
793 $a English
856 4 0 $u https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=30462707
Holdings:
Barcode: W9506608
Location: Electronic resources
Circulation category: 11. Online viewing_V
Material type: E-book
Call number: EB
Use type: Normal
Loan status: On shelf
Holds: 0