Carranza, Aldo Gael,
Privacy and Efficiency in Personalized Decision-Making and Recommendation /
Record type:
Bibliographic - Electronic resource : Monograph/item
Title/Author:
Privacy and Efficiency in Personalized Decision-Making and Recommendation / Aldo Gael Carranza.
Author:
Carranza, Aldo Gael,
Physical description:
1 electronic resource (218 pages)
Notes:
Source: Dissertations Abstracts International, Volume: 85-04, Section: B.
Contained By:
Dissertations Abstracts International 85-04B.
Subject:
Decision making.
Electronic resource:
https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=30615162
ISBN:
9798380469920
In the current digital era, marked by the ubiquity of individual-level data and sophisticated artificial intelligence systems highly capable of exploiting data heterogeneity, data-driven personalized decision-making and recommendation systems have become prevalent in providing customized services and experiences to individuals. These bespoke systems, however, continue to present considerable challenges regarding their deployment in adaptive, heterogeneous, and privacy-sensitive settings. This dissertation presents three research projects that delve into some of these critical issues, offering insights and novel solutions aimed at enhancing the privacy and efficiency of personalized decision-making and recommendation systems.

Chapter 1 of this dissertation investigates the challenges of model learning in contextual bandits for adaptive decision-making and presents a method to improve data efficiency and robustness to model misspecification in this online setting. Contextual bandit algorithms often estimate reward models to inform decision-making. However, true rewards can contain action-independent redundancies that are not relevant for decision-making. We show it is more data-efficient to estimate any function that explains the reward differences between actions, that is, the treatment effects. Motivated by this observation, building on recent work on oracle-based bandit algorithms, we provide a universal reduction of contextual bandits to general-purpose heterogeneous treatment effect estimation, and we design a simple and computationally efficient algorithm based on this reduction. Our theoretical and experimental results demonstrate that heterogeneous treatment effect estimation in contextual bandits offers practical advantages over reward estimation, including more efficient model estimation and greater robustness to model misspecification.

In Chapter 2, we consider heterogeneous data adaptation and privacy in decision-making informed by historical observational data. We consider the problem of learning personalized decision policies on observational bandit feedback data from heterogeneous data sources. Moreover, we examine this problem in the federated setting where a central server aims to learn a policy on the data distributed across the heterogeneous sources without exchanging their raw data due to privacy concerns. We present a federated policy learning algorithm based on aggregation of local policies trained with doubly robust offline policy evaluation and learning strategies. We provide a novel regret analysis for our approach that establishes a finite-sample upper bound on a notion of global regret across a distribution of clients. In addition, for any individual client, we establish a corresponding local regret upper bound characterized by the presence of distribution shift relative to all other clients. We support our theoretical findings with experimental results. Our analysis and experiments provide insights into the value of heterogeneous client participation in federation for policy learning in heterogeneous settings.

Lastly, in Chapter 3, we pivot slightly from the focus of the first two chapters on decision-making systems with online and offline policy learning methods to investigating data privacy in recommender systems. We propose a novel approach for developing privacy-preserving large-scale recommender systems using differentially private (DP) large language models (LLMs) which overcomes certain challenges and limitations in DP training these complex systems. This method is particularly well suited for the emerging area of LLM-based recommender systems, but can be readily employed for any recommender systems that process representations of natural language inputs. Our approach involves using DP training methods to fine-tune a publicly pre-trained LLM on a query generation task.
Language:
English
LDR 05166nmm a22003733i 4500
001 2400455
005 20250522084129.5
006 m o d
007 cr|nu||||||||
008 251215s2023 miu||||||m |||||||eng d
020 $a 9798380469920
035 $a (MiAaPQD)AAI30615162
035 $a (MiAaPQD)STANFORDky391xq2715
035 $a AAI30615162
040 $a MiAaPQD $b eng $c MiAaPQD $e rda
100 1 $a Carranza, Aldo Gael, $e author. $3 3770454
245 1 0 $a Privacy and Efficiency in Personalized Decision-Making and Recommendation / $c Aldo Gael Carranza.
264 1 $a Ann Arbor : $b ProQuest Dissertations & Theses, $c 2023
300 $a 1 electronic resource (218 pages)
336 $a text $b txt $2 rdacontent
337 $a computer $b c $2 rdamedia
338 $a online resource $b cr $2 rdacarrier
500 $a Source: Dissertations Abstracts International, Volume: 85-04, Section: B.
500 $a Advisors: Athey, Susan Committee members: Wager, Stefan; Weintraub, Gabriel; Bent, Stacey F.
502 $b Ph.D. $c Stanford University $d 2023.
546 $a English
590 $a School code: 0212
650 4 $a Decision making. $3 517204
650 4 $a Computer science. $3 523869
650 4 $a Recommender systems. $3 3562220
690 $a 0984
690 $a 0800
710 2 $a Stanford University. $e degree granting institution. $3 3765820
720 1 $a Athey, Susan $e degree supervisor.
773 0 $t Dissertations Abstracts International $g 85-04B.
790 $a 0212
791 $a Ph.D.
792 $a 2023
856 4 0 $u https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=30615162
Holdings (1 item)
Barcode:
W9508775
Location:
Electronic resource
Circulation category:
11.線上閱覽_V (online viewing)
Material type:
E-book
Call number:
EB
Use type:
Normal
Loan status:
On shelf
Reservation status:
0