Language:
English
繁體中文
Help
回圖書館首頁
手機版館藏查詢
Login
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Text Classification: Exploiting the ...
~
Alkhereyf, Sakhar.
Linked to FindBook
Google Book
Amazon
博客來
Text Classification: Exploiting the Social Network.
Record Type:
Electronic resources : Monograph/item
Title/Author:
Text Classification: Exploiting the Social Network./
Author:
Alkhereyf, Sakhar.
Published:
Ann Arbor : ProQuest Dissertations & Theses, : 2021,
Description:
228 p.
Notes:
Source: Dissertations Abstracts International, Volume: 82-06, Section: B.
Contained By:
Dissertations Abstracts International82-06B.
Subject:
Computer science. -
Online resource:
https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28258554
ISBN:
9798557004442
Text Classification: Exploiting the Social Network.
Alkhereyf, Sakhar.
Text Classification: Exploiting the Social Network.
- Ann Arbor : ProQuest Dissertations & Theses, 2021 - 228 p.
Source: Dissertations Abstracts International, Volume: 82-06, Section: B.
Thesis (Ph.D.)--Columbia University, 2021.
This item must not be sold to any third party vendors.
Within the context of social networks, existing methods for document classification tasks typically only capture textual semantics while ignoring the text's metadata, e.g., the users who exchange emails and the communication networks they form. However, some work has shown that incorporating the social network information in addition to information from language is useful for various NLP applications, including sentiment analysis, inferring user attributes, and predicting interpersonal relations.In this thesis, we present empirical studies of incorporating social network information from the underlying communication graphs for various text classification tasks. We show different graph representations for different problems. Also, we introduce social network features extracted from these graphs. We use and extend graph embedding models for text classification.Our contributions are as follows. First, we have annotated large datasets of emails with fine-grained business and personal labels. Second, we propose graph representations for the social networks induced from documents and users and apply them on different text classification tasks. Third, we propose social network features extracted from these structures for documents and users. Fourth, we exploit different methods for modeling the social network of communication for four tasks: email classification into business and personal, overt display of power detection in emails, hierarchical power detection in emails, and Reddit post classification.Our main findings are: incorporating the social network information using our proposed methods improves the classification performance for all of the four tasks, and we beat the state-of-the-art graph embedding based model on the three tasks on email; additionally, for the fourth task (Reddit post classification), we argue that simple methods with the proper representation for the task can outperform a state-of-the-art generic model.
ISBN: 9798557004442Subjects--Topical Terms:
523869
Computer science.
Subjects--Index Terms:
Natural language processing
Text Classification: Exploiting the Social Network.
LDR
:03261nmm a2200445 4500
001
2275878
005
20210401103759.5
008
220723s2021 ||||||||||||||||| ||eng d
020
$a
9798557004442
035
$a
(MiAaPQ)AAI28258554
035
$a
AAI28258554
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Alkhereyf, Sakhar.
$3
3554124
245
1 0
$a
Text Classification: Exploiting the Social Network.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2021
300
$a
228 p.
500
$a
Source: Dissertations Abstracts International, Volume: 82-06, Section: B.
500
$a
Advisor: Rambow, Owen C.
502
$a
Thesis (Ph.D.)--Columbia University, 2021.
506
$a
This item must not be sold to any third party vendors.
520
$a
Within the context of social networks, existing methods for document classification tasks typically only capture textual semantics while ignoring the text's metadata, e.g., the users who exchange emails and the communication networks they form. However, some work has shown that incorporating the social network information in addition to information from language is useful for various NLP applications, including sentiment analysis, inferring user attributes, and predicting interpersonal relations.In this thesis, we present empirical studies of incorporating social network information from the underlying communication graphs for various text classification tasks. We show different graph representations for different problems. Also, we introduce social network features extracted from these graphs. We use and extend graph embedding models for text classification.Our contributions are as follows. First, we have annotated large datasets of emails with fine-grained business and personal labels. Second, we propose graph representations for the social networks induced from documents and users and apply them on different text classification tasks. Third, we propose social network features extracted from these structures for documents and users. Fourth, we exploit different methods for modeling the social network of communication for four tasks: email classification into business and personal, overt display of power detection in emails, hierarchical power detection in emails, and Reddit post classification.Our main findings are: incorporating the social network information using our proposed methods improves the classification performance for all of the four tasks, and we beat the state-of-the-art graph embedding based model on the three tasks on email; additionally, for the fourth task (Reddit post classification), we argue that simple methods with the proper representation for the task can outperform a state-of-the-art generic model.
590
$a
School code: 0054.
650
4
$a
Computer science.
$3
523869
650
4
$a
Mass communications.
$3
3422380
650
4
$a
Web studies.
$3
2122754
650
4
$a
Technical communication.
$3
3172863
650
4
$a
Information technology.
$3
532993
650
4
$a
Social research.
$3
2122687
653
$a
Natural language processing
653
$a
Social network
653
$a
Text classification
653
$a
Semantics
653
$a
Communication networks
653
$a
Email exchange
653
$a
Interpersonal relations
653
$a
Reddit
690
$a
0984
690
$a
0643
690
$a
0489
690
$a
0344
690
$a
0646
690
$a
0708
710
2
$a
Columbia University.
$b
Computer Science.
$3
1679268
773
0
$t
Dissertations Abstracts International
$g
82-06B.
790
$a
0054
791
$a
Ph.D.
792
$a
2021
793
$a
English
856
4 0
$u
https://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=28258554
based on 0 review(s)
Location:
ALL
電子資源
Year:
Volume Number:
Items
1 records • Pages 1 •
1
Inventory Number
Location Name
Item Class
Material type
Call number
Usage Class
Loan Status
No. of reservations
Opac note
Attachments
W9427612
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
On shelf
0
1 records • Pages 1 •
1
Multimedia
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login