Annika M. Schoene
Deep learning with knowledge graphs for fine-grained emotion classification in text
Schoene, Annika M.
Authors
Contributors
Dr Nina Dethlefs N.Dethlefs@hull.ac.uk
Supervisor
Alexander P. (Alexander Phillip) Turner
Supervisor
Abstract
This PhD thesis investigates two key challenges in the area of fine-grained emotion detection in textual data. More specifically, this work focuses on (i) the accurate classification of emotion in tweets and (ii) improving the learning of representations from knowledge graphs using graph convolutional neural networks.
The first part of this work outlines the task of emotion keyword detection in tweets and introduces a new resource called the EEK dataset. Tweets have previously been categorised as short sequences or sentence-level sentiment analysis, and it could be argued that this should no longer be the case, especially since Twitter increased its allowed character limit. Recurrent Neural Networks have become a well-established method to classify tweets over recent years, but have struggled with accurately classifying longer sequences due to the vanishing and exploding gradient descent problem. A common technique to overcome this problem has been to prune tweets to a shorter sequence length. However, this also meant that often potentially important emotion carrying information, which is often found towards the end of a tweet, was lost (e.g., emojis and hashtags). As such, tweets mostly face also problems with classifying long sequences, similar to other natural language processing tasks. To overcome these challenges, a multi-scale hierarchical recurrent neural network is proposed and benchmarked against other existing methods. The proposed learning model outperforms existing methods on the same task by up to 10.52%. Another key component for the accurate classification of tweets has been the use of language models, where more recent techniques such as BERT and ELMO have achieved great success in a range of different tasks. However, in Sentiment Analysis, a key challenge has always been to use language models that do not only take advantage of the context a word is used in but also the sentiment it carries. Therefore the second part of this work looks at improving representation learning for emotion classification by introducing both linguistic and emotion knowledge to language models. A new linguistically inspired knowledge graph called RELATE is introduced. Then a new language model is trained on a Graph Convolutional Neural Network and compared against several other existing language models, where it is found that the proposed embedding representations achieve competitive results to other LMs, whilst requiring less pre-training time and data. Finally, it is investigated how the proposed methods can be applied to document-level classification tasks. More specifically, this work focuses on the accurate classification of suicide notes and analyses whether sentiment and linguistic features are important for accurate classification.
Citation
Schoene, A. M. (2021). Deep learning with knowledge graphs for fine-grained emotion classification in text. (Thesis). University of Hull. Retrieved from https://hull-repository.worktribe.com/output/4223160
Thesis Type | Thesis |
---|---|
Deposit Date | May 11, 2021 |
Publicly Available Date | Feb 23, 2023 |
Keywords | Computer science |
Public URL | https://hull-repository.worktribe.com/output/4223160 |
Additional Information | Department of Computer Science and Technology, The University of Hull |
Award Date | Apr 23, 2021 |
Files
Thesis
(8.5 Mb)
PDF
Copyright Statement
© 2021 Schoene, Annika M. All rights reserved. No part of this publication may be reproduced without the written permission of the copyright holder.
You might also like
User Engagement Triggers in Social Media Discourse on Biodiversity Conservation
(2024)
Journal Article
Real-time social media sentiment analysis for rapid impact assessment of floods
(2023)
Journal Article
Downloadable Citations
About Repository@Hull
Administrator e-mail: repository@hull.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search