제주대학교 Repository

A Hybrid Approach for Topic Discovery and Recommendations based on Topic Modeling and Deep Learning

Metadata Downloads
Abstract
The elevated growth rate of internet users and boom in the applications development, mainly e-business, has familiarized users to write their comments and reviews about the product they received. These reviews help customers in decision making. In this thesis, we aim to propose a recommender system that is primarily based on user comments or reviews. We aim to utilize the user-generated data to perform topic discovery and recommendations based on the topics discovered. We take benefit of the language modeling practices and attempt to join them with neural networks (NN) to identify some latent discussion patterns in the feedbacks of users. Our goal is to design a probabilistic language modeling based neural network architecture, where RNN's special type Long-Short Term Memory (LSTM) model is used to predict discussion trends or topics. Probabilistic topic models utilize word co-occurrences across documents to identify topically related words. Due to their complementary nature, these models define different notions of word similarity, which, when combined, can produce better topical representations. We have trained our Latent Dirichlet Allocation (LDA) model with word embeddings to improve the topic quality. The hybrid of LDA-LSTM takes multiple inputs including words and topic embeddings, the temporal details, and probability distributions from LDA. To capture the long contextual dependencies and limited vocabulary challenge, LSTM model takes both direct word embeddings and temporal topic distributions. Latent topic distributions are used to feed LSTM layers that are learnt based on the LDA model which is pre trained. This proposed LDA-LSTM model, unlike previous studies, is capable of capturing both long range contextual and temporal dependencies. To enhance the recommendations results, we us objective function that takes into account the identified topic categories and user preferences. We have formulated an equation model to access the credibility or authenticity of a recommendation. The proposed hybrid approach is experimented on crowdfunding campaigns and is used for reliable project recommendations. We collected data for scam and non-scam crowdfunding projects and applied our proposed approach to suggest secure and optimized recommendations to investors by using their comments. For the proposed approach following metrics are considered for the performance analysis and evaluation of the proposed approach: i) prediction accuracy, ii) an optimal number of identified topics, and iii) the number of epochs. We compared our results with NN and NN-LDA based on these performance metrics. The strengths of both integrated models offer that the proposed model can play a significant part in an improved and effective understanding of user generated data such as crowdfunding comments.
Author(s)
Wafa Shafqat
Issued Date
2020
Awarded Date
2020. 2
Type
Dissertation
URI
http://dcoll.jejunu.ac.kr/common/orgView/000000009434
Affiliation
제주대학교 대학원
Department
대학원 컴퓨터공학과
Advisor
Byun, Yung Cheol
Table Of Contents
Acknowledgement iii
List of Figures iv
List of Tables vi
Abstract 1
Chapter 1: Introduction 3
Chapter 2: Related Work 9
2.1 Background Study . 9
2.1.1. Topic Modeling 9
2.1.2. Recommendations System 23
2.1.3 Deep Learning and Topic Modeling 27
2.2 Limitations of Existing Solutions . 29
Chapter 3: Proposed Model and Architecture 31
3.1 Conceptual Design 32
3.2 Topic Modeling based on LDA 36
3.2.1 Data Preprocessing 36
3.2.2 LDA Parameters and Configurations 37
3.3 Deep Learning Methods based on RNN-LSTM . 40
3.4 Credibility Estimation . 41
3.5 Overall Structure of the Proposed System 53
3.5.1 Input Data 55
3.5.2 Data Pre-processing 55
3.5.3 LDA and LSTM based Hybrid Model 56
3.5.4 Recommendations Module 56
3.6 Structural Details of the Hybrid Model . 57
Chapter 4: Crowdfunding Project Recommendations: An Example Application 59
4.1 Experimental Data . 64
4.2 Model (LSTM-LDA) Training 65
4.3 Project Integration . 73
4.4 Objective Function Formalization for the Optimal Project Recommendations 73
4.4.1 Optimization 78
4.5 Experimental Setup . 79
4.5.1 Training 80
4.5.2 Testing 81
4.6 Example Scenario 82
Chapter 5: Experiments and Performance Analysis 87
5.1 Optimized Recommendations and Prediction Accuracies 87
5.2 Prediction Accuracy of Topic Classes 88
5.3 Prediction Accuracy of Topic Classes for Variable Number of Topics 88
5.4 Discussion Trends in Suspicious Campaigns 89
5.5 Analysis of Recommendation Results 91
Chapter 6: Conclusions 97
References
Degree
Doctor
Publisher
제주대학교 대학원
Citation
Wafa Shafqat. (2020). A Hybrid Approach for Topic Discovery and Recommendations based on Topic Modeling and Deep Learning
Appears in Collections:
General Graduate School > Computer Engineering
공개 및 라이선스
  • 공개 구분공개
파일 목록

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.