Title
Fast Top-k Area Topics Extraction with Knowledge Base
Abstract
What are the most representative research topics in Artificial Intelligence (AI)? We formulate the problem as extracting top-k topics that can best represent a given area with the help of knowledge base. We theoretically prove that the problem is NP-hard and propose an optimization model, FastKATE, to address this problem by combining both explicit and latent representations for each topic. We leverage a large-scale knowledge base (Wikipedia) to generate topic embeddings using neural networks and use this kind of representations to help capture the representativeness of topics for given areas. We develop a fast heuristic algorithm to efficiently solve the problem with a provable error bound. We evaluate the proposed model on three real-world datasets. Experimental results demonstrate our model's effectiveness, robustness, real-timeness (return results in <;1s), and its superiority over several alternative methods.
Year
DOI
Venue
2018
10.1109/DSC.2018.00016
2018 IEEE Third International Conference on Data Science in Cyberspace (DSC)
Keywords
DocType
Volume
knowledge discovery,data mining,topic extraction,knowledge base,heuristic search
Conference
abs/1710.04822
ISBN
Citations 
PageRank 
978-1-5386-4211-5
1
0.36
References 
Authors
8
5
Name
Order
Citations
PageRank
Fang Zhang14814.46
Xiaochen Wang2189.25
Jingfei Han310.36
Jie Tang45871300.22
Shiyin Wang540.75