%0 Journal Article %A LI Jia-shan %A LU Mei-lian %A WANG Zi %T Hierarchical News Topic Detection Using Improved LSH %D 2014 %R 10.13190/j.jbupt.2014.03.007 %J Journal of Beijing University of Posts and Telecommunications %P 32-37 %V 37 %N 3 %X

To improve the timeliness of detecting topics in retrospective topic detection, an improved locality sensitive Hashing (LSH) algorithm is proposed and applied in constructing hierarchical topic model for web news. Firstly, the news content feature is excavated, and the topic feature is excavated using latent dirichlet allocation model. Then the non-binary content eigenvector and topic eigenvector are converted to binary feature space. Finally, news articles are clustered in order using binary content eigenvector and binary topic eigenvector by LSH, and the hierarchical topic-content news topic model is generated. Experiments prove the following results: extracting content feature and topic feature can express the news exactly; converting content eigenvector and topic eigenvector to unified binary space can reduce the time complexity of clustering, and thus increase the efficiency of topic detection while ensure the accuracy and semantic expansibility.

%U https://journal.bupt.edu.cn/EN/10.13190/j.jbupt.2014.03.007