Genre-oriented web content extraction with deep convolutional neural networks and statistical methods

BD Nguyen-Hoang, BT Pham-Hong… - Proceedings of the …, 2018 - aclanthology.org
Extracting clean textual content from the Web is the first and an essential step in
most downstream natural language processing tasks. Previous works in web content …

Bibliographic Information Extraction of Books' Web Pages Based on LDA Topic Model

X Li, Y Huo, L Huang