P Suhan, L Zhiqiang, D Juan - Journal of Software, 2018 - jsoftware.us
To simplify the operation of web text content extraction and improve the accuracy of that, a
new extraction method based on text-punctuation distribution and tag features (TPDT) is …