Related articles - Academic Resource Search

An Efficient Method Based on Region-adjacent Embedding for Text Classification of Chinese Electronic Medical Records

F Guo, T Wu, X Jin - 2020 5th International Conference on …, 2020 - ieeexplore.ieee.org

In the field of natural language processing (NLP), word-embedding-based models have
been widely applied in many tasks with great success, which are believed to make …

Cited by 2 · Related articles · All 2 versions

[PDF] arxiv.org

From dataset recycling to multi-property extraction and beyond

T Dwojak, M Pietruszka, Ł Borchmann… - arXiv preprint arXiv …, 2020 - arxiv.org

This paper investigates various Transformer architectures on the WikiReading Information
Extraction and Machine Reading Comprehension dataset. The proposed dual-source model …

Cited by 8 · Related articles · All 8 versions

A discriminative convolutional neural network with context-aware attention

Y Zhou, L Liao, Y Gao, H Huang, X Wei - ACM Transactions on …, 2020 - dl.acm.org

Feature representation and feature extraction are two crucial procedures in text mining.
Convolutional Neural Networks (CNN) have shown overwhelming success for text-mining …

Cited by 2 · Related articles

End-to-end QA on Covid-19: domain adaptation with synthetic training

R Gangi Reddy, B Iyer, M Arafat Sultan… - arXiv e …, 2020 - ui.adsabs.harvard.edu

End-to-end question answering (QA) requires both information retrieval (IR) over a large
document collection and machine reading comprehension (MRC) on the retrieved …

Cited by 2 · Related articles

[PDF] arxiv.org

Going full-tilt boogie on document understanding with text-image-layout transformer

R Powalski, Ł Borchmann, D Jurkiewicz… - Document Analysis and …, 2021 - Springer

We address the challenging problem of Natural Language Comprehension beyond plain-text
documents by introducing the TILT neural network architecture which simultaneously …

Cited by 150 · Related articles · All 7 versions

Scalable document image information extraction with application to domain-specific analysis

Y Zheng, S Kong, W Zhu, H Ye - 2019 IEEE International …, 2019 - ieeexplore.ieee.org

Document images are ubiquitous, but existing methods mainly focus on the text reading but
not information understanding. In this paper, we propose a novel document image …

Cited by 3 · Related articles · All 2 versions

[PDF] arxiv.org

Regulatory compliance through Doc2Doc information retrieval: A case study in EU/UK legislation where text similarity has limitations

I Chalkidis, M Fergadiotis, N Manginas… - arXiv preprint arXiv …, 2021 - arxiv.org

Major scandals in corporate history have urged the need for regulatory compliance, where
organizations need to ensure that their controls (processes) comply with relevant laws …

Cited by 20 · Related articles · All 5 versions

The resume corpus: a large dataset for research in information extraction systems

Y Su, J Zhang, J Lu - 2019 15th International Conference on …, 2019 - ieeexplore.ieee.org

We publish a Chinese Resume Corpus for researches of information extraction. The corpus
contains 178 thousand resume documents and over 33 million words. The resume …

Cited by 5 · Related articles · All 2 versions

[PDF] arxiv.org

A Data-centric Framework for Improving Domain-specific Machine Reading Comprehension Datasets

I Bojic, J Halim, V Suharman, S Tar, QC Ong… - arXiv preprint arXiv …, 2023 - arxiv.org

Low-quality data can cause downstream problems in high-stakes applications. Data-centric
approach emphasizes on improving dataset quality to enhance model performance. High …

A multi-resolution word embedding for document retrieval from large unstructured knowledge bases

T Cakaloglu, X Xu - arXiv preprint arXiv:1902.00663, 2019 - arxiv.org

Deep language models learning a hierarchical representation proved to be a powerful tool
for natural language processing, text mining and information retrieval. However …

Cited by 5 · Related articles · All 3 versions
