Data collection and quality challenges in deep learning: A data-centric ai perspective

SE Whang, Y Roh, H Song, JG Lee - The VLDB Journal, 2023 - Springer
Data-centric AI is at the center of a fundamental shift in software engineering where machine
learning becomes the new software, powered by big data and computing infrastructure …

A survey on recent approaches for natural language processing in low-resource scenarios

MA Hedderich, L Lange, H Adel, J Strötgen… - arXiv preprint arXiv …, 2020 - arxiv.org
Deep neural networks and huge language models are becoming omnipresent in natural
language applications. As they are known for requiring large amounts of training data, there …

Fine-tuning pre-trained language model with weak supervision: A contrastive-regularized self-training approach

Y Yu, S Zuo, H Jiang, W Ren, T Zhao… - arXiv preprint arXiv …, 2020 - arxiv.org
Fine-tuned pre-trained language models (LMs) have achieved enormous success in many
natural language processing (NLP) tasks, but they still require excessive labeled data in the …

Deep evidential learning with noisy correspondence for cross-modal retrieval

Y Qin, D Peng, X Peng, X Wang, P Hu - Proceedings of the 30th ACM …, 2022 - dl.acm.org
Cross-modal retrieval has been a compelling topic in the multimodal community. Recently,
to mitigate the high cost of data collection, the co-occurred pairs (eg, image and text) could …

[图书][B] Sentiment analysis: Mining opinions, sentiments, and emotions

B Liu - 2020 - books.google.com
Sentiment analysis is the computational study of people's opinions, sentiments, emotions,
moods, and attitudes. This fascinating problem offers numerous research challenges, but …

Graph convolutional network with multiple weight mechanisms for aspect-based sentiment analysis

Z Zhao, M Tang, W Tang, C Wang, X Chen - Neurocomputing, 2022 - Elsevier
Aspect-based sentiment analysis (ABSA) aims at determining the sentiment polarity of the
given aspect term in a sentence. Recently, graph convolution network (GCN) has been used …

Denoising multi-source weak supervision for neural text classification

W Ren, Y Li, H Su, D Kartchner, C Mitchell… - arXiv preprint arXiv …, 2020 - arxiv.org
We study the problem of learning neural text classifiers without using any labeled data, but
only easy-to-provide rules as multiple weak supervision sources. This problem is …

Transfer learning and distant supervision for multilingual transformer models: A study on African languages

MA Hedderich, D Adelani, D Zhu, J Alabi… - arXiv preprint arXiv …, 2020 - arxiv.org
Multilingual transformer models like mBERT and XLM-RoBERTa have obtained great
improvements for many NLP tasks on a variety of languages. However, recent works also …

A dependency syntactic knowledge augmented interactive architecture for end-to-end aspect-based sentiment analysis

Y Liang, F Meng, J Zhang, Y Chen, J Xu, J Zhou - Neurocomputing, 2021 - Elsevier
The end-to-end aspect-based sentiment analysis (ABSA) task remains to be a long-standing
challenge, which aims to extract the aspect term and then identify its sentiment orientation. In …

CrowdChecked: detecting previously fact-checked claims in social media

M Hardalov, A Chernyavskiy, I Koychev… - arXiv preprint arXiv …, 2022 - arxiv.org
While there has been substantial progress in developing systems to automate fact-checking,
they still lack credibility in the eyes of the users. Thus, an interesting approach has emerged …