Completely labeled pathology datasets are often challenging and time-consuming to obtain. Semi-supervised learning (SSL) methods are able to learn from fewer labeled data points …
A Bissett, A Fitzgerald, T Meintjes, PM Mele, F Reith… - GigaScience, 2016 - Springer
Background Microbial inhabitants of soils are important to ecosystem and planetary functions, yet there are large gaps in our knowledge of their diversity and ecology. The …
Having a multitude of unlabeled data and few labeled ones is a common problem in many practical applications. A successful methodology to tackle this problem is self-training semi …
A Abdulhafedh - Open Access Library Journal, 2022 - scirp.org
Statistical techniques are important tools in modeling research work. However, there could be misleading outcomes if sufficient care is undermined in choosing the right approach …
The goal of sentiment analysis is to determine opinions, emotions, and attitudes presented in source material. In tweet sentiment analysis, opinions in messages can be typically …
D Wu, X Luo, G Wang, M Shang… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Self-labeled technique, a paradigm of semisupervised classification (SSC), is highly effective in alleviating the shortage of labeled data in classification tasks via an iterative self …
M Piernik, T Morzy - Knowledge and Information Systems, 2021 - Springer
There is a certain belief among data science researchers and enthusiasts alike that clustering can be used to improve classification quality. Insofar as this belief is fairly …
This paper demonstrates a new water end-use disaggregation and classification tool that builds on existing end-use disaggregation studies and addresses the unavailability of code …
B Li, J Wang, Z Yang, J Yi, F Nie - Information Sciences, 2023 - Elsevier
Self-training is a commonly semi-supervised learning Algorithm framework. How to select the high-confidence samples is a crucial step for algorithms based on self-training …