Dataset search: a survey

A Chapman, E Simperl, L Koesten, G Konstantinidis… - The VLDB Journal, 2020 - Springer
Generating value from data requires the ability to find, access and make sense of datasets.
There are many efforts underway to encourage data sharing and reuse, from scientific …

A survey on truth discovery

Y Li, J Gao, C Meng, Q Li, L Su, B Zhao… - ACM SIGKDD …, 2016 - dl.acm.org
Thanks to information explosion, data for the objects of interest can be collected from
increasingly more sources. However, for the same object, there usually exist conflicts among …

Big data integration

XL Dong, D Srivastava - 2013 IEEE 29th international …, 2013 - ieeexplore.ieee.org
The Big Data era is upon us: data is being generated, collected and analyzed at an
unprecedented scale, and data-driven decision making is sweeping through all aspects of …

A confidence-aware approach for truth discovery on long-tail data

Q Li, Y Li, J Gao, L Su, B Zhao, M Demirbas… - Proceedings of the …, 2014 - dl.acm.org
In many real world applications, the same item may be described by multiple sources. As a
consequence, conflicts among these sources are inevitable, which leads to an important …

Truth discovery on crowd sensing of correlated entities

C Meng, W Jiang, Y Li, J Gao, L Su, H Ding… - Proceedings of the 13th …, 2015 - dl.acm.org
With the popular usage of mobile devices and smartphones, crowd sensing becomes
pervasive in real life when human acts as sensors to report their observations about entities …

A Survey on Truth Discovery: Concepts, Methods, Applications, and Opportunities

S Wang, H Zhang, QZ Sheng, X Li… - … Transactions on Big …, 2024 - ieeexplore.ieee.org
In the era of data information explosion, there are different observations on an object (e.g., the
height of the Himalayas) from different sources on the web, social sensing, crowd sensing …

Conflicts to harmony: A framework for resolving conflicts in heterogeneous data by truth discovery

Y Li, Q Li, J Gao, L Su, B Zhao, W Fan… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
In many applications, one can obtain descriptions about the same objects or events from a
variety of sources. As a result, this will inevitably lead to data or information conflicts. One …

On the meaningfulness of “big data quality”

D Firmani, M Mecella, M Scannapieco… - Data Science and …, 2016 - Springer
In this paper, we discuss the application of concept of data quality to big data by highlighting
how much complex is to define it in a general way. Already data quality is a …

SLiMFast: Guaranteed results for data fusion and source reliability

T Rekatsinas, M Joglekar, H Garcia-Molina… - Proceedings of the …, 2017 - dl.acm.org
We focus on data fusion, i.e., the problem of unifying conflicting data from data sources into a
single representation by estimating the source accuracies. We propose SLiMFast, a …

On optimality of jury selection in crowdsourcing

Y Zheng, R Cheng, S Maniu, L Mo - Proceedings of the 18th International …, 2015 - hub.hku.hk
Recent advances in crowdsourcing technologies enable computationally challenging tasks
(e.g., sentiment analysis and entity resolution) to be performed by Internet workers, driven …