Retaining data from streams of social platforms with minimal regret

TT Nguyen, CT Duong, M Weidlich, H Yin… - … Joint Conference on …, 2017 - infoscience.epfl.ch
Today's social platforms, such as Twitter and Facebook, continuously generate massive
volumes of data. The resulting data streams exceed any reasonable limit for permanent …

Scalable approximation algorithm for graph summarization

MA Beg, M Ahmad, A Zaman, I Khan - … and Data Mining: 22nd Pacific-Asia …, 2018 - Springer
Massive sizes of real-world graphs, such as social networks and web graph, impose serious
challenges to process and perform analytics on them. These issues can be resolved by …

Popularity sensitive and domain-aware summarization for web tables

Y Xi, N Wang, S Hao, Y Zhang, X Chen - Information Sciences, 2023 - Elsevier
Table summarization can be of great help, which generates a concise and informative
overview of a table to assist users to understand the table easily and unambiguously. A high …

A Review Selection Method Based on Consumer Decision Phases in E-commerce

J Zhang, X Li, L Wang - ACM Transactions on Information Systems, 2023 - dl.acm.org
A valuable small subset strategically selected from massive online reviews is beneficial to
improve consumers' decision-making efficiency in e-commerce. Existing review selection …

Robust guarantees of stochastic greedy algorithms

A Hassidim, Y Singer - International Conference on Machine …, 2017 - proceedings.mlr.press
In this paper we analyze the robustness of stochastic variants of the greedy algorithm for
submodular maximization. Our main result shows that for maximizing a monotone …

Efficient representative subset selection over sliding windows

Y Wang, Y Li, KL Tan - IEEE Transactions on Knowledge and …, 2018 - ieeexplore.ieee.org
Representative subset selection (RSS) is an important tool for users to draw insights from
massive datasets. Existing literature models RSS as the submodular maximization problem …

Interaction-aware topic model for microblog conversations through network embedding and user attention

R He, X Zhang, D Jin, L Wang, J Dang… - Proceedings of the 27th …, 2018 - aclanthology.org
Traditional topic models are insufficient for topic extraction in social media. The existing
methods only consider text information or simultaneously model the posts and the static …

Fast machine learning in data science with a comprehensive data summarization

ST Al-Amin, C Ordonez - … Conference on Big Data (Big Data), 2021 - ieeexplore.ieee.org
Machine learning algorithms must be able to handle large volume in big data. Nowadays,
data science languages such as Python and R, are widely popular to compute machine …

Automatic generation of entity-oriented summaries for reputation management

J Rodríguez-Vidal, J Carrillo-de-Albornoz… - Journal of Ambient …, 2020 - Springer
Producing online reputation summaries for an entity (company, brand, etc.) is a focused
summarization task with a distinctive feature: issues that may affect the reputation of the …

Topic Extraction of Events on Social Media Using Reinforced Knowledge

X Zhang, R He - … Conference, KSEM 2018, Changchun, China, August …, 2018 - Springer
The conventional topic models for topic extraction of events on social media are insufficient
due to the data sparsity and the noise of microblog posts. The existing researches use word …