Cancerkg. org-a web-scale, interactive, verifiable knowledge graph-llm hybrid for assisting with optimal cancer treatment and care

M Gubanov, A Pyayt, A Karolak - Proceedings of the 33rd ACM …, 2024 - dl.acm.org
Here, we describe one of the first Web-scale hybrid Knowledge Graph (KG)-Large
Language Model (LLM), populated with the latest peer-reviewed medical knowledge on …

PA-FEAT: Fast Feature Selection for Structured Data via Progress-Aware Multi-Task Deep Reinforcement Learning

J Zhang, Z Luo, Q Xu, M Zhang - 2023 IEEE 39th International …, 2023 - ieeexplore.ieee.org
Feature selection is an effective technique for structured data analytics, aiming to eliminate
redundant features and irrelevant features for downstream tasks (eg, classification). With the …

Simplifying access to large-scale structured datasets by meta-profiling with scalable training set enrichment

S Pavia, R Khan, A Pyayt, M Gubanov - Proceedings of the 2022 …, 2022 - dl.acm.org
Accessing large-scale structured datasets such as WDC [21], having millions of tables
coming from hundreds of thousands of sources is very challenging [11, 13, 14, 30, 31]. Even …

Visualizing and Querying Large-scale Structured Datasets by Learning Multi-layered 3D Meta-Profiles

M Gubanov, A Pyayt, S Pavia - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Data profiling is a" set of statistical data analysis activities to determine properties of a
dataset". Historically, it was aimed at data (not meta-data), but at scale, the tables' meta-data …

Discovery and Matching Numerical Attributes in Data Lakes

P Sukprasert, GYY Chan, RA Rossi… - … Conference on Big …, 2023 - ieeexplore.ieee.org
In data platforms with thousands of data tables available for exploration, users often need to
retrieve some data based on limited knowledge of the data sources and schema. The task …

Leveraging Scalable Profiling to Learn and Visualize the Latest Trustworthy COVID-19 Medical Research Findings

M Gubanov, S Pavia, A Pyayt, W Goble - Proceedings of the 31st ACM …, 2022 - dl.acm.org
Accessing large-scale structured datasets such as WDC [31] or CORD-191 is very
challenging [11, 13, 14, 41, 42]. Even if one topic (eg Vaccine Side-Effects) is of interest, the …

[PDF][PDF] COVIDKG. ORG-a Web-scale COVID-19 Interactive, Trustworthy Knowledge Graph, Constructed and Interrogated for Bias using Deep-Learning

B Kandibedala, A Pyayt, N Piraino… - … on Extending Database …, 2023 - par.nsf.gov
We describe a Web-scale interactive Knowledge Graph (KG), populated with trustworthy
information from the latest published medical findings on COVID-19. Currently existing …

Learning Topical Structured Interfaces from Medical Research Literature

M Chauhan, A Pyayt, M Gubanov - … of the ACM Web Conference 2023, 2023 - dl.acm.org
Accessing large-scale structured datasets such as WDC or CORD-191 is very challenging.
Even if one topic (eg COVID-19 vaccine efficacy) is of interest, all topical tables in different …

Scalable Tabular Metadata Location and Classification in Large-Scale Structured Datasets

K Islam, M Gubanov - Database and Expert Systems Applications: 32nd …, 2021 - Springer
Tabular metadata (ie attribute names) location and classification is a fundamental problem
for large-scale structured corpora. Web tables [24], CORD-19 [35], have thousands to …

[PDF][PDF] Hybrid Metadata Classification in Large-scale Structured Datasets.

S Pavia, N Piraino, K Islam, A Pyayt, MN Gubanov - J. Data Intell., 2022 - rintonpress.com
Metadata location and classification is an important problem for large-scale structured
datasets. For example, Web tables [29] have hundreds of millions of tables, but often have …