Feature selection is an effective technique for structured data analytics, aiming to eliminate redundant features and irrelevant features for downstream tasks (eg, classification). With the …
Accessing large-scale structured datasets such as WDC [21], having millions of tables coming from hundreds of thousands of sources is very challenging [11, 13, 14, 30, 31]. Even …
Data profiling is a" set of statistical data analysis activities to determine properties of a dataset". Historically, it was aimed at data (not meta-data), but at scale, the tables' meta-data …
In data platforms with thousands of data tables available for exploration, users often need to retrieve some data based on limited knowledge of the data sources and schema. The task …
Accessing large-scale structured datasets such as WDC [31] or CORD-191 is very challenging [11, 13, 14, 41, 42]. Even if one topic (eg Vaccine Side-Effects) is of interest, the …
B Kandibedala, A Pyayt, N Piraino… - … on Extending Database …, 2023 - par.nsf.gov
We describe a Web-scale interactive Knowledge Graph (KG), populated with trustworthy information from the latest published medical findings on COVID-19. Currently existing …
M Chauhan, A Pyayt, M Gubanov - … of the ACM Web Conference 2023, 2023 - dl.acm.org
Accessing large-scale structured datasets such as WDC or CORD-191 is very challenging. Even if one topic (eg COVID-19 vaccine efficacy) is of interest, all topical tables in different …
K Islam, M Gubanov - Database and Expert Systems Applications: 32nd …, 2021 - Springer
Tabular metadata (ie attribute names) location and classification is a fundamental problem for large-scale structured corpora. Web tables [24], CORD-19 [35], have thousands to …
Metadata location and classification is an important problem for large-scale structured datasets. For example, Web tables [29] have hundreds of millions of tables, but often have …