Exploring the Landscape of Automatic Text Summarization: A Comprehensive Survey

B Khan, ZA Shah, M Usman, I Khan, B Niazi - IEEE Access, 2023 - ieeexplore.ieee.org
The discipline of Automatic Text Summarization (ATS), which is expanding quickly, intends
to automatically create summaries of enormous amounts of text so that readers can save …

The Devil is in the Tails: How Long-Tailed Code Distributions Impact Large Language Models

X Zhou, K Kim, B Xu, J Liu, DG Han, D Lo - arXiv preprint arXiv …, 2023 - arxiv.org
Learning-based techniques, especially advanced Large Language Models (LLMs) for code,
have gained considerable popularity in various software engineering (SE) tasks. However …

Pitfalls in Language Models for Code Intelligence: A Taxonomy and Survey

X She, Y Liu, Y Zhao, Y He, L Li… - arXiv preprint arXiv …, 2023 - arxiv.org
Modern language models (LMs) have been successfully employed in source code
generation and understanding, leading to a significant increase in research focused on …

Challenges and practices of deep learning model reengineering: A case study on computer vision

W Jiang, V Banna, N Vivek, A Goel, N Synovic… - arXiv preprint arXiv …, 2023 - arxiv.org
Many engineering organizations are reimplementing and extending deep neural networks
from the research community. We describe this process as deep learning model …

Assessing the vulnerabilities of the open-source artificial intelligence (ai) landscape: A large-scale analysis of the hugging face platform

A Kathikar, A Nair, B Lazarine… - … on Intelligence and …, 2023 - ieeexplore.ieee.org
Artificial Intelligence (AI) has rapidly proliferated as a critical disruptive technology in the
21st century. Hugging Face hosts pre-trained models, facilitating the sharing and use of …

PTMTorrent: A Dataset for Mining Open-source Pre-trained Model Packages

W Jiang, N Synovic, P Jajal, TR Schorlemmer… - arXiv preprint arXiv …, 2023 - arxiv.org
Due to the cost of developing and training deep learning models from scratch, machine
learning engineers have begun to reuse pre-trained models (PTMs) and fine-tune them for …

Analysis of Failures and Risks in Deep Learning Model Converters: A Case Study in the ONNX Ecosystem

P Jajal, W Jiang, A Tewari, J Woo, YH Lu… - arXiv preprint arXiv …, 2023 - arxiv.org
Software engineers develop, fine-tune, and deploy deep learning (DL) models. They use
and re-use models in a variety of development frameworks and deploy them on a range of …

[HTML][HTML] An Exploratory Study of Helping Undergraduate Students Solve Literature Review Problems Using Litstudy and NLP

GKW Wong, SYK Li - Education Sciences, 2023 - mdpi.com
(1) Many undergraduate students struggle to produce a good literature review in their
dissertations, as they are not experienced, do not have sufficient time, and do not have the …

Active Code Learning: Benchmarking Sample-Efficient Training of Code Models

Q Hu, Y Guo, X Xie, M Cordy, L Ma… - arXiv preprint arXiv …, 2023 - arxiv.org
The costly human effort required to prepare the training data of machine learning (ML)
models hinders their practical development and usage in software engineering (ML4Code) …

Exploring Naming Conventions (and Defects) of Pre-trained Deep Learning Models in Hugging Face and Other Model Hubs

W Jiang, C Cheung, GK Thiruvathukal… - arXiv preprint arXiv …, 2023 - arxiv.org
As innovation in deep learning continues, many engineers want to adopt Pre-Trained deep
learning Models (PTMs) as components in computer systems. PTMs are part of a research-to …