Explaining and harnessing adversarial examples

Re-thinking data strategy and integration for artificial intelligence: concepts, opportunities, and challenges

A Aldoseri, KN Al-Khalifa, AM Hamouda - Applied Sciences, 2023 - mdpi.com

The use of artificial intelligence (AI) is becoming more prevalent across industries such as
healthcare, finance, and transportation. Artificial intelligence is based on the analysis of …

被引用次数：220 相关文章所有 9 个版本

[PDF] springer.com

Interpreting black-box models: a review on explainable artificial intelligence

V Hassija, V Chamola, A Mahapatra, A Singal… - Cognitive …, 2024 - Springer

Recent years have seen a tremendous growth in Artificial Intelligence (AI)-based
methodological development in a broad range of domains. In this rapidly evolving field …

被引用次数：249 相关文章所有 7 个版本

[PDF] arxiv.org

Universal and transferable adversarial attacks on aligned language models

A Zou, Z Wang, N Carlini, M Nasr, JZ Kolter… - arXiv preprint arXiv …, 2023 - arxiv.org

Because" out-of-the-box" large language models are capable of generating a great deal of
objectionable content, recent work has focused on aligning these models in an attempt to …

被引用次数：664 相关文章所有 8 个版本

[PDF] arxiv.org

Holistic evaluation of language models

P Liang, R Bommasani, T Lee, D Tsipras… - arXiv preprint arXiv …, 2022 - arxiv.org

Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …

被引用次数：934 相关文章所有 5 个版本

[PDF] github.io

The rise and potential of large language model based agents: A survey

Z Xi, W Chen, X Guo, W He, Y Ding, B Hong… - arXiv preprint arXiv …, 2023 - arxiv.org

For a long time, humanity has pursued artificial intelligence (AI) equivalent to or surpassing
the human level, with AI agents considered a promising vehicle for this pursuit. AI agents are …

被引用次数：466 相关文章所有 4 个版本

[PDF] arxiv.org

Can AI-generated text be reliably detected?

VS Sadasivan, A Kumar, S Balasubramanian… - arXiv preprint arXiv …, 2023 - arxiv.org

The unregulated use of LLMs can potentially lead to malicious consequences such as
plagiarism, generating fake news, spamming, etc. Therefore, reliable detection of AI …

被引用次数：319 相关文章所有 4 个版本

[PDF] mlr.press

Better diffusion models further improve adversarial training

Z Wang, T Pang, C Du, M Lin… - … on Machine Learning, 2023 - proceedings.mlr.press

It has been recognized that the data generated by the denoising diffusion probabilistic
model (DDPM) improves adversarial training. After two years of rapid development in …

被引用次数：160 相关文章所有 9 个版本

[PDF] mlr.press

Cross-entropy loss functions: Theoretical analysis and applications

A Mao, M Mohri, Y Zhong - International conference on …, 2023 - proceedings.mlr.press

Cross-entropy is a widely used loss function in applications. It coincides with the logistic loss
applied to the outputs of a neural network, when the softmax is used. But, what guarantees …

被引用次数：196 相关文章所有 7 个版本

[PDF] arxiv.org

Diffusion models for adversarial purification

W Nie, B Guo, Y Huang, C Xiao, A Vahdat… - arXiv preprint arXiv …, 2022 - arxiv.org

Adversarial purification refers to a class of defense methods that remove adversarial
perturbations using a generative model. These methods do not make assumptions on the …

被引用次数：372 相关文章所有 8 个版本

[PDF] thecvf.com

Spatext: Spatio-textual representation for controllable image generation

O Avrahami, T Hayes, O Gafni… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent text-to-image diffusion models are able to generate convincing results of
unprecedented quality. However, it is nearly impossible to control the shapes of different …

被引用次数：152 相关文章所有 5 个版本

高级搜索

QQ 群