- 学术资源搜索

A review on fairness in machine learning

D Pessach, E Shmueli - ACM Computing Surveys (CSUR), 2022 - dl.acm.org

An increasing number of decisions regarding the daily lives of human beings are being
controlled by artificial intelligence and machine learning (ML) algorithms in spheres ranging …

被引用次数：431 相关文章

[PDF] arxiv.org

From show to tell: A survey on deep learning-based image captioning

M Stefanini, M Cornia, L Baraldi… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Connecting Vision and Language plays an essential role in Generative Intelligence. For this
reason, large research efforts have been devoted to image captioning, ie describing images …

被引用次数：318 相关文章所有 11 个版本

[PDF] mlr.press

Scaling vision transformers to 22 billion parameters

M Dehghani, J Djolonga, B Mustafa… - International …, 2023 - proceedings.mlr.press

The scaling of Transformers has driven breakthrough capabilities for language models. At
present, the largest large language models (LLMs) contain upwards of 100B parameters …

被引用次数：370 相关文章所有 9 个版本

[PDF] arxiv.org

Muse: Text-to-image generation via masked generative transformers

H Chang, H Zhang, J Barber, AJ Maschinot… - arXiv preprint arXiv …, 2023 - arxiv.org

We present Muse, a text-to-image Transformer model that achieves state-of-the-art image
generation performance while being significantly more efficient than diffusion or …

被引用次数：360 相关文章所有 6 个版本

[PDF] arxiv.org

Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

A Srivastava, A Rastogi, A Rao, AAM Shoeb… - arXiv preprint arXiv …, 2022 - arxiv.org

Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …

被引用次数：962 相关文章所有 11 个版本

[PDF] neurips.cc

Photorealistic text-to-image diffusion models with deep language understanding

C Saharia, W Chan, S Saxena, L Li… - Advances in neural …, 2022 - proceedings.neurips.cc

We present Imagen, a text-to-image diffusion model with an unprecedented degree of
photorealism and a deep level of language understanding. Imagen builds on the power of …

被引用次数：4273 相关文章所有 11 个版本

[PDF] neurips.cc

Flamingo: a visual language model for few-shot learning

JB Alayrac, J Donahue, P Luc… - Advances in neural …, 2022 - proceedings.neurips.cc

Building models that can be rapidly adapted to novel tasks using only a handful of annotated
examples is an open challenge for multimodal machine learning research. We introduce …

被引用次数：2641 相关文章所有 7 个版本

[PDF] neurips.cc

Video diffusion models

J Ho, T Salimans, A Gritsenko… - Advances in …, 2022 - proceedings.neurips.cc

Generating temporally coherent high fidelity video is an important milestone in generative
modeling research. We make progress towards this milestone by proposing a diffusion …

被引用次数：1000 相关文章所有 8 个版本

[PDF] arxiv.org

Easily accessible text-to-image generation amplifies demographic stereotypes at large scale

F Bianchi, P Kalluri, E Durmus, F Ladhak… - Proceedings of the …, 2023 - dl.acm.org

Machine learning models that convert user-written text descriptions into images are now
widely available online and used by millions of users to generate millions of images a day …

被引用次数：197 相关文章所有 4 个版本

[PDF] neurips.cc

Stable bias: Evaluating societal representations in diffusion models

S Luccioni, C Akiki, M Mitchell… - Advances in Neural …, 2024 - proceedings.neurips.cc

As machine learning-enabled Text-to-Image (TTI) systems are becoming increasingly
prevalent and seeing growing adoption as commercial services, characterizing the social …

被引用次数：46 相关文章所有 4 个版本

高级搜索

QQ 群

A review on fairness in machine learning

From show to tell: A survey on deep learning-based image captioning

Scaling vision transformers to 22 billion parameters

Muse: Text-to-image generation via masked generative transformers

Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

Photorealistic text-to-image diffusion models with deep language understanding

Flamingo: a visual language model for few-shot learning

Video diffusion models

Easily accessible text-to-image generation amplifies demographic stereotypes at large scale

Stable bias: Evaluating societal representations in diffusion models

引用