Stargan v2: Diverse image synthesis for multiple domains

Y Choi, Y Uh, J Yoo, JW Ha - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
A good image-to-image translation model should learn a mapping between different visual
domains while satisfying the following properties: 1) diversity of generated images and 2) …

In defence of metric learning for speaker recognition

JS Chung, J Huh, S Mun, M Lee, HS Heo… - arXiv preprint arXiv …, 2020 - arxiv.org
The objective of this paper is' open-set'speaker recognition of unseen speakers, where ideal
embeddings should be able to condense information into a compact utterance-level …

Character region awareness for text detection

Y Baek, B Lee, D Han, S Yun… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Scene text detection methods based on neural networks have emerged recently and have
shown promising results. Previous methods trained with rigid word-level bounding boxes …

BioBERT: a pre-trained biomedical language representation model for biomedical text mining

J Lee, W Yoon, S Kim, D Kim, S Kim, CH So… - …, 2020 - academic.oup.com
Motivation Biomedical text mining is becoming increasingly important as the number of
biomedical documents rapidly grows. With the progress in natural language processing …

Dataset condensation with contrastive signals

S Lee, S Chun, S Jung, S Yun… - … Conference on Machine …, 2022 - proceedings.mlr.press
Recent studies have demonstrated that gradient matching-based dataset synthesis, or
dataset condensation (DC), methods can achieve state-of-theart performance when applied …

Exploiting spatial dimensions of latent in gan for real-time image editing

H Kim, Y Choi, J Kim, S Yoo… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Generative adversarial networks (GANs) synthesize realistic images from random latent
vectors. Although manipulating the latent vectors controls the synthesized outputs, editing …

Efficient dialogue state tracking by selectively overwriting memory

S Kim, S Yang, G Kim, SW Lee - arXiv preprint arXiv:1911.03906, 2019 - arxiv.org
Recent works in dialogue state tracking (DST) focus on an open vocabulary-based setting to
resolve scalability and generalization issues of the predefined ontology-based approaches …

Rethinking data augmentation for image super-resolution: A comprehensive analysis and a new strategy

J Yoo, N Ahn, KA Sohn - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
Data augmentation is an effective way to improve the performance of deep networks.
Unfortunately, current methods are mostly developed for high-level vision tasks (eg …

Clova baseline system for the voxceleb speaker recognition challenge 2020

HS Heo, BJ Lee, J Huh, JS Chung - arXiv preprint arXiv:2009.14153, 2020 - arxiv.org
This report describes our submission to the VoxCeleb Speaker Recognition Challenge
(VoxSRC) at Interspeech 2020. We perform a careful analysis of speaker recognition models …

Online continual learning on class incremental blurry task configuration with anytime inference

H Koh, D Kim, JW Ha, J Choi - arXiv preprint arXiv:2110.10031, 2021 - arxiv.org
Despite rapid advances in continual learning, a large body of research is devoted to
improving performance in the existing setups. While a handful of work do propose new …