publications

2025

2025

  1. ICCV
    An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval
    Jaeseok Byun, Seokhyeon Jeong, Wonjae Kim, Sanghyuk Chun, and Taesup Moon
    In 20th International Conference on Computer Vision (ICCV 2025), 2025
  2. AAAI
    Extract Free Dense Misalignment from CLIP
    JeongYeon Nam, Jinbae Im, Wonjae Kim, and Taeho Kil
    In 39th AAAI Conference on Artificial Intelligence (AAAI 2025), 2025
  3. ICLR
    Probabilistic Language-Image Pre-Training
    Sanghyuk Chun, Wonjae Kim, Song Park, and Sangdoo Yun
    In 13th International Conference on Learning Representations (ICLR 2025), 2025
  4. CVPR-W
    Emergence of Text Readability in Vision Language Models
    Jaeyoo Park, Sanghyuk Chun, Wonjae Kim, Sangdoo Yun, and Bohyung Han
    In 2nd Workshop on Emergent Visual Abilities and Limits of Foundation Models at CVPR 2025, 2025

2024

2024

  1. ECCV Oral
    HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts
    Wonjae Kim, Sanghyuk Chun, Taekyung Kim, Dongyoon Han, and Sangdoo Yun
    In 17th European Conference on Computer Vision (ECCV 2024), 2024
  2. CHIL
    Vision-Language Generative Model for View-Specific Chest X-ray Generation
    Hyungyung Lee, Da Young Lee, Wonjae Kim, Jin-Hwa Kim, Tackeun Kim, Jihang Kim, Leonard Sunwoo, and Edward Choi
    In The 5th Annual Conference on Health, Inference, and Learning (CHIL 2024), 2024
  3. TMLR
    CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
    Geonmo Gu*, Sanghyuk Chun*, Wonjae Kim, HeeJae Jun, Yoohoon Kang, and Sangdoo Yun
    In Transactions on Machine Learning Research (TMLR 2024), 2024
  4. ICML
    STELLA: Continual Audio-Video Pre-training with Spatio-Temporal Localized Alignment
    Jaewoo Lee*, Jaehong Yoon*, Wonjae Kim, Yunji Kim, and Sung Ju Hwang
    In 41st International Conference on Machine Learning (ICML 2024), 2024
  5. CVPR
    Language-only Efficient Training of Zero-shot Composed Image Retrieval
    Geonmo Gu*, Sanghyuk Chun*, Wonjae Kim, Yoohoon Kang, and Sangdoo Yun
    In 41st Conference on Computer Vision and Pattern Recognition (CVPR 2024), 2024

2023

2023

  1. ACL
    Pivotal Role of Language Modeling in Recommender Systems: Enriching Task-specific and Task-agnostic Representation Learning
    Kyuyong Shin*, Hanock Kwak*, Wonjae Kim, Jisu Jeong, Seungjae Jung, Kyung-Min Kim, Jung-Woo Ha, and Sang-Woo Lee
    In 60th Annual Meeting of the Association for Computational Linguistics (ACL 2023), 2023
  2. ICCV
    SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage
    Song Park*, Sanghyuk Chun*, Byeongho Heo, Wonjae Kim, and Sangdoo Yun
    In 19th International Conference on Computer Vision (ICCV 2023), 2023
  3. ICLR
    What Do Self-Supervised Vision Transformers Learn?
    Namuk Park, Wonjae Kim, Byeongho Heo, Taekyung Kim, and Sangdoo Yun
    In 11th International Conference on Learning Representations (ICLR 2023), 2023
  4. ICML-W
    Computational Approaches for App-to-App Retrieval and Design Consistency Check
    Seokhyeon Park*, Wonjae Kim*, Young-Ho Kim, and Jinwook Seo
    2023

2022

2022

  1. Entropy
    Discrete Infomax Codes for Supervised Representation Learning
    Yoonho Lee, Wonjae Kim, Wonpyo Park, and Seungjin Choi
    Entropy special issue “Theory and Applications of Information Processing Algorithms”, 2022
  2. ICLR
    ViDT: An Efficient and Effective Fully Transformer-based Object Detector
    Hwanjun Song, Deqing Sun, Sanghyuk Chun, Varun Jampani, Dongyoon Han, Byeongho Heo, Wonjae Kim, and Ming-Hsuan Yang
    In 10th International Conference on Learning Representations (ICLR 2022), 2022
  3. An Extendable, Efficient and Effective Transformer-based Object Detector
    Hwanjun Song, Deqing Sun, Sanghyuk Chun, Varun Jampani, Dongyoon Han, Byeongho Heo, Wonjae Kim, and Ming-Hsuan Yang
    2022
  4. CHI
    Speeding up Inference with User Simulators through Policy Modulation
    Hee-Seung Moon, Seungwon Do, Wonjae Kim, Jiwon Seo, Minsuk Chang, and Byungjoo Lee
    In 40th Conference on Human Factors in Computing Systems (CHI 2022), New Orleans, LA, USA, 2022
  5. ECCV
    ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO
    Sanghyuk Chun, Wonjae Kim, Song Park, Minsuk Chang, and Seong Joon Oh
    In 16th European Conference on Computer Vision (ECCV 2022), 2022
  6. BMVC
    Correlation between Alignment-Uniformity and Performance of Dense Contrastive Representations
    Jong Hak Moon, Wonjae Kim, and Edward Choi
    In 33rd British Machine Vision Conference (BMVC 2022), 2022
  7. Group Generalized Mean Pooling for Vision Transformer
    Byungsoo Ko, Han-Gyu Kim, Byeongho Heo, Sangdoo Yun, Sanghyuk Chun, Geonmo Gu, and Wonjae Kim
    2022

2021

2021

  1. ICML Long talk
    ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
    Wonjae* Kim, Bokyung* Son, and Ildoo Kim
    In 38th International Conference on Machine Learnings (ICML 2021), 18–24 jul 2021
  2. NeurIPS-W
    Conditional Generation of Periodic Signals with Fourier-Based Decoder
    Jiyoung Lee, Wonjae Kim, Daehoon Gwak, and Edward Choi
    In 34th Conference on Neural Information Processing Systems (NeurIPS 2021), 2021

2020

2020

  1. ECCV-W
    Diversified Mutual Learning for Deep Metric Learning
    Wonpyo* Park, Wonjae* Kim, Kihyun You, and Minsu Cho
    In 15th European Conference on Computer Vision (ECCV 2020, TASK-CV workshop), 2020

2019

2019

  1. NeurIPS
    Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning
    Wonjae Kim and Yoonho Lee
    In 32nd Conference on Neural Information Processing Systems (NeurIPS 2019), 2019

2018

2018

  1. Thesis
    Understanding Visualization Idioms Through Deep Visualization
    Wonjae Kim
    Seoul National University, 2018

2017

2017

  1. CHI
    ChartSense: Interactive data extraction from chart images
    Daekyoung Jung, Wonjae Kim, Hyunjoo Song, Jeong-in Hwang, Bongshin Lee, Bohyoung Kim, and Jinwook Seo
    In 35th Conference on Human Factors in Computing Systems (CHI 2017), 2017
  2. PacificVis
    SwiftTuna: Responsive and incremental visual exploration of large-scale multidimensional data
    Jaemin Jo, Wonjae Kim, Seunghoon Yoo, Bohyoung Kim, and Jinwook Seo
    In 10th IEEE Pacific Visualization Symposium (PacificVis 2017), 2017