publications

* denotes equal contribution

2024

  1. Investigating How Large Language Models Leverage Internal Knowledge to Perform Complex Reasoning
    Miyoung Ko* , Sue Hyun Park* , Joonsuk Park , Minjoon Seo
    arXiv preprint 2024
  2. The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
    Seungone Kim , Juyoung Suk , Ji Yong Cho , Shayne Longpre , Chaeeun Kim , Dongkeun Yoon , Guijin Son , Yejin Cho , Sheikh Shafayat , Jinheon Baek , Sue Hyun Park , 21 more authors
    arXiv preprint 2024
  3. Aligning to Thousands of Preferences via System Message Generalization
    Seongyun Lee* , Sue Hyun Park* , Seungone Kim , Minjoon Seo
    ACL 2024 Workshop ConvAI (Oral)
  4. Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation
    Seongyun Lee , Seungone Kim , Sue Park , Geewook Kim , Minjoon Seo
    ACL 2024 Findings
  5. Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision
    Seongyun Lee , Sue Hyun Park , Yongrae Jo , Minjoon Seo
    NAACL 2024