publications

2025

  1. ACL
    RUBRIC-MQM: Span-Level LLM-as-judge in Machine Translation For High-End Models
    Ahrii Kim
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), Jul 2025
  2. preprint
    Multi-agentMT: Deploying AI Agent in the WMT25 Shared Task (accepted at WMT 2025)
    Ahrii Kim
    TechRxiv, Aug 2025
  3. preprint
    Context is Ubiquitous, but Rarely Changes Judgments: Revisiting Document-Level MT Evaluation (accepted at WMT 2025)
    Ahrii Kim
    TechRxiv, Aug 2025
  4. preprint
    FALCON: Holistic Framework for Document-Level Machine Translation Evaluation
    Ahrii Kim
    TechRxiv, May 2025
  5. preprint
    IR_Multi-AgentMT at WMT25 Translation Task: A Summary (accepted at WMT 2025)
    Ahrii Kim
    TechRxiv, Jul 2025

2023

  1. preprint
    The Suboptimal WMT Test Sets and Its Impact on Human Parity
    Ahrii Kim, Yunju Bak, Jimin Sun, and 2 more authors
    Preprints, Feb 2023

2022

  1. ACL
    Vacillating Human Correlation of SacreBLEU in Unprotected Languages
    Ahrii Kim and Jinhyeon Kim
    In Proceedings of the 2nd Workshop on Human Evaluation of NLP Systems (HumEval), May 2022

2020

  1. Journal
    Human Evaluation of NMT & Annual Progress Report: A Case Study on Spanish to Korean
    Ahrii Kim and Carme Colominas
    Revista Tradumàtica. Tecnologies de la Traducció, Dec 2020