Publications

Publications by category in reverse chronological order.

2025

  1. Call for Rigor in Reporting Quality of Instruction Tuning Data
    Hyeonseok Moon, Jaehyung Seo, and Heuiseok Lim
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, 2025
  2. Cross-Lingual Optimization for Language Transfer in Large Language Models
    Jungseob Lee, Seongtae Hong, Hyeonseok Moon, and 1 more author
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, 2025
  3. Semantic Aware Linear Transfer by Recycling Pre-trained Language Models for Cross-lingual Transfer
    Seungyoon Lee, Seongtae Hong, Hyeonseok Moon, and 1 more author
    In Findings of the Association for Computational Linguistics: ACL 2025, 2025
  4. FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language Models
    Dahyun Jung, Seungyoon Lee, Hyeonseok Moon, and 2 more authors
    In Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
  5. MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation Evaluation
    Chanhee Park, Hyeonseok Moon, Chanjun Park, and 1 more author
    In Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
  6. Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models
    Hyeonseok Moon, Jaehyung Seo, Seungyoon Lee, and 2 more authors
    In Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
  7. MIGRATE: Cross-Lingual Adaptation of Domain-Specific LLMs through Code-Switching and Embedding Transfer
    Seongtae Hong, Seungyoon Lee, Hyeonseok Moon, and 1 more author
    In Proceedings of the 31st International Conference on Computational Linguistics, Jan 2025

2024

  1. Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in Korean
    Seungyoon Lee, Chanjun Park, DaHyun Jung, and 4 more authors
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
  2. Detecting Critical Errors Considering Cross-Cultural Factors in English-Korean Translation
    Sugyeong Eo, Jungwoo Lim, Chanjun Park, and 5 more authors
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
  3. Translation of Multifaceted Data without Re-Training of Machine Translation Systems
    Hyeonseok Moon, Seungyoon Lee, SeongTae Hong, and 3 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  4. Length-aware Byte Pair Encoding for Mitigating Over-segmentation in Korean Machine Translation
    Jungseob Lee, Hyeonseok Moon, Seungjun Lee, and 6 more authors
    In Findings of the Association for Computational Linguistics: ACL 2024, Aug 2024
  5. Generative Interpretation: Toward Human-Like Evaluation for Educational Question-Answer Pair Generation
    Hyeonseok Moon, Jaewook Lee, Sugyeong Eo, and 3 more authors
    In Findings of the Association for Computational Linguistics: EACL 2024, Mar 2024
  6. Hyper-BTS Dataset: Scalability and Enhanced Analysis of Back TranScription (BTS) for ASR Post-Processing
    Chanjun Park, Jaehyung Seo, Seolhwa Lee, and 5 more authors
    In Findings of the Association for Computational Linguistics: EACL 2024, Mar 2024
  7. Exploiting Hanja-based resources in processing Korean historic documents written by common literati
    Hyeonseok Moon, Myunghoon Kang, Jaehyung Seo, and 4 more authors
    IEEE Access, Mar 2024

2023

  1. KEBAP: Korean Error Explainable Benchmark Dataset for ASR and Post-processing
    Seonmin Koo, Chanjun Park, Jinsung Kim, and 4 more authors
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023
  2. Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations
    Yoonna Jang, Suhyune Son, Jeongwoo Lee, and 6 more authors
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023
  3. CHEF in the Language Kitchen: A Generative Data Augmentation Leveraging Korean Morpheme Ingredients
    Jaehyung Seo, Hyeonseok Moon, Jaewook Lee, and 3 more authors
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023
  4. Informative Evidence-guided Prompt-based Fine-tuning for English-Korean Critical Error Detection
    Dahyun Jung, Sugyeong Eo, Chanjun Park, and 3 more authors
    In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Nov 2023
  5. Improving formality-sensitive machine translation using data-centric approaches and prompt engineering
    Seungjun Lee, Hyeonseok Moon, Chanjun Park, and 1 more author
    In Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), Jul 2023
  6. PEEP-Talk: A Situational Dialogue-based Chatbot for English Education
    Seungjun Lee, Yoonna Jang, Chanjun Park, and 7 more authors
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), Jul 2023
  7. Towards Diverse and Effective Question-Answer Pair Generation from Children Storybooks
    Sugyeong Eo, Hyeonseok Moon, Jinsung Kim, and 6 more authors
    In Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
  8. Doubts on the reliability of parallel corpus filtering
    Hyeonseok Moon, Chanjun Park, Seonmin Koo, and 8 more authors
    Expert Systems with Applications, Jul 2023
  9. Uncovering the Risks and Drawbacks Associated with the Use of Synthetic Data for Grammatical Error Correction
    Seonmin Koo, Chanjun Park, Seolhwa Lee, and 4 more authors
    IEEE Access, Jul 2023
  10. A Survey on Evaluation Metrics for Machine Translation
    Seungjun Lee, Jungseob Lee, Hyeonseok Moon, and 5 more authors
    Mathematics, Jul 2023

2022

  1. QUAK: A Synthetic Quality Estimation Dataset for Korean-English Neural Machine Translation
    Sugyeong Eo, Chanjun Park, Hyeonseok Moon, and 4 more authors
    In Proceedings of the 29th International Conference on Computational Linguistics, Oct 2022
  2. A dog is passing over the jet? A text-generation dataset for Korean commonsense reasoning and evaluation
    Jaehyung Seo, Seounghoon Lee, Chanjun Park, and 5 more authors
    In Findings of the Association for Computational Linguistics: NAACL 2022, Jul 2022
  3. Empirical Analysis of Noising Scheme based Synthetic Data Generation for Automatic Post-editing
    Hyeonseok Moon, Chanjun Park, Seolhwa Lee, and 4 more authors
    In Proceedings of the Thirteenth Language Resources and Evaluation Conference, Jun 2022
  4. Priming Ancient Korean Neural Machine Translation
    Chanjun Park, Seolhwa Lee, Jaehyung Seo, and 3 more authors
    In Proceedings of the Thirteenth Language Resources and Evaluation Conference, Jun 2022
  5. PU-GEN: Enhancing generative commonsense reasoning for language models with human-centered knowledge
    Jaehyung Seo, Dongsuk Oh, Sugyeong Eo, and 5 more authors
    Knowledge-Based Systems, Jul 2022
  6. K-NCT: Korean neural grammatical error correction gold-standard test set using novel error type classification criteria
    Seonmin Koo, Chanjun Park, Jaehyung Seo, and 4 more authors
    IEEE Access, Jul 2022
  7. Plain Template Insertion: Korean-Prompt-Based Engineering for Few-Shot Learners
    Jaehyung Seo, Hyeonseok Moon, Chanhee Lee, and 5 more authors
    IEEE Access, Jul 2022
  8. BERTOEIC: Solving TOEIC Problems Using Simple and Efficient Data Augmentation Techniques with Pretrained Transformer Encoders
    Jeongwoo Lee, Hyeonseok Moon, Chanjun Park, and 3 more authors
    Applied Sciences, Jul 2022
  9. Empirical Analysis of Parallel Corpora and In-Depth Analysis Using LIWC
    Chanjun Park, Midan Shim, Sugyeong Eo, and 4 more authors
    Applied Sciences, Jul 2022
  10. AI for Patents: A Novel Yet Effective and Efficient Framework for Patent Analysis
    Junyoung Son, Hyeonseok Moon, Jeongwoo Lee, and 4 more authors
    IEEE Access, Jul 2022
  11. Return on Advertising Spend Prediction with Task Decomposition-Based LSTM Model
    Hyeonseok Moon, Taemin Lee, Jaehyung Seo, and 7 more authors
    Mathematics, Jul 2022
  12. Word-level quality estimation for Korean-English neural machine translation
    Sugyeong Eo, Chanjun Park, Hyeonseok Moon, and 2 more authors
    IEEE Access, Jul 2022
  13. Dense-to-question and sparse-to-answer: Hybrid retriever system for industrial frequently asked questions
    Jaehyung Seo, Taemin Lee, Hyeonseok Moon, and 7 more authors
    Mathematics, Jul 2022
  14. Mimicking Infants’ Bilingual Language Acquisition for Domain Specialized Neural Machine Translation
    Chanjun Park, Woo-Young Go, Sugyeong Eo, and 3 more authors
    IEEE Access, Jul 2022
  15. An automatic post editing with efficient and simple data generation method
    Hyeonseok Moon, Chanjun Park, Jaehyung Seo, and 2 more authors
    IEEE Access, Jul 2022

2021

  1. Should we find another model?: Improving neural machine translation performance with ONE-piece tokenization method without model modification
    Chanjun Park, Sugyeong Eo, Hyeonseok Moon, and 1 more author
    In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, Jul 2021
  2. An empirical study on automatic post editing for neural machine translation
    Hyeonseok Moon, Chanjun Park, Sugyeong Eo, and 2 more authors
    IEEE Access, Jul 2021
  3. Comparative analysis of current approaches to quality estimation for neural machine translation
    Sugyeong Eo, Chanjun Park, Hyeonseok Moon, and 2 more authors
    Applied Sciences, Jul 2021