Publications
publications by categories in reversed chronological order.
2025
- Call for Rigor in Reporting Quality of Instruction Tuning DataIn Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, 2025
- Cross-Lingual Optimization for Language Transfer in Large Language ModelsIn Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, 2025
- Semantic Aware Linear Transfer by Recycling Pre-trained Language Models for Cross-lingual TransferIn Findings of the Association for Computational Linguistics: ACL 2025, 2025
- FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language ModelsIn Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
- MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation EvaluationIn Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
- Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language ModelsIn Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025
- MIGRATE: Cross-Lingual Adaptation of Domain-Specific LLMs through Code-Switching and Embedding TransferIn Proceedings of the 31st International Conference on Computational Linguistics, Jan 2025
2024
- Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in KoreanIn Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
- Detecting Critical Errors Considering Cross-Cultural Factors in English-Korean TranslationIn Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
- Generative Interpretation: Toward Human-Like Evaluation for Educational Question-Answer Pair GenerationIn Findings of the Association for Computational Linguistics: EACL 2024, Mar 2024
- Exploiting hanja-based resources in processing korean historic documents written by common literatiIEEE Access, Mar 2024
2023
- KEBAP: Korean Error Explainable Benchmark Dataset for ASR and Post-processingIn Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Mar 2023
- Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded ConversationsIn Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Mar 2023
- CHEF in the Language Kitchen: A Generative Data Augmentation Leveraging Korean Morpheme IngredientsIn The 2023 Conference on Empirical Methods in Natural Language Processing, Mar 2023
- Informative Evidence-guided Prompt-based Fine-tuning for English-Korean Critical Error DetectionIn Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Mar 2023
- Improving formality-sensitive machine translation using data-centric approaches and prompt engineeringIn Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), Mar 2023
- PEEP-Talk: A Situational Dialogue-based Chatbot for English EducationIn Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), Mar 2023
- Doubts on the reliability of parallel corpus filteringExpert Systems with Applications, Jul 2023
- Uncovering the Risks and Drawbacks Associated with the Use of Synthetic Data for Grammatical Error CorrectionIEEE Access, Jul 2023
-
2022
- QUAK: A Synthetic Quality Estimation Dataset for Korean-English Neural Machine TranslationIn Proceedings of the 29th International Conference on Computational Linguistics, Jul 2022
- A dog is passing over the jet? a text-generation dataset for korean commonsense reasoning and evaluationIn Findings of the Association for Computational Linguistics: NAACL 2022, Jul 2022
- Empirical Analysis of Noising Scheme based Synthetic Data Generation for Automatic Post-editingIn Proceedings of the Thirteenth Language Resources and Evaluation Conference, Jul 2022
- Priming Ancient Korean Neural Machine TranslationIn Proceedings of the Thirteenth Language Resources and Evaluation Conference, Jul 2022
- PU-GEN: Enhancing generative commonsense reasoning for language models with human-centered knowledgeKnowledge-Based Systems, Jul 2022
- K-nct: Korean neural grammatical error correction gold-standard test set using novel error type classification criteriaIEEE Access, Jul 2022
- Plain Template Insertion: Korean-Prompt-Based Engineering for Few-Shot LearnersIEEE Access, Jul 2022
- BERTOEIC: Solving TOEIC Problems Using Simple and Efficient Data Augmentation Techniques with Pretrained Transformer EncodersApplied Sciences, Jul 2022
- Empirical Analysis of Parallel Corpora and In-Depth Analysis Using LIWCApplied Sciences, Jul 2022
- AI for Patents: A Novel Yet Effective and Efficient Framework for Patent AnalysisIEEE Access, Jul 2022
- Return on Advertising Spend Prediction with Task Decomposition-Based LSTM ModelMathematics, Jul 2022
- Word-level quality estimation for korean-english neural machine translationIEEE Access, Jul 2022
- Dense-to-question and sparse-to-answer: Hybrid retriever system for industrial frequently asked questionsMathematics, Jul 2022
- Mimicking Infants’ Bilingual Language Acquisition for Domain Specialized Neural Machine TranslationIEEE Access, Jul 2022
- An automatic post editing with efficient and simple data generation methodIEEE Access, Jul 2022
2021
- Should we find another model?: Improving neural machine translation performance with ONE-piece tokenization method without model modificationIn Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, Jul 2021
- An empirical study on automatic post editing for neural machine translationIEEE Access, Jul 2021
- Comparative analysis of current approaches to quality estimation for neural machine translationApplied Sciences, Jul 2021