Publications
publications by categories in reversed chronological order.
2025
-
Metric Calculating Benchmark: Complicate Instruction Following Benchmark for Large Language ModelsIn The 2025 Conference on Empirical Methods in Natural Language Processing, 2025 -
LimaCost: Data Valuation for Instruction Tuning of Large Language ModelsIn Findings of the Association for Computational Linguistics: EMNLP 2025, 2025 -
The Impact of Negated Text on Hallucination with Large Language ModelsIn The 2025 Conference on Empirical Methods in Natural Language Processing, 2025 -
Call for Rigor in Reporting Quality of Instruction Tuning DataIn Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Jul 2025 -
Cross-Lingual Optimization for Language Transfer in Large Language ModelsIn Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Jul 2025 -
Semantic Aware Linear Transfer by Recycling Pre-trained Language Models for Cross-lingual TransferIn Findings of the Association for Computational Linguistics: ACL 2025, Jul 2025 -
FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language ModelsIn Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025 -
MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation EvaluationIn Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025 -
Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language ModelsIn Findings of the Association for Computational Linguistics: NAACL 2025, Apr 2025 -
MIGRATE: Cross-Lingual Adaptation of Domain-Specific LLMs through Code-Switching and Embedding TransferIn Proceedings of the 31st International Conference on Computational Linguistics, Jan 2025
2024
-
Leveraging Pre-existing Resources for Data-Efficient Counter-Narrative Generation in KoreanIn Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024 -
Detecting Critical Errors Considering Cross-Cultural Factors in English-Korean TranslationIn Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024 -
Generative Interpretation: Toward Human-Like Evaluation for Educational Question-Answer Pair GenerationIn Findings of the Association for Computational Linguistics: EACL 2024, Mar 2024 -
Exploiting hanja-based resources in processing korean historic documents written by common literatiIEEE Access, Mar 2024
2023
-
KEBAP: Korean Error Explainable Benchmark Dataset for ASR and Post-processingIn Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Mar 2023 -
Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded ConversationsIn Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Mar 2023 -
CHEF in the Language Kitchen: A Generative Data Augmentation Leveraging Korean Morpheme IngredientsIn The 2023 Conference on Empirical Methods in Natural Language Processing, Mar 2023 -
Informative Evidence-guided Prompt-based Fine-tuning for English-Korean Critical Error DetectionIn Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), Mar 2023 -
Improving formality-sensitive machine translation using data-centric approaches and prompt engineeringIn Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), Mar 2023 -
PEEP-Talk: A Situational Dialogue-based Chatbot for English EducationIn Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), Mar 2023 -
Doubts on the reliability of parallel corpus filteringExpert Systems with Applications, Jul 2023 -
Uncovering the Risks and Drawbacks Associated with the Use of Synthetic Data for Grammatical Error CorrectionIEEE Access, Jul 2023 -
2022
-
QUAK: A Synthetic Quality Estimation Dataset for Korean-English Neural Machine TranslationIn Proceedings of the 29th International Conference on Computational Linguistics, Jul 2022 -
A dog is passing over the jet? a text-generation dataset for korean commonsense reasoning and evaluationIn Findings of the Association for Computational Linguistics: NAACL 2022, Jul 2022 -
Empirical Analysis of Noising Scheme based Synthetic Data Generation for Automatic Post-editingIn Proceedings of the Thirteenth Language Resources and Evaluation Conference, Jul 2022 -
Priming Ancient Korean Neural Machine TranslationIn Proceedings of the Thirteenth Language Resources and Evaluation Conference, Jul 2022 -
PU-GEN: Enhancing generative commonsense reasoning for language models with human-centered knowledgeKnowledge-Based Systems, Jul 2022 -
K-nct: Korean neural grammatical error correction gold-standard test set using novel error type classification criteriaIEEE Access, Jul 2022 -
Plain Template Insertion: Korean-Prompt-Based Engineering for Few-Shot LearnersIEEE Access, Jul 2022 -
BERTOEIC: Solving TOEIC Problems Using Simple and Efficient Data Augmentation Techniques with Pretrained Transformer EncodersApplied Sciences, Jul 2022 -
Empirical Analysis of Parallel Corpora and In-Depth Analysis Using LIWCApplied Sciences, Jul 2022 -
AI for Patents: A Novel Yet Effective and Efficient Framework for Patent AnalysisIEEE Access, Jul 2022 -
Return on Advertising Spend Prediction with Task Decomposition-Based LSTM ModelMathematics, Jul 2022 -
Word-level quality estimation for korean-english neural machine translationIEEE Access, Jul 2022 -
Dense-to-question and sparse-to-answer: Hybrid retriever system for industrial frequently asked questionsMathematics, Jul 2022 -
Mimicking Infants’ Bilingual Language Acquisition for Domain Specialized Neural Machine TranslationIEEE Access, Jul 2022 -
An automatic post editing with efficient and simple data generation methodIEEE Access, Jul 2022
2021
-
Should we find another model?: Improving neural machine translation performance with ONE-piece tokenization method without model modificationIn Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, Jul 2021 -
An empirical study on automatic post editing for neural machine translationIEEE Access, Jul 2021 -
Comparative analysis of current approaches to quality estimation for neural machine translationApplied Sciences, Jul 2021