Dongryeol Lee
Ph.D. Student, Machine Intelligence Lab, Seoul National University ECE
Hi! I am a Ph.D. student at Seoul National University ECE. I am fortunate to be advised by Prof. Kyomin Jung.
My research interest is broadly in machine learning and natural language processing. I am particularly interested in Question Answering and fair evaluation metrics for large language models.
Education
Seoul National University
Ph.D. in Electrical and Computer Engineering (2021 – present)
Advisor: Prof. Kyomin Jung
Ph.D. in Electrical and Computer Engineering (2021 – present)
Advisor: Prof. Kyomin Jung
Seoul National University
B.S. in Naval Architecture and Ocean Engineering (2014 – 2021)
B.S. in Naval Architecture and Ocean Engineering (2014 – 2021)
Work Experience
Amazon
Applied Scientist Intern (Sept 2025 – Feb 2026)
Host: Saab Mansour
Applied Scientist Intern (Sept 2025 – Feb 2026)
Host: Saab Mansour
news
| Dec, 2025 | Paper accepted at EACL 2026 (Findings): Don’t Judge Code by Its Cover: Exploring Biases in LLM Judges for Code Evaluation |
|---|---|
| Nov, 2025 | Paper accepted at NeurIPS 2025: Program Synthesis via Test-Time Transduction |
| Oct, 2025 | Two papers accepted at EMNLP 2025: Fooling the LVLM Judges (main) and Can You Trick the Grader? (Findings) |
| Aug, 2025 | I have joined Amazon as an Applied Scientist Intern. |
| May, 2025 | Four papers accepted at NAACL 2025 (2 Oral, 2 Findings): EMBER, MoC, VLind-Bench, and Summary-Guided Decoding |
| Dec, 2024 | Paper accepted at COLING 2025 (Oral): Return of EM: Entity-driven Answer Set Expansion for QA Evaluation |
selected publications
- EACLDon’t Judge Code by Its Cover: Exploring Biases in LLM Judges for Code EvaluationIn Findings of the Association for Computational Linguistics: EACL 2026, 2026
- NeurIPSProgram Synthesis via Test-Time TransductionIn Advances in Neural Information Processing Systems (NeurIPS), 2025
- EMNLPFooling the LVLM Judges: Visual Biases in LVLM-Based EvaluationIn Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
- EMNLPCan You Trick the Grader? Adversarial Persuasion of LLM JudgesIn Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
- NAACL OralGenerating Diverse Hypotheses for Inductive ReasoningIn Proceedings of the 2025 Annual Conference of the Nations of the Americas Chapter of the ACL (NAACL), 2025
- NAACLVLind-Bench: Measuring Language Priors in Large Vision-Language ModelsIn Findings of the Association for Computational Linguistics: NAACL 2025, 2025
- NAACLMitigating Hallucinations in Large Vision-Language Models via Summary-Guided DecodingIn Findings of the Association for Computational Linguistics: NAACL 2025, 2025