Multilingual NLP

Developing and Utilizing a Large-Scale Cantonese Dataset for Multi-Tasking in Large Language Models

EMNLP 2025 Findings paper on developing a large-scale Cantonese dataset for LLM multi-tasking.

jiyue-jiang

MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models

ACL 2025 Findings paper on multilingual confidence estimation in LLMs.

boyang-xue
How Well Do LLMs Handle Cantonese? Benchmarking Cantonese Capabilities of Large Language Models featured image

How Well Do LLMs Handle Cantonese? Benchmarking Cantonese Capabilities of Large Language Models

NAACL 2025 Findings paper benchmarking Cantonese capabilities of LLMs.

jiyue-jiang