Xiang Dai (戴翔)

[CV-full]     [CV-1 page]

I am currently a research scientist at CSIRO, Australia’s national science agency.

My main research interest lies in natural language processing. My earlier work focused on information extraction, especially identifying entities (e.g., ACL 20) and relations from domain-specific text and linking these entities to corresponding ontologies. One example application of information extraction in the medical domain is active surveillance in pharmacovigilance: I developed models to detect descriptions of adverse drug events in patient‑written posts. I also created datasets (e.g., JBI 24) and organised shared tasks (e.g., ALTA 25) on mapping these layman descriptions to their standard terms in MedDRA, a medical dictionary used for regulatory purposes.

These days, my research is more centered on building cost‑effective domain‑specific large language models (e.g., EMNLP 20) that excel at understanding specialized text, such as clinical notes, scientific publications, and technical documents. I also work on human‑centric AI (e.g., TOCHI 23), building systems that adapt to a user’s background and specific information needs. These systems help people make informed decisions (for example, enabling a biologist to navigate scientific literature and simulation results to decide which genes to investigate for a particular trait) or follow instructions (for example, helping a patient understand their health records so they can follow their health advice).

I also work with students and Postdocs on topics including digital health, plain language summarization, multimodal learning, human–robot interaction, and AI agent evaluation.

A full list of my publications can be found on the Publications page or on Google scholar.

News

Email

dai (dot) xiang (dot) au (at) gmail.com

dai (dot) dai (at) csiro.au

Calendar