Xiang Dai (戴翔)

[CV-full]     [CV-1 page]

I am currently a research scientist at CSIRO, Australia’s national science agency.

My main research interest lies in natural language processing. My earlier work focused on information extraction, especially identifying entities (e.g., ACL 20) and relations from unstructured text and linking these entities to corresponding ontologies. In the medical domain, one example is active surveillance in pharmacovigilance: I developed models to detect descriptions of adverse drug events in patient‑written online posts. I also created datasets (e.g., JBI 24) and organised shared tasks (e.g., ALTA 25) on mapping these layman descriptions to their standard terms in MedDRA, a medical dictionary used for regulatory purposes.

These days, my research is more centered on building cost‑effective domain‑specific large language models (e.g., EMNLP 20) that excel at understanding specialized text, such as clinical notes, scientific publications, and technical documents. I also work on human‑centric AI (e.g., TOCHI 23), building systems that adapt to a user’s background and specific information needs. These systems help people make informed decisions (for example, enabling a biologist to navigate scientific literature and simulation results to decide which genes to investigate for a particular trait) or follow instructions (for example, helping a patient understand their health records so they can follow their health advice).

A full list of my publications can be found on the Publications page or on Google scholar.

News

Email

dai (dot) xiang (dot) au (at) gmail.com

dai (dot) dai (at) csiro.au

Calendar