Publications

An Analysis of Simple Data Augmentation for Named Entity Recognition

Xiang Dai and Heike Adel
International Conference on Computational Linguistics (COLING 2020). Code.

Cost-effective Selection of Pretraining Data: A Case Study of Pretraining BERT on Social Media

Xiang Dai, Sarvnaz Karimi, Ben Hachey and Cecile Paris
Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). Resources, Bibtex.

NLNDE at CANTEMIST: Neural Sequence Labeling and Parsing Approaches for Clinical Concept Extraction

Lukas Lange, Xiang Dai, Heike Adel, Jannik Strötgen
Iberian Languages Evaluation Forum (IberLEF 2020).

An Effective Transition-based Model for Discontinuous NER

Xiang Dai, Sarvnaz Karimi, Ben Hachey and Cecile Paris
The Annual Meeting of the Association for Computational Linguistics (ACL 2020). Code, Talk, Bibtex.

NNE: A Dataset for Nested Named Entity Recognition in English Newswire

Nicky Ringland, Xiang Dai, Ben Hachey, Sarvnaz Karimi, Cecile Paris and James R. Curran
The Annual Meeting of the Association for Computational Linguistics (ACL 2019). Resources, Poster, Bibtex.

Using Similarity Measures to Select Pretraining Data for NER

Xiang Dai, Sarvnaz Karimi, Ben Hachey, Cecile Paris
Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2019). Slides, Poster, Resources, Code, Bibtex.

Shot Or Not: Comparison of NLP Approaches for Vaccination Behaviour Detection

Aditya Joshi, Xiang Dai, Sarvnaz Karimi, Ross Sparks, Cecile Paris, C Raina MacIntyre
EMNLP Workshop on Social Media Mining for Health Applications (SMM4H 2018). Bibtex.

Recognizing Complex Entity Mentions: A Review and Future Directions

Xiang Dai
ACL Student Research Workshop (ACL-SRW 2018). Poster, Bibtex.

Medication and Adverse Event Extraction from Noisy Text

Xiang Dai, Sarvnaz Karimi, Cecile Paris
Australasian Language Technology Association Workshop (ALTA 2017). Slides, Bibtex.

Automatic Diagnosis Coding of Radiology Reports: A Comparison of Deep Learning and Conventional Classification Methods

Sarvnaz Karimi, Xiang Dai, Hamed Hassanzadeh, Anthony Nguyen
ACL Workshop on Biomedical Natural Language Processing (BioNLP 2017). Poster, Bibtex.