Posts

2026

26 Jan: Low-resource language processing (lessons from HG2051)
19 Jan: Blog Post Generator (tutorial)

2025

22 May: The taggedPBC - a massive dataset for crosslinguistic investigations

2024

01 Feb: Some notes on LLMs in real-world contexts (Part 1)

2023

18 Nov: Website redesign (markdown + pandoc)
08 Jun: Teaching 'Language and the Computer'

2022

23 Jan: In case you hadn't noticed...

2020

14 May: Automating the coding of implicit motives (Paper announcement)

2019

16 Sep: Computer-assisted syntactic reconstruction
12 Sep: ICAAL 7 Proceedings volume

About

I am a linguist based in Singapore. My interests include Language Documentation and Description, Natural Language Processing, Historical Linguistics, and Language Change, among other things. This site is currently under construction, so please bear with me as I get it working again.

Research

For my PhD (2015) I wrote a Grammar of Pnar, based on extensive texts transcribed from stories told by native speakers in and around Jowai, Meghalaya, India. Since then I have conducted fieldwork on a number of other Austroasiatic languages in Thailand and Myanmar, and continue to be involved in historical linguistic research within the phylum. My other strand of research involves training text classifiers, most recently for automating implicit motive coding, though I have also developed LLMs for other use cases.

Courses

I currently teach several courses at Nanyang Technological University, Singapore, at both the undergraduate and master's levels, including:

  • Language and the Computer
  • Anthropological Linguistics
  • Languages of the World
  • Sociolinguistic Research Methods