Predicting gender and age categories in English conversations using lexical, non-lexical, and turn-taking features

This paper examines gender and age salience and (stereo)typicality in British English talk with the aim of predicting gender and age categories based on lexical, phrasal and turn-taking features.

NameSpec asks What's Your Name in Chinese? A Voice Bot to Specify Chinese Personal Names through Dialog

We present NameSpec, a voice interface that specifies Mandarin Chinese personal names with the user through dialog. Chinese personal names can contain thousands of different, sometimes rare, characters, many of which are homophones that require …

Profiling Chinese Cause-effect Constructions with rang (讓), shi (使) and ling (令) Using Frame Semantic Features

This behavioural profiling (BP) study examines the use of rang (讓), shi (使) and ling (令) as three ways to express cause-effect relationships. The study investigates the intricate semantic variation of rang, shi and ling through analysing 103 …

Towards distributed representations of building blocks of talk-in-interaction. A data-driven, frame-based method to describe discourse-interactional properties of the janwai construction in Cantonese Chinese talk

Cantonese turn-initial minimal particles: annotation of discourse-interactional functions in dialog corpora

This interactional linguistic study is concerned with the annotation of discourse-interactional functions of turn-initial particles in Cantonese conversation. These particles (or interjections) are commonly transcribed as ngo (哦), ng (嗯), aa (啊), aak …

Action formation with janwai in Cantonese Chinese conversation

MYCanCor: A Video Corpus of spoken Malaysian Cantonese

The Malaysia Cantonese Corpus (MYCanCor) is a collection of recordings of Malaysian Cantonese speech mainly collected in Perak, Malaysia. The corpus consists of around 20 hours of video recordings of spontaneous talk-in-interaction (56 settings) …

The Use of Janwai in the Management of Disagreement in Malaysian Cantonese Conversation. Evidence from MYCanCor Corpus

Project notes on building a conversation parser on top of a text parser: Towards a causal language tagger for spoken Chinese

This ongoing doctoral study examines cause and effect relationships in Chinese spoken language corpora and aims to build a tagger (Cause-Chi) that automatically annotates linguistic patterns used to express these relationships. Drawing on insights …

Extracting construction networks from Cantonese speech corpora using clustering algorithms

Language is constructions “all the way down”, concludes Adele Goldberg (2006:18). Starting from this key insight associated with construction grammar (CxG), this study addresses an issue that is a result of CxG’s assumption that “the network of …