Qingkai Min | publications

You can also check my Google Scholar page.

Conference Papers

IJCAI

Dialogue State Induction Using Neural Latent Variable Models

Qingkai Min, Libo Qin, Zhiyang Teng, Xiao Liu, and Yue Zhang

In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI-20 2020

Abs PDF code poster slides

Dialogue state modules are a useful component in a task-oriented dialogue system. Traditional methods find dialogue states by manually labeling training corpora, upon which neural models are trained. However, the labeling process can be costly, slow, error-prone, and more importantly, cannot cover the vast range of domains in real-world dialogues for customer service. We propose the task of dialogue state induction, building two neural latent variable models that mine dialogue states automatically from unlabeled customer service dialogue records. Results show that the models can effectively find meaningful dialogue states. In addition, equipped with induced dialogue states, a state-of-the-art dialogue system gives better performance compared with not using a dialogue state module.
EMNLP

A Pilot Study for Chinese SQL Semantic Parsing

Qingkai Min, Yuefeng Shi, and Yue Zhang

In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) 2019

Abs PDF code poster slides

The task of semantic parsing is highly useful for dialogue and question answering systems. Many datasets have been proposed to map natural language text into SQL, among which the recent Spider dataset provides cross-domain samples with multiple tables and complex queries. We build a Spider dataset for Chinese, which is currently a low-resource language in this task area. Interesting research questions arise from the uniqueness of the language, which requires word segmentation, and also from the fact that SQL keywords and columns of DB tables are typically written in English. We compare character- and word-based encoders for a semantic parser, and different embedding schemes. Results show that word-based semantic parser is subject to segmentation errors and cross-lingual word embeddings are useful for text-to-SQL.

Note: We start the Chinese semantic parsing and Text-to-SQL challenge: CSpider 1.0.

publications

Conference Papers

Journals

Workshop Papers

Theses