Incremental Semantic Processing
in Dialogue Systems

Hover for details! Just like you did now, on text with dotted underlines!

What?

Incremental: Word-by-word in real-time

Semantic: capturing the meaning

Processing: Understanding and generation of language

Dialogue Systems: Conversational assistants like Siri, Alexa, etc.

Cognitive Science → Humans do it in natural dialogue!

Psycholinguistics → To handle all the natural phenomena in dialogue Pauses, corrections, turn-taking & interruptions, split-utterances, etc.

Applications (in dialogue systems):

More robust, natural ConvAI
Faster inference
Accessibility for people with dementia [?] Addlesee & Eshghi (2024). "You have interrupted me again!: making voice assistants more dementia-friendly with incremental clarification" - Frontiers in Dementia

Grammar-based → Symbolic Semantic Parsing

Dynamic Syntax (DS) Inherently incremental but no wide-coverage parser [?] Kempson et al. (2001). "Dynamic Syntax: The Flow of Language Understanding" - Blackwell Publishing
Combinatory Categorial Grammar (CCG) Not inherently incremental but has wide-coverage & incremental parsers [?] Steedman. (1996). "Surface Structure and Interpretation" - MIT Press
PL-TAG, RMRS-IP, HPSG Unfortunately discontinued. [?] Konstas & Keller, (2015). "Semantic role labeling improves incremental parsing" - ACL-IJCNLP [?] Hough et al. (2015). "Incremental semantics for dialogue processing: Requirements, and a comparison of two approaches" - IWCS [?] Ginzberg et al. (2017). "Incrementality and HPSG: Why Not?" - pre-print

E2E Neural → [Large] Language Models

RNNs & LSTMs Not SOTA [?] Hochreiter & Schmidhuber(1997). "Long short-term memory" - Neural Computation
Auto-regressive models (GPT-2, etc.) Don't have the best incremental performance compared to bi-directional models. [?] Madureira et al. (2024). "When Only Time Will Tell: Interpreting How Transformers Process Local Ambiguities Through the Lens of Restart-Incrementality" - ACL 2024
Mamba, xLSTM, RWKV These are quite new architectures and are not evaluated in an incremental setting yet. [?] Gu & Dao (2024). "Mamba: Linear-Time Sequence Modelling with Selective State Spaces" - ICLR [?] Beck et al. (2024). "xLSTM: Extended Long Short-Term Memory" - NeurIPS [?] Peng et al. (2023). "RWKV: Reinventing RNNs for the Transformer Era" - EMNLP

Bidirectional models adapted incrementally Restart-incremental is not the only way to adapt non-incremental models in an incremental fashion, but it is the most effective one. See references for more. [?] Madureira & Schlangen (2020). "Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU" - EMNLP 2020

Transformers + restart incremental Better incremental performance than ARs, but high computational cost. So no perfectly incremental neural model found yet. [?] Madureira et al. (2024). "When Only Time Will Tell: Interpreting How Transformers Process Local Ambiguities Through the Lens of Restart-Incrementality" - ACL 2024

Property\Approach	Symbolic AI	E2E Neural AI
Predictive Performance	❌	✅
Scalability	❌	✅
Cross-domain Generalisation	❌	✅
Data Efficiency	✅	❌
Model size + Inference speed	✅	❌
Compute Power	✅	❌
Reasoning + Planning	✅	❔
Compositional Generalisation	✅	❔
Interpretability + Explainability	✅	❔
Verifiability, Transparency, Control, Safety	✅	❔

Developing a wide-coverage neuro-symbolic parser for DS [?] Prange et al. (2021). "Supertagging the Long Tail with Tree-Structured Decoding of Complex Categories" - TACL [?] Kogkalidis et al. (2023). "SPINDLE: Spinning Raw Text into Lambda Terms with Graph Attention" - EACL
Comparison with incremental CCG parsing [?] Stanojevic & Steedman. (2020). "Max-Margin Incremental CCG Parsing" - ACL
Benchmarking dialogue processing Including all dialogue phenomena, open-domain vs. task-oriented, incremental & non-incremental metrics, etc.
Comprehensive (re-)evaluation of all the models! Including the newer neural architectures, comprehensive intrinsic evaluation on the above benchmark, extrinsic evaluation as natural interaction with humans.