<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Simple Data Augmentation for Multilingual NLU in Task Oriented Dialogue Systems</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Samuel</forename><surname>Louvan</surname></persName>
							<email>slouvan@fbk.eu</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Trento</orgName>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">Fondazione Bruno Kessler</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Bernardo</forename><surname>Magnini</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution">Fondazione Bruno Kessler</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Simple Data Augmentation for Multilingual NLU in Task Oriented Dialogue Systems</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">12248AC51F5ADECED063A69E62A5B111</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-19T15:40+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Data augmentation has shown potential in alleviating data scarcity for Natural Language Understanding (e.g. slot filling and intent classification) in task-oriented dialogue systems. As prior work has mostly been conducted on English datasets, we focus on five different languages and consider a setting where limited data are available. We investigate the effectiveness of non-gradient based augmentation methods, involving simple text span substitutions and syntactic manipulations. Our experiments show that (i) augmentation is effective in all cases, particularly for slot filling; and (ii) it is beneficial for a joint intent-slot model based on multilingual BERT, both in limited data settings and when full training data is used.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Natural Language Understanding (NLU) in task-oriented dialogue systems is responsible for parsing user utterances to extract the intent of the user and the arguments of the intent (i.e. slots) into a semantic representation, typically a semantic frame <ref type="bibr" target="#b19">(Tur and De Mori, 2011)</ref>. For example, the utterance "Play Jeff Pilson on Youtube" has the intent PLAYMUSIC and "Youtube" as the value for the slot SERVICE. As more skills are added to the dialogue system, the NLU model frequently needs to be updated to scale to new domains and languages, a situation which typically becomes problematic when labeled data are limited (data scarcity).</p><p>One way to combat data scarcity is through data augmentation (DA) techniques performing label-preserving operations to produce auxiliary training data. Recently, DA has shown potential in tasks such as machine translation <ref type="bibr" target="#b4">(Fadaee et al., 2017)</ref>, constituency and dependency parsing <ref type="bibr" target="#b16">(Şahin and Steedman, 2018;</ref><ref type="bibr" target="#b21">Vania et al., 2019)</ref>, and text classification <ref type="bibr" target="#b23">(Wei and Zou, 2019;</ref><ref type="bibr" target="#b10">Kumar et al., 2020)</ref>. As for slot filling (SF) and intent classification (IC), a number of DA methods have been proposed to generate synthetic utterances using sequence-to-sequence models <ref type="bibr" target="#b8">(Hou et al., 2018;</ref><ref type="bibr" target="#b25">Zhao et al., 2019</ref><ref type="bibr">), Conditional Variational Auto Encoder (Yoo et al., 2019)</ref>, or pre-trained NLG models <ref type="bibr" target="#b14">(Peng et al., 2020)</ref>. 
To date, most of the DA methods have been evaluated on English, and it is not clear whether the same findings apply to other languages.</p><p>In this paper, we study the effectiveness of DA on several non-English datasets for NLU in task-oriented dialogue systems. We experiment with existing lightweight, non-gradient based DA methods from <ref type="bibr" target="#b11">Louvan and Magnini (2020)</ref> that produce varied slot values through substitution and manipulate sentence structure by leveraging syntactic information from a dependency parser. We evaluate the DA methods on NLU datasets from five languages: Italian, Hindi, Turkish, Spanish, and Thai. The contributions of our paper are as follows: 1. We assess the applicability of DA methods for NLU in task-oriented dialogue systems in five languages. 2. We demonstrate that simple DA can improve performance on all languages despite their different characteristics. 3. We show that a large pre-trained multilingual BERT (M-BERT) <ref type="bibr" target="#b3">(Devlin et al., 2019)</ref> can still benefit from DA, in particular for slot filling.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Slot Filling and Intent Classification</head><p>The NLU component of a task-oriented dialogue system is responsible for parsing a user utterance into a semantic representation, such as a semantic frame. Given an input utterance of n tokens, x = (x_1, x_2, ..., x_n), the system needs to assign a particular intent y_intent to the whole utterance x and the corresponding slots mentioned in the utterance, y_slot = (y_slot_1, y_slot_2, ..., y_slot_n). In practice, IC is typically modeled as text classification and SF as a sequence tagging problem. As an example, for the utterance "Play Jeff Pilson on Youtube", y_intent is PLAYMUSIC, as the intent of the user is to ask the system to play a song by a musician, and y_slot = (O, B-ARTIST, I-ARTIST, O, B-SERVICE), in which the artist is "Jeff Pilson" and the service is "Youtube". Slot labels are in BIO format: B indicates the start of a slot span, I the inside of a span, while O denotes that the word does not belong to any slot. Recent approaches for SF and IC are based on neural network methods that model SF and IC jointly <ref type="bibr" target="#b6">(Goo et al., 2018;</ref><ref type="bibr" target="#b1">Chen et al., 2019)</ref> by sharing model parameters between the two tasks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Data Augmentation (DA) Methods</head><p>DA aims to perform semantically preserving transformations on the training data D to produce auxiliary data D′. The union of D and D′ is then used to train a particular NLU model. For each utterance in D, we produce N augmented utterances by applying a specific augmentation operation. We adopt a subset of existing augmentation methods from <ref type="bibr" target="#b11">Louvan and Magnini (2020)</ref>, which have shown promising results on English datasets. We describe the augmentation operations in the following sections.</p></div>
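As a concrete illustration of the BIO scheme, the sketch below decodes a tag sequence into (slot, value) pairs for the running example; the function name and implementation are ours, not from the paper's code.

```python
# Minimal sketch (not the paper's code): decode BIO slot tags into
# (label, value) pairs for the example "Play Jeff Pilson on Youtube".
def bio_to_slots(tokens, tags):
    """Collect contiguous B-/I- spans into (slot label, slot value) pairs."""
    slots, span, label = [], [], None
    for tok, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            if span:                      # close the previous span
                slots.append((label, " ".join(span)))
            span, label = [tok], tag[2:]
        elif tag.startswith("I-") and label == tag[2:]:
            span.append(tok)              # continue the current span
        else:
            if span:                      # an O tag closes the open span
                slots.append((label, " ".join(span)))
            span, label = [], None
    if span:                              # close a span ending the utterance
        slots.append((label, " ".join(span)))
    return slots

tokens = ["Play", "Jeff", "Pilson", "on", "Youtube"]
tags = ["O", "B-ARTIST", "I-ARTIST", "O", "B-SERVICE"]
print(bio_to_slots(tokens, tags))
# [('ARTIST', 'Jeff Pilson'), ('SERVICE', 'Youtube')]
```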
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Slot Substitution (SLOT-SUB)</head><p>SLOT-SUB (Figure <ref type="figure" target="#fig_0">1</ref> left) performs augmentation by substituting a particular text span (slot-value pair) in an utterance with a different text span that is semantically consistent, i.e., the slot label is the same. For example, in the utterance "Quali film animati stanno proiettando al cinema più vicino", one of the spans that can be substituted is the slot-value pair (più vicino, SPATIAL RELATION). Then, we collect other spans in D in which the slot values are different, but the slot label is the same. For instance, we find the substitute candidates SP = {("distanza a piedi", SPATIAL RELATION), ("lontano", SPATIAL RELATION), ("nel quartiere", SPATIAL RELATION), . . . }, and then we sample one span to replace the original span in the utterance.</p></div>
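A minimal sketch of the SLOT-SUB idea, under simplifying assumptions of our own: each example is a (tokens, BIO tags) pair and only the first slot span is substituted. The helper names are illustrative, not the authors' implementation.

```python
import random

# Sketch of SLOT-SUB (our simplification, not the authors' code): swap a slot
# span for another span with the same label collected from the training data D.
def collect_spans(dataset):
    """Map slot label -> list of (span tokens, span tags) seen in the data."""
    by_label = {}
    for tokens, tags in dataset:
        i = 0
        while i < len(tags):
            if tags[i].startswith("B-"):
                label, j = tags[i][2:], i + 1
                while j < len(tags) and tags[j] == "I-" + label:
                    j += 1
                by_label.setdefault(label, []).append((tokens[i:j], tags[i:j]))
                i = j
            else:
                i += 1
    return by_label

def slot_sub(tokens, tags, by_label, rng=random):
    """Replace the first slot span with a sampled span of the same label."""
    for i, tag in enumerate(tags):
        if tag.startswith("B-"):
            label, j = tag[2:], i + 1
            while j < len(tags) and tags[j] == "I-" + label:
                j += 1
            # candidates: same label, different value
            candidates = [s for s in by_label.get(label, []) if s[0] != tokens[i:j]]
            if not candidates:
                return tokens, tags       # nothing to substitute with
            new_toks, new_tags = rng.choice(candidates)
            return (tokens[:i] + list(new_toks) + tokens[j:],
                    tags[:i] + list(new_tags) + tags[j:])
    return tokens, tags
```

For the Italian example above, the span "più vicino" (SPATIAL RELATION) would be swapped for another SPATIAL RELATION span such as "nel quartiere" drawn from D.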
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">CROP and ROTATE</head><p>In order to produce sentence variations, we apply the crop and rotate operations proposed in Şahin and Steedman (2018), which manipulate the sentence structure through its dependency parse tree. The goal of CROP (Figure <ref type="figure" target="#fig_0">1</ref> middle) is to simplify the sentence so that it focuses on a particular fragment (e.g. subject/object) by removing other fragments in the sentence. CROP uses the dependency tree to identify the fragment and then removes it and its children from the dependency tree. The ROTATE (Figure <ref type="figure" target="#fig_0">1</ref> right) operation is performed by moving a particular fragment (including subject/object) around the root of the tree, typically the verb in the sentence. For each operation, all possible combinations are generated, and one of them is picked randomly as the augmented sentence. Both CROP and ROTATE rely on the universal dependency labels <ref type="bibr" target="#b12">(Nivre et al., 2017)</ref> to identify relevant fragments, such as NSUBJ (nominal subject), DOBJ (direct object), OBJ (object), IOBJ (indirect object).</p></div>
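The CROP operation can be sketched on a hand-built dependency tree (we do not call a parser here; the paper uses Stanza). The toy tree, its dependency analysis, and the helper names are our own assumptions.

```python
# Toy sketch of CROP (our illustration): keep the root plus one labelled
# fragment (e.g. the object) and its whole subtree, dropping everything else.
# tree maps 1-indexed token position -> (head position, deprel); head 0 = root.
def descendants(tree, idx):
    """All positions in the subtree rooted at idx (including idx)."""
    out, frontier = {idx}, [idx]
    while frontier:
        h = frontier.pop()
        for i, (head, _) in tree.items():
            if head == h and i not in out:
                out.add(i)
                frontier.append(i)
    return out

def crop_to_fragment(tokens, tree, keep_rel):
    """Keep the root and the keep_rel fragment(s), in original word order."""
    root = next(i for i, (h, _) in tree.items() if h == 0)
    keep = {root}
    for i, (h, rel) in tree.items():
        if h == root and rel == keep_rel:
            keep |= descendants(tree, i)
    return [tokens[i - 1] for i in sorted(keep)]

tokens = ["Play", "Jeff", "Pilson", "on", "Youtube"]
# assumed analysis: "Play" is the root, "Pilson" its obj, "Youtube" its obl
tree = {1: (0, "root"), 2: (3, "compound"), 3: (1, "obj"),
        4: (5, "case"), 5: (1, "obl")}
print(crop_to_fragment(tokens, tree, "obj"))
# ['Play', 'Jeff', 'Pilson']
```

ROTATE would instead reorder such root-attached fragments around the root rather than delete them.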
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Experiments</head><p>Our primary goal is to verify the effectiveness of data augmentation on Italian, Hindi, Turkish, Spanish and Thai NLU datasets with limited labeled data. To this end, we compare the performance of a baseline NLU model trained on the original training data (D) with an NLU model that incorporates the augmented data as additional training instances (D + D′). To simulate the limited labeled data situation, we randomly sample 10% of the training data for each dataset.</p><p>Baseline and Data Augmentation (DA) Methods. We use the state-of-the-art BERT-based joint intent and slot filling model <ref type="bibr" target="#b1">(Chen et al., 2019)</ref> as the baseline model. We leverage the pre-trained multilingual BERT (M-BERT), which is trained on 104 languages. During training, M-BERT is fine-tuned on the slot filling and intent classification tasks. Given a sentence representation x = ([CLS] t_1 t_2 ... t_L), we use the hidden state h_[CLS] to predict the intent, and h_t_i to predict the slot label of token t_i. As for DA methods, in addition to the methods described in Section 3, we add one configuration, COMBINE, which combines the results of SLOT-SUB and ROTATE, as ROTATE obtains better results than CROP on the development set.</p><p>Settings. The model is trained with the BertAdam optimizer for 30 epochs with early stopping. The learning rate is set to 10^-5 and the batch size is 16. All the hyperparameters are listed in Appendix A. For SLOT-SUB, the number of augmentations per sentence N is tuned on the development set. To produce the dependency tree, we parse the sentence using Stanza <ref type="bibr" target="#b14">(Qi et al., 2020)</ref>. For both CROP and ROTATE we follow the default hyperparameters from Şahin and Steedman (2018). We did not experiment with Thai for CROP and ROTATE, as Thai is not supported by Stanza. 
The number of augmented sentences (D′) for each method is listed in Table <ref type="table" target="#tab_0">1</ref>. As evaluation metrics, we use the standard CoNLL script to compute the F1 score for slot filling and accuracy for intent classification.</p><p>Datasets. For Italian, we use the data from <ref type="bibr" target="#b0">Bellomaria et al. (2019)</ref>, translated from the English SNIPS dataset <ref type="bibr" target="#b2">(Coucke et al., 2018)</ref>. SNIPS has been widely used for evaluating NLU models and consists of utterances in multiple domains. For Hindi and Turkish, we use the ATIS dataset from <ref type="bibr" target="#b20">Upadhyay et al. (2018)</ref>, derived from <ref type="bibr" target="#b7">Hemphill et al. (1990)</ref>. ATIS is a well-known NLU dataset in the flight domain. For Spanish and Thai, we use the FB dataset from <ref type="bibr" target="#b17">Schuster et al. (2019)</ref>, which contains utterances in the alarm, weather, and reminder domains. The overall statistics of the datasets are shown in Table <ref type="table" target="#tab_0">1</ref>.</p></div>
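The span-level slot F1 can be sketched as follows; this is a simplified stand-in for the standard CoNLL evaluation script, with helper names of our own, and it ignores some edge cases the official script handles.

```python
# Simplified sketch of CoNLL-style slot F1 (not the official script): a
# predicted span counts only if its boundaries and label exactly match gold.
def bio_to_spans(tags):
    """Extract (start, end, label) spans from a BIO tag sequence."""
    spans, start, label = set(), None, None
    for i, tag in enumerate(list(tags) + ["O"]):   # sentinel closes last span
        if start is not None and not tag.startswith("I-"):
            spans.add((start, i, label))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
    return spans

def slot_f1(gold_seqs, pred_seqs):
    """Micro-averaged span F1 over a corpus of tag sequences."""
    tp = fp = fn = 0
    for g, p in zip(gold_seqs, pred_seqs):
        gs, ps = bio_to_spans(g), bio_to_spans(p)
        tp += len(gs & ps)
        fp += len(ps - gs)
        fn += len(gs - ps)
    prec = tp / (tp + fp) if tp + fp else 0.0
    rec = tp / (tp + fn) if tp + fn else 0.0
    return 2 * prec * rec / (prec + rec) if prec + rec else 0.0
```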
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Results</head><p>The overall results reported in Table <ref type="table" target="#tab_1">2</ref> show that applying DA improves performance on slot filling and intent classification across all languages. In particular, for SF, the SLOT-SUB method yields the best results, while for IC, ROTATE obtains better performance than CROP in most cases. These results are consistent with the findings of Louvan and Magnini (2020) on the English datasets, where SLOT-SUB improves SF and CROP or ROTATE improve IC. We think ROTATE beats CROP on IC because CROP may change the intent of the original sentence: intents typically depend on the occurrence of specific slots, so when the cropped part is a slot value, the sentence's overall semantics may change.</p><p>We can see that languages with different typological features (e.g. subject/verb/object ordering)<ref type="foot" target="#foot_1">1</ref> benefit from the ROTATE operation for IC. This result suggests that augmentation can produce useful noise (regularization) that helps the model alleviate overfitting when labeled data is limited. COMBINE still helps the performance of both SF and IC, although the improvements are not as high as when only one augmentation method is applied. The language that benefits the most from COMBINE is Turkish. We hypothesize that, as Turkish has a more flexible word order than the other languages, it benefits the most when ROTATE is performed.</p><p>Performance on varying data sizes. To better understand the effectiveness of SLOT-SUB, we perform further analysis on different training data sizes (see Figure <ref type="figure" target="#fig_1">2</ref>). Overall, we observe that as we increase the training size, the benefit of SLOT-SUB decreases for all datasets. 
For some datasets, namely ATIS-HI and FB-ES, SLOT-SUB can cause a performance drop for larger data sizes, although it is reasonably small (less than 1 F1 point). FB-TH consistently benefits from SLOT-SUB even when the full training data is used. The training data size up to which the improvement is significant varies across datasets<ref type="foot" target="#foot_2">2</ref>. For SNIPS-IT, the improvement is clear for all training data sizes, and it is statistically significant up to a training data size of 80%. For ATIS-HI, improvements are significant up to a data size of 40%. For the FB datasets, improvements are significant only up to a training data size of 10%. Overall, we can see that SLOT-SUB is effective when data is scarce (5%, 10%), while it is still relatively robust for larger data sizes on all datasets.</p><p>Performance on different numbers of augmentations per utterance (N). We examine the effect of a larger number of augmentations per utterance (N) on model performance, specifically for SF (see Figure <ref type="figure" target="#fig_2">3</ref>). For FB-ES, similarly to the results in Table <ref type="table" target="#tab_1">2</ref>, increasing N does not affect performance. For the other datasets, increasing N brings performance improvements. For ATIS-HI, SNIPS-IT, and FB-TH the trend is that, as we increase N, performance goes up and then plateaus. For ATIS-TR, changing N does not really affect the gain, as the performance trend is quite steady across numbers of augmentations. For most values of N in each dataset (except FB-ES), the difference between the model using SLOT-SUB and the model without it is significant<ref type="foot" target="#foot_3">3</ref>.</p></div>
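Significance is assessed with a Wilcoxon signed-rank test over runs (Table 2). As a self-contained illustration of the underlying idea, the sketch below uses a paired sign-flip permutation test instead (`scipy.stats.wilcoxon` would be the direct route), on invented F1 scores for ten runs, not the paper's numbers.

```python
import random

# Paired sign-flip permutation test (a stand-in for the Wilcoxon signed-rank
# test used in the paper). Under the null, each per-run difference is equally
# likely to be positive or negative, so we compare the observed total
# difference against randomly sign-flipped totals.
def paired_permutation_p(a, b, n_resamples=10_000, seed=0):
    rng = random.Random(seed)
    diffs = [x - y for x, y in zip(a, b)]
    observed = abs(sum(diffs))
    hits = 0
    for _ in range(n_resamples):
        flipped = sum(d if rng.random() < 0.5 else -d for d in diffs)
        if abs(flipped) >= observed:
            hits += 1
    return hits / n_resamples

# Made-up slot F1 scores for 10 runs (illustrative only).
baseline = [78.1, 78.4, 77.9, 78.6, 78.2, 78.0, 78.5, 78.3, 77.8, 78.7]
slot_sub = [81.8, 82.1, 81.6, 82.3, 81.9, 81.7, 82.2, 82.0, 81.5, 82.4]
p = paired_permutation_p(slot_sub, baseline)
print(f"p = {p:.4f}")   # small p: the gain is unlikely under chance sign flips
```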
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Related Work</head><p>Data augmentation methods proposed in NLP aim to automatically produce additional training data, ranging from simple word substitution <ref type="bibr" target="#b23">(Wei and Zou, 2019)</ref> to more complex methods that aim at semantically preserving sentence generation <ref type="bibr" target="#b8">(Hou et al., 2018;</ref><ref type="bibr" target="#b5">Gao et al., 2020)</ref>. In the context of slot filling and intent classification, recent augmentation methods typically apply deep learning models to produce augmented utterances. <ref type="bibr" target="#b8">Hou et al. (2018)</ref> propose a two-stage method consisting of delexicalized utterance generation and slot value realization. Their method is based on a sequence-to-sequence model <ref type="bibr" target="#b18">(Sutskever et al., 2014)</ref> that produces a paraphrase of an utterance with its slot value placeholders (delexicalized) for a given intent. For slot value lexicalization, they use the slot values in the training data that occur in similar contexts. <ref type="bibr" target="#b25">Zhao et al. (2019)</ref> train a sequence-to-sequence model on training instances that consist of pairs of atomic templates of dialogue acts and their sentence realizations. Yoo et al. (2019) propose extending the Variational Auto Encoder (VAE) (Kingma and Welling, 2014) into a Conditional VAE (CVAE) to generate synthetic utterances; the CVAE controls the utterance generation by conditioning on the intent and slot labels during model training. Recent work from <ref type="bibr" target="#b14">Peng et al. (2020)</ref> makes use of a Transformer <ref type="bibr" target="#b22">(Vaswani et al., 2017)</ref> based pre-trained NLG model, namely GPT-2 <ref type="bibr" target="#b15">(Radford et al., 2019)</ref>, fine-tuned on slot filling datasets to produce synthetic utterances. We consider these deep learning based approaches heavyweight, as they often require several stages in the augmentation process, namely generating augmentation candidates, then ranking and filtering the candidates before producing the final augmented data. Consequently, the computation time of these approaches is generally higher, as separate training is required for the augmentation model in addition to the joint SF-IC model. 
Recent work from Louvan and Magnini (2020) applies a set of lightweight methods, most of which do not require model training. These augmentation methods focus on varying the slot values through substitution mechanisms and varying the sentence structure through dependency tree manipulation. While the methods are relatively simple, they obtain results competitive with deep learning based approaches on the standard English slot filling benchmarks, namely the ATIS <ref type="bibr" target="#b7">(Hemphill et al., 1990)</ref>, SNIPS <ref type="bibr" target="#b2">(Coucke et al., 2018)</ref>, and FB <ref type="bibr" target="#b17">(Schuster et al., 2019)</ref> datasets.</p><p>Existing methods mostly evaluate their approaches on English datasets, and little work has been done on other languages. Our work focuses on investigating the effect of data augmentation on five non-English languages. We apply a subset of the lightweight augmentation methods from Louvan and Magnini (2020) that do not require separate model training to produce augmented data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7">Conclusion</head><p>We evaluate the effectiveness of data augmentation for slot filling and intent classification in five typologically diverse languages. Our results show that by applying simple augmentation, namely slot value substitutions and dependency tree manipulations, we can obtain substantial improvements in most cases when only a small amount of training data is available. We also show that a large pre-trained multilingual BERT benefits from data augmentation.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1:</head><label>1</label><figDesc>Figure 1: Augmentation operations performed on an utterance, "Quali film animati stanno proiettando al cinema più vicino" ("Which animated films are showing at the nearest cinema"). The utterance is taken from the Italian SNIPS dataset.</figDesc><graphic coords="2,72.01,62.81,453.52,158.73" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2:</head><label>2</label><figDesc>Figure 2: Improvement (ΔF1) obtained by SLOT-SUB (SS) on different training data sizes. Positive numbers mean that the model with SS yields a gain.</figDesc><graphic coords="4,307.28,62.81,218.27,145.51" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3:</head><label>3</label><figDesc>Figure 3: Gain (ΔF1) obtained by SLOT-SUB (SS) for various numbers of augmented sentences (N). Positive numbers mean that the model with SS yields a gain.</figDesc><graphic coords="4,307.28,535.50,218.27,120.46" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>Statistics on the datasets. #train indicates our limited training data setup (10% of full training data). D is produced by tuning the number of augmentations per utterance (N ) on the dev set.</figDesc><table><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>#Label</cell><cell></cell><cell cols="4">#Utterances (D)</cell><cell></cell><cell>#Augmented Utterances (D )</cell></row><row><cell cols="2">Dataset</cell><cell cols="5">Language #slot #intent #train</cell><cell cols="2">#dev</cell><cell cols="3">#test #SLOT-SUB #CROP #ROTATE</cell></row><row><cell cols="3">SNIPS-IT Italian</cell><cell></cell><cell>39</cell><cell>7</cell><cell>574</cell><cell></cell><cell>700</cell><cell>698</cell><cell></cell><cell>5,404</cell><cell>1,431</cell><cell>1,889</cell></row><row><cell cols="2">ATIS-HI</cell><cell>Hindi</cell><cell></cell><cell>73</cell><cell>17</cell><cell>176</cell><cell></cell><cell>440</cell><cell>893</cell><cell></cell><cell>1,286</cell><cell>460</cell><cell>472</cell></row><row><cell cols="2">ATIS-TR</cell><cell>Turkish</cell><cell></cell><cell>70</cell><cell>17</cell><cell>99</cell><cell></cell><cell>248</cell><cell>715</cell><cell></cell><cell>144</cell><cell>161</cell><cell>194</cell></row><row><cell cols="2">FB-ES</cell><cell cols="2">Spanish</cell><cell>11</cell><cell>12</cell><cell cols="4">361 1,983 3,043</cell><cell></cell><cell>1,455</cell><cell>769</cell><cell>1,028</cell></row><row><cell cols="2">FB-TH</cell><cell>Thai</cell><cell></cell><cell>8</cell><cell>10</cell><cell cols="4">215 1,235 1,692</cell><cell></cell><cell>781</cell><cell>-</cell><cell>-</cell></row><row><cell>Model</cell><cell>DA</cell><cell></cell><cell cols="2">SNIPS-IT</cell><cell cols="2">ATIS-HI</cell><cell></cell><cell></cell><cell 
cols="2">ATIS-TR</cell><cell>FB-ES</cell><cell>FB-TH</cell></row><row><cell></cell><cell></cell><cell></cell><cell>Slot</cell><cell>Intent</cell><cell>Slot</cell><cell cols="2">Intent</cell><cell></cell><cell>Slot</cell><cell>Intent</cell><cell>Slot</cell><cell>Intent</cell><cell>Slot</cell><cell>Intent</cell></row><row><cell cols="2">M-BERT None</cell><cell></cell><cell>78.25</cell><cell>94.99</cell><cell>69.57</cell><cell cols="2">86.57</cell><cell cols="2">64.36</cell><cell>78.98</cell><cell>84.13</cell><cell>97.68</cell><cell>56.06</cell><cell>89.80</cell></row><row><cell></cell><cell cols="5">SLOT-SUB 81.97  † 94.93 72.44  †</cell><cell cols="2">87.29</cell><cell cols="2">66.60  †</cell><cell>79.85</cell><cell>84.27</cell><cell>97.72</cell><cell>59.68  † 91.42  †</cell></row><row><cell></cell><cell>CROP</cell><cell></cell><cell cols="2">80.12  † 94.60</cell><cell>70.04</cell><cell cols="2">86.92</cell><cell cols="2">65.11</cell><cell>79.48</cell><cell>83.85 98.08  †</cell><cell>-</cell><cell>-</cell></row><row><cell></cell><cell cols="2">ROTATE</cell><cell cols="2">79.24  † 95.37</cell><cell>70.69</cell><cell cols="2">87.60  †</cell><cell cols="2">65.20</cell><cell>80.06</cell><cell>83.28</cell><cell>98.20  †</cell><cell>-</cell><cell>-</cell></row><row><cell></cell><cell cols="2">COMBINE</cell><cell cols="3">81.27  † 95.00 72.13  †</cell><cell cols="2">86.93</cell><cell cols="4">66.68  † 81.12  † 83.67</cell><cell>97.94</cell><cell>-</cell><cell>-</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 :</head><label>2</label><figDesc>Performance comparison of the baseline and augmentation methods on the test set. F1 score is used for slot filling and accuracy for intent classification. Scores are the average of 10 different runs. † indicates statistically significant improvement over the baseline (p-value &lt; 0.05 according to Wilcoxon signed rank test).</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 :</head><label>3</label><figDesc>List of hyperparameters used for the BERT model and data augmentation methods Appendix B. Statistical Significance</figDesc><table><row><cell>Dataset</cell><cell>Nb Aug</cell><cell>p-value</cell></row><row><cell>ATIS-TR</cell><cell>2</cell><cell>0.005062032126</cell></row><row><cell></cell><cell>5</cell><cell>0.01251531869</cell></row><row><cell></cell><cell>10</cell><cell>0.006910429808</cell></row><row><cell></cell><cell>20</cell><cell>0.5001842571</cell></row><row><cell></cell><cell>25</cell><cell>0.07961580146</cell></row><row><cell>ATIS-HI</cell><cell>2</cell><cell>0.1097446387</cell></row><row><cell></cell><cell>5</cell><cell>0.005062032126</cell></row><row><cell></cell><cell>10</cell><cell>0.005062032126</cell></row><row><cell></cell><cell>20</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>25</cell><cell>0.04311444678</cell></row><row><cell>SNIPS-IT</cell><cell>2</cell><cell>0.005062032126</cell></row><row><cell></cell><cell>5</cell><cell>0.005062032126</cell></row><row><cell></cell><cell>10</cell><cell>0.005062032126</cell></row><row><cell></cell><cell>20</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>25</cell><cell>0.04311444678</cell></row><row><cell>FB-ES</cell><cell>2</cell><cell>0.0663160313</cell></row><row><cell></cell><cell>5</cell><cell>0.02831405495</cell></row><row><cell></cell><cell>10</cell><cell>0.09260069782</cell></row><row><cell></cell><cell>20</cell><cell>0.3452310718</cell></row><row><cell></cell><cell>25</cell><cell>0.07961580146</cell></row><row><cell>FB-TH</cell><cell>2</cell><cell>0.03665792867</cell></row><row><cell></cell><cell>5</cell><cell>0.005062032126</cell></row><row><cell></cell><cell>10</cell><cell>0.005062032126</cell></row><row><cell></cell><cell>20</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>25</cell><cell>0.04311444678</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 5 :</head><label>5</label><figDesc>The p-values of statistical tests on the experiments on Figure3</figDesc><table><row><cell>Dataset</cell><cell>Training Size (%)</cell><cell>p-value</cell></row><row><cell>ATIS-HI</cell><cell>5</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>10</cell><cell>0.005062032126</cell></row><row><cell></cell><cell>20</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>40</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>80</cell><cell>0.1380107376</cell></row><row><cell></cell><cell>100</cell><cell>0.2733216783</cell></row><row><cell>ATIS-TR</cell><cell>5</cell><cell>0.224915884</cell></row><row><cell></cell><cell>10</cell><cell>0.005062032126</cell></row><row><cell></cell><cell>20</cell><cell>0.7150006547</cell></row><row><cell></cell><cell>40</cell><cell>0.1797124949</cell></row><row><cell></cell><cell>80</cell><cell>0.1797124949</cell></row><row><cell></cell><cell>100</cell><cell>0.1797124949</cell></row><row><cell>SNIPS-IT</cell><cell>5</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>10</cell><cell>0.005062032126</cell></row><row><cell></cell><cell>20</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>40</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>80</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>100</cell><cell>0.04311444678</cell></row><row><cell>FB-ES</cell><cell>5</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>10</cell><cell>0.02831405495</cell></row><row><cell></cell><cell>20</cell><cell>0.1797124949</cell></row><row><cell></cell><cell>40</cell><cell>0.1755543028</cell></row><row><cell></cell><cell>80</cell><cell>0.1380107376</cell></row><row><cell></cell><cell>100</cell><cell>0.1797124949</cell></row><row><cell>FB-TH</cell><cell>5</cell><cell>0.04311444678</cell></row><row><cell></cell><cell>10</cell><cell>0.005062032126</cell></row><row><c
ell></cell><cell>20</cell><cell>0.1797124949</cell></row><row><cell></cell><cell>40</cell><cell>0.1797124949</cell></row><row><cell></cell><cell>80</cell><cell>0.1797124949</cell></row><row><cell></cell><cell>100</cell><cell>0.10880943</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 4:</head><label>4</label><figDesc>The p-values of statistical tests for the experiments in Figure 2.</figDesc><table /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_1">Italian, Spanish, and Thai are SVO languages while Hindi and Turkish are SOV languages.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_2">For more details of the p-value of the statistical tests please refer to Appendix B</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_3">For more details of the p-value of the statistical tests please refer to Appendix B</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>We thank Valentina Bellomaria for providing the Italian SNIPS dataset. We thank Clara Vania for the feedback on the early draft of the paper.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0" />			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Almawaveslu: A new dataset for SLU in italian</title>
		<author>
			<persName><forename type="first">Valentina</forename><surname>Bellomaria</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Giuseppe</forename><surname>Castellucci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Andrea</forename><surname>Favalli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Raniero</forename><surname>Romagnoli</surname></persName>
		</author>
		<ptr target="https://ceur-ws.org" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Sixth Italian Conference on Computational Linguistics</title>
		<title level="s">CEUR Workshop Proceedings</title>
		<editor>
			<persName><forename type="first">Raffaella</forename><surname>Bernardi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Roberto</forename><surname>Navigli</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Giovanni</forename><surname>Semeraro</surname></persName>
		</editor>
		<meeting>the Sixth Italian Conference on Computational Linguistics<address><addrLine>Bari, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019-11-13">2019. November 13-15. 2019</date>
			<biblScope unit="volume">2481</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">Qian</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zhu</forename><surname>Zhuo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Wen</forename><surname>Wang</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1902.10909</idno>
		<title level="m">BERT for joint intent classification and slot filling</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Snips voice platform: an embedded spoken language understanding system for privateby-design voice interfaces</title>
		<author>
			<persName><forename type="first">Alice</forename><surname>Coucke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alaa</forename><surname>Saade</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Adrien</forename><surname>Ball</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Théodore</forename><surname>Bluche</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alexandre</forename><surname>Caulier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">David</forename><surname>Leroy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Clément</forename><surname>Doumouro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thibault</forename><surname>Gisselbrecht</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Francesco</forename><surname>Caltagirone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Thibaut</forename><surname>Lavril</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Maël</forename><surname>Primet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Joseph</forename><surname>Dureau</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1805.10190</idno>
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">BERT: Pre-training of deep bidirectional transformers for language understanding</title>
		<author>
			<persName><forename type="first">Jacob</forename><surname>Devlin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ming-Wei</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kenton</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kristina</forename><surname>Toutanova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
		<title level="s">Long and Short Papers</title>
		<meeting>the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>Minneapolis, Minnesota</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2019-06">2019. June</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="4171" to="4186" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Data augmentation for low-resource neural machine translation</title>
		<author>
			<persName><forename type="first">Marzieh</forename><surname>Fadaee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Arianna</forename><surname>Bisazza</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Christof</forename><surname>Monz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017</title>
		<title level="s">Short Papers</title>
		<editor>
			<persName><forename type="first">Regina</forename><surname>Barzilay</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Min-Yen</forename><surname>Kan</surname></persName>
		</editor>
		<meeting>the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017<address><addrLine>Vancouver, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017-07-30">2017. July 30 -August 4</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="567" to="573" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Paraphrase augmented task-oriented dialog generation</title>
		<author>
			<persName><forename type="first">Silin</forename><surname>Gao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yichi</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zhijian</forename><surname>Ou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Zhou</forename><surname>Yu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online</title>
				<editor>
			<persName><forename type="first">Dan</forename><surname>Jurafsky</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Joyce</forename><surname>Chai</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Natalie</forename><surname>Schluter</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Joel</forename><forename type="middle">R</forename><surname>Tetreault</surname></persName>
		</editor>
		<meeting>the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online</meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2020-07-05">2020. July 5-10, 2020</date>
			<biblScope unit="page" from="639" to="649" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Slot-gated modeling for joint slot filling and intent prediction</title>
		<author>
			<persName><forename type="first">Chih-Wen</forename><surname>Goo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Guang</forename><surname>Gao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yun-Kai</forename><surname>Hsu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chih-Li</forename><surname>Huo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tsung-Chieh</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Keng-Wei</forename><surname>Hsu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yun-Nung</forename><surname>Chen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
				<meeting>the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="753" to="757" />
		</imprint>
	</monogr>
	<note>Short Papers</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The ATIS spoken language systems pilot corpus</title>
		<author>
			<persName><forename type="first">Charles</forename><forename type="middle">T</forename><surname>Hemphill</surname></persName>
		</author>
		<author>
			<persName><forename type="first">John</forename><forename type="middle">J</forename><surname>Godfrey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">George</forename><forename type="middle">R</forename><surname>Doddington</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, Pennsylvania</title>
				<meeting><address><addrLine>Hidden Valley, Pennsylvania, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Morgan Kaufmann</publisher>
			<date type="published" when="1990-06-24">1990. June 24-27, 1990</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Sequence-to-sequence data augmentation for dialogue language understanding</title>
		<author>
			<persName><forename type="first">Yutai</forename><surname>Hou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yijia</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Wanxiang</forename><surname>Che</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ting</forename><surname>Liu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 27th International Conference on Computational Linguistics</title>
				<meeting>the 27th International Conference on Computational Linguistics<address><addrLine>Santa Fe, New Mexico, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2018-08">2018. August</date>
			<biblScope unit="page" from="1234" to="1245" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Auto-encoding variational Bayes</title>
		<author>
			<persName><forename type="first">Diederik</forename><forename type="middle">P</forename><surname>Kingma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Max</forename><surname>Welling</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2nd International Conference on Learning Representations, ICLR 2014</title>
		<title level="s">Conference Track Proceedings</title>
		<editor>
			<persName><forename type="first">Yoshua</forename><surname>Bengio</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Yann</forename><surname>Lecun</surname></persName>
		</editor>
		<meeting><address><addrLine>Banff, AB, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2014-04-14">2014. April 14-16, 2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<author>
			<persName><forename type="first">Varun</forename><surname>Kumar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ashutosh</forename><surname>Choudhary</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Eunah</forename><surname>Cho</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2003.02245</idno>
		<title level="m">Data augmentation using pre-trained transformer models</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<author>
			<persName><forename type="first">Samuel</forename><surname>Louvan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bernardo</forename><surname>Magnini</surname></persName>
		</author>
		<ptr target="https://arxiv.org/abs/2009.03695" />
		<title level="m">Simple is better! lightweight data augmentation for low resource slot filling and intent classification</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
	<note>PACLIC 2020 - The 34th Pacific Asia Conference on Language, Information and Computation</note>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<author>
			<persName><forename type="first">Joakim</forename><surname>Nivre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Željko</forename><surname>Agić</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lars</forename><surname>Ahrenberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lene</forename><surname>Antonsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Maria</forename><surname>Jesus Aranzabe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Masayuki</forename><surname>Asahara</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Luma</forename><surname>Ateyah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mohammed</forename><surname>Attia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Aitziber</forename><surname>Atutxa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Liesbeth</forename><surname>Augustinus</surname></persName>
		</author>
		<title level="m">Universal Dependencies 2.1</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m" type="main">Data augmentation for spoken language understanding via pretrained models</title>
		<author>
			<persName><forename type="first">Baolin</forename><surname>Peng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chenguang</forename><surname>Zhu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Zeng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jianfeng</forename><surname>Gao</surname></persName>
		</author>
		<idno>CoRR, abs/2004.13952</idno>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Stanza: A python natural language processing toolkit for many human languages</title>
		<author>
			<persName><forename type="first">Peng</forename><surname>Qi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yuhao</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yuhui</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jason</forename><surname>Bolton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Christopher</forename><forename type="middle">D</forename><surname>Manning</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations</title>
				<meeting>the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations</meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2020-07">2020. July</date>
			<biblScope unit="page" from="101" to="108" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Language models are unsupervised multitask learners</title>
		<author>
			<persName><forename type="first">Alec</forename><surname>Radford</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jeffrey</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Rewon</forename><surname>Child</surname></persName>
		</author>
		<author>
			<persName><forename type="first">David</forename><surname>Luan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dario</forename><surname>Amodei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ilya</forename><surname>Sutskever</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Data augmentation via dependency tree morphing for low-resource languages</title>
		<author>
			<persName><forename type="first">Gözde</forename><forename type="middle">Gül</forename><surname>Şahin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mark</forename><surname>Steedman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing</title>
				<meeting>the 2018 Conference on Empirical Methods in Natural Language Processing<address><addrLine>Brussels, Belgium</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2018-10">2018. October-November</date>
			<biblScope unit="page" from="5004" to="5009" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Cross-lingual transfer learning for multilingual task oriented dialog</title>
		<author>
			<persName><forename type="first">Sebastian</forename><surname>Schuster</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sonal</forename><surname>Gupta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Rushin</forename><surname>Shah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mike</forename><surname>Lewis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</title>
		<title level="s">Long and Short Papers</title>
		<meeting>the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies<address><addrLine>Minneapolis, Minnesota</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2019-06">2019. June</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="3795" to="3805" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Sequence to sequence learning with neural networks</title>
		<author>
			<persName><forename type="first">Ilya</forename><surname>Sutskever</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Oriol</forename><surname>Vinyals</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Quoc</forename><forename type="middle">V</forename><surname>Le</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014</title>
				<editor>
			<persName><forename type="first">Zoubin</forename><surname>Ghahramani</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Max</forename><surname>Welling</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Corinna</forename><surname>Cortes</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Neil</forename><forename type="middle">D</forename><surname>Lawrence</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Kilian</forename><forename type="middle">Q</forename><surname>Weinberger</surname></persName>
		</editor>
		<meeting><address><addrLine>Montreal, Quebec, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2014-12-08">2014. December 8-13 2014</date>
			<biblScope unit="page" from="3104" to="3112" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Spoken language understanding: Systems for extracting semantic information from speech</title>
		<author>
			<persName><forename type="first">Gokhan</forename><surname>Tur</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Renato</forename><forename type="middle">De</forename><surname>Mori</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2011">2011</date>
			<publisher>John Wiley &amp; Sons</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">(Almost) zero-shot cross-lingual spoken language understanding</title>
		<author>
			<persName><forename type="first">Shyam</forename><surname>Upadhyay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Manaal</forename><surname>Faruqui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Gokhan</forename><surname>Tür</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dilek</forename><surname>Hakkani-Tür</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Larry</forename><surname>Heck</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2018">2018. 2018</date>
			<biblScope unit="page" from="6034" to="6038" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">A systematic comparison of methods for low-resource dependency parsing on genuinely low-resource languages</title>
		<author>
			<persName><forename type="first">Clara</forename><surname>Vania</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yova</forename><surname>Kementchedjhieva</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Anders</forename><surname>Søgaard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Adam</forename><surname>Lopez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019</title>
		<editor>
			<persName><forename type="first">Kentaro</forename><surname>Inui</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Jing</forename><surname>Jiang</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Vincent</forename><surname>Ng</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Xiaojun</forename><surname>Wan</surname></persName>
		</editor>
		<meeting>the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019<address><addrLine>Hong Kong, China</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019-11-03">2019. November 3-7, 2019</date>
			<biblScope unit="page" from="1105" to="1116" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Attention is all you need</title>
		<author>
			<persName><forename type="first">Ashish</forename><surname>Vaswani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Noam</forename><surname>Shazeer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Niki</forename><surname>Parmar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jakob</forename><surname>Uszkoreit</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Llion</forename><surname>Jones</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Aidan</forename><forename type="middle">N</forename><surname>Gomez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lukasz</forename><surname>Kaiser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Illia</forename><surname>Polosukhin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017</title>
				<editor>
			<persName><forename type="first">Isabelle</forename><surname>Guyon</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Ulrike</forename><surname>Von Luxburg</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Samy</forename><surname>Bengio</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Hanna</forename><forename type="middle">M</forename><surname>Wallach</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Rob</forename><surname>Fergus</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><forename type="middle">V N</forename><surname>Vishwanathan</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Roman</forename><surname>Garnett</surname></persName>
		</editor>
		<meeting><address><addrLine>Long Beach, CA, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017-12">2017. December 2017</date>
			<biblScope unit="page" from="5998" to="6008" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">EDA: easy data augmentation techniques for boosting performance on text classification tasks</title>
		<author>
			<persName><forename type="first">Jason</forename><forename type="middle">W</forename><surname>Wei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kai</forename><surname>Zou</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019</title>
				<editor>
			<persName><forename type="first">Kentaro</forename><surname>Inui</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Jing</forename><surname>Jiang</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Vincent</forename><surname>Ng</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Xiaojun</forename><surname>Wan</surname></persName>
		</editor>
		<meeting>the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019<address><addrLine>Hong Kong, China</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019-11-03">2019. November 3-7, 2019</date>
			<biblScope unit="page" from="6381" to="6387" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Data augmentation for spoken language understanding via joint variational generation</title>
		<author>
			<persName><forename type="first">Kang</forename><forename type="middle">Min</forename><surname>Yoo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Youhyun</forename><surname>Shin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sang-Goo</forename><surname>Lee</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019</title>
				<meeting><address><addrLine>Honolulu, Hawaii, USA</addrLine></address></meeting>
		<imprint>
			<publisher>AAAI Press</publisher>
			<date type="published" when="2019-01-27">2019. January 27 -February 1, 2019</date>
			<biblScope unit="page" from="7402" to="7409" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Data augmentation with atomic templates for spoken language understanding</title>
		<author>
			<persName><forename type="first">Zijian</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Su</forename><surname>Zhu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kai</forename><surname>Yu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019</title>
				<editor>
			<persName><forename type="first">Kentaro</forename><surname>Inui</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Jing</forename><surname>Jiang</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Vincent</forename><surname>Ng</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Xiaojun</forename><surname>Wan</surname></persName>
		</editor>
		<meeting>the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, EMNLP-IJCNLP 2019<address><addrLine>Hong Kong, China; November</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019. 3-7, 2019</date>
			<biblScope unit="page" from="3635" to="3641" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
