Produced by | 深度学习这件小事 (WeChat official account)
To republish, please contact the account for authorization.
Natural Language Processing (updated October 23)
[1] UniCase -- Rethinking Casing in Language Models
Authors | Rafal Powalski, Tomasz Stanislawek
Link | https://arxiv.org/abs/2010.11936
[2] mT5: A massively multilingual pre-trained text-to-text transformer
Authors | Linting Xue, Noah Constant, Adam Roberts, Mihir Kale, Rami Al-Rfou, Aditya Siddhant, Aditya Barua, Colin Raffel
Link | https://arxiv.org/abs/2010.11934
[3] Scientific Claim Verification with VERT5ERINI
Authors | Ronak Pradeep, Xueguang Ma, Rodrigo Nogueira, Jimmy Lin
Link | https://arxiv.org/abs/2010.11930
[4] Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval
Authors | Akari Asai, Eunsol Choi
Link | https://arxiv.org/abs/2010.11915
[5] Rewriting Meaningful Sentences via Conditional BERT Sampling and an application on fooling text classifiers
Authors | Lei Xu, Ivan Ramirez, Kalyan Veeramachaneni
Link | https://arxiv.org/abs/2010.11869
[6] Not all parameters are born equal: Attention is mostly what you need
Authors | Nikolay Bogoychev
Link | https://arxiv.org/abs/2010.11859
[7] XOR QA: Cross-lingual Open-Retrieval Question Answering
Authors | Akari Asai, Jungo Kasai, Jonathan H. Clark, Kenton Lee, Eunsol Choi, Hannaneh Hajishirzi
Link | https://arxiv.org/abs/2010.11856
Project link | https://nlp.cs.washington.edu/xorqa
[8] Detecting and Exorcising Statistical Demons from Language Models with Anti-Models of Negative Data
Authors | Michael L. Wick, Kate Silverstein, Jean-Baptiste Tristan, Adam Pocock, Mark Johnson
Link | https://arxiv.org/abs/2010.11855
[9] STAR: A Schema-Guided Dialog Dataset for Transfer Learning
Authors | Johannes E. M. Mosig, Shikib Mehri, Thomas Kober
Link | https://arxiv.org/abs/2010.11853
Comments | Equal contribution: Johannes E. M. Mosig, Shikib Mehri
[10] Compositional Generalization via Semantic Tagging
Authors | Hao Zheng, Mirella Lapata
Link | https://arxiv.org/abs/2010.11818
[11] ConVEx: Data-Efficient and Few-Shot Slot Labeling
Authors | Matthew Henderson, Ivan Vulić
Link | https://arxiv.org/abs/2010.11791
[12] Self-alignment Pre-training for Biomedical Entity Representations
Authors | Fangyu Liu, Ehsan Shareghi, Zaiqiao Meng, Marco Basaldella, Nigel Collier
Link | https://arxiv.org/abs/2010.11784
Comments | 8 pages. work in progress
[13] EIGEN: Event Influence GENeration using Pre-trained Language Models
Authors | Aman Madaan, Dheeraj Rajagopal, Yiming Yang, Abhilasha Ravichander, Eduard Hovy, Shrimai Prabhumoye
Link | https://arxiv.org/abs/2010.11764
[14] CUNI Systems for the Unsupervised and Very Low Resource Translation Task in WMT20
Authors | Ivana Kvapilíková, Tom Kocmi, Ondřej Bojar
Link | https://arxiv.org/abs/2010.11747
Comments | WMT20
[15] Improving BERT Performance for Aspect-Based Sentiment Analysis
Authors | Akbar Karimi, Leonardo Rossi, Andrea Prati
Link | https://arxiv.org/abs/2010.11731
[16] An Analysis of Simple Data Augmentation for Named Entity Recognition
Authors | Xiang Dai, Heike Adel
Link | https://arxiv.org/abs/2010.11683
Comments | COLING 2020
[17] Reducing Unintended Identity Bias in Russian Hate Speech Detection
Authors | Nadezhda Zueva, Madina Kabirova, Pavel Kalaidin
Link | https://arxiv.org/abs/2010.11666
[18] Towards Fully Bilingual Deep Language Modeling
Authors | Li-Hsin Chang, Sampo Pyysalo, Jenna Kanerva, Filip Ginter
Link | https://arxiv.org/abs/2010.11639
[19] AI-lead Court Debate Case Investigation
Authors | Changzhen Ji, Conghui Zhu, Tiejun Zhao
Link | https://arxiv.org/abs/2010.11604
Comments | 4 pages, 2 figures
[20] A Technical Report: BUT Speech Translation Systems
Authors | Hari Krishna Vydana, Lukas Burget, Jan Cernocky
Link | https://arxiv.org/abs/2010.11593
[21] Multi-dimensional Style Transfer for Partially Annotated Data using Language Models as Discriminators
Authors | Navita Goyal, Balaji Vasan Srinivasan, Anandhavelu N, Abhilasha Sancheti
Link | https://arxiv.org/abs/2010.11578
[22] Investigating the True Performance of Transformers in Low-Resource Languages: A Case Study in Automatic Corpus Creation
Authors | Jan Christian Blaise Cruz, Jose Kristian Resabal, James Lin, Dan John Velasco, Charibeth Cheng
Link | https://arxiv.org/abs/2010.11574
Project link | https://github.com/jcblaisecruz02/Filipino-Text-Benchmarks
[23] Bilinear Fusion of Commonsense Knowledge with Attention-Based NLI Models
Authors | Amit Gajbhiye, Thomas Winterbottom, Noura Al Moubayed, Steven Bradley
Link | https://arxiv.org/abs/2010.11562
Comments | Published in Lecture Notes in Computer Science, Springer International Publishing
[24] Incorporating Stylistic Lexical Preferences in Generative Language Models
Authors | Hrituraj Singh, Gaurav Verma, Balaji Vasan Srinivasan
Link | https://arxiv.org/abs/2010.11553
Comments | To Appear in Findings of EMNLP 2020
[25] Method of noun phrase detection in Ukrainian texts
Authors | S.D. Pogorilyy, A.A. Kramov
Link | https://arxiv.org/abs/2010.11548
Comments | 25 pages, in Ukrainian, 5 figures, 2 tables
[26] Cross Copy Network for Dialogue Generation
Authors | Changzhen Ji, Xin Zhou, Yating Zhang, Xiaozhong Liu, Changlong Sun, Conghui Zhu, Tiejun Zhao
Link | https://arxiv.org/abs/2010.11539
Comments | 11 pages, 4 figures
[27] slimIPL: Language-Model-Free Iterative Pseudo-Labeling
Authors | Tatiana Likhomanenko, Qiantong Xu, Jacob Kahn, Gabriel Synnaeve, Ronan Collobert
Link | https://arxiv.org/abs/2010.11524
[28] An Industry Evaluation of Embedding-based Entity Alignment
Authors | Ziheng Zhang, Jiaoyan Chen, Xi Chen, Hualuo Liu, Yuejia Xiang, Bo Liu, Yefeng Zheng
Link | https://arxiv.org/abs/2010.11522
[29] Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
Authors | Lingkai Kong, Haoming Jiang, Yuchen Zhuang, Jie Lyu, Tuo Zhao, Chao Zhang
Link | https://arxiv.org/abs/2010.11506
Comments | EMNLP2020 long paper
[30] On the Effects of Using word2vec Representations in Neural Networks for Dialogue Act Recognition
Authors | Christophe Cerisara (SYNALP), Pavel Kral, Ladislav Lenc
Link | https://arxiv.org/abs/2010.11490
[31] Knowledge Distillation for BERT Unsupervised Domain Adaptation
Authors | Minho Ryu, Kichun Lee
Link | https://arxiv.org/abs/2010.11478
[32] MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation
Authors | Junkun Chen, Mingbo Ma, Renjie Zheng, Liang Huang
Link | https://arxiv.org/abs/2010.11445
Comments | 10 pages
[33] Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Authors | Xie Chen, Yu Wu, Zhenghao Wang, Shujie Liu, Jinyu Li
Link | https://arxiv.org/abs/2010.11395
Comments | 5 pages
[34] Kwame: A Bilingual AI Teaching Assistant for Online SuaCode Courses
Authors | George Boateng
Link | https://arxiv.org/abs/2010.11387
[35] A Disentangled Adversarial Neural Topic Model for Separating Opinions from Plots in User Reviews
Authors | Gabriele Pergola, Lin Gui, Yulan He
Link | https://arxiv.org/abs/2010.11384
Comments | 12 pages, 4 figures
[36] Exploit Multiple Reference Graphs for Semi-supervised Relation Extraction
Authors | Wanli Li, Tieyun Qian
Link | https://arxiv.org/abs/2010.11383
[37] Stronger Transformers for Neural Multi-Hop Question Generation
Authors | Devendra Singh Sachan, Lingfei Wu, Mrinmaya Sachan, William Hamilton
Link | https://arxiv.org/abs/2010.11374
Comments | Code will be made available
[38] Latte-Mix: Measuring Sentence Semantic Similarity with Latent Categorical Mixtures
Authors | M. Li, H. Bai, L. Tan, K. Xiong, M. Li, J. Lin
Link | https://arxiv.org/abs/2010.11351
[39] LSTM-LM with Long-Term History for First-Pass Decoding in Conversational Speech Recognition
Authors | Xie Chen, Sarangarajan Parthasarathy, William Gale, Shuangyu Chang, Michael Zeng
Link | https://arxiv.org/abs/2010.11349
Comments | 5 pages
[40] A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks
Authors | Yun Tang, Juan Pino, Changhan Wang, Xutai Ma, Dmitriy Genzel
Link | https://arxiv.org/abs/2010.11338
[41] NADI 2020: The First Nuanced Arabic Dialect Identification Shared Task
Authors | Muhammad Abdul-Mageed, Chiyu Zhang, Houda Bouamor, Nizar Habash
Link | https://arxiv.org/abs/2010.11334
Comments | Accepted in WANLP 2020
[42] Linking Entities to Unseen Knowledge Bases with Arbitrary Schemas
Authors | Yogarshi Vyas, Miguel Ballesteros
Link | https://arxiv.org/abs/2010.11333
[43] Probing and Fine-tuning Reading Comprehension Models for Few-shot Event Extraction
Authors | Rui Feng, Jie Yuan, Chao Zhang
Link | https://arxiv.org/abs/2010.11325
[44] Learning to Summarize Long Texts with Memory Compression and Transfer
Authors | Jaehong Park, Jonathan Pilault, Christopher Pal
Link | https://arxiv.org/abs/2010.11322
[45] Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling
Authors | Wenxuan Zhou, Kevin Huang, Tengyu Ma, Jing Huang
Link | https://arxiv.org/abs/2010.11304
[46] Clustering-based Inference for Zero-Shot Biomedical Entity Linking
Authors | Rico Angell, Nicholas Monath, Sunil Mohan, Nishant Yadav, Andrew McCallum
Link | https://arxiv.org/abs/2010.11253
[47] Improving Simultaneous Translation with Pseudo References
Authors | Junkun Chen, Renjie Zheng, Atsuhito Kita, Mingbo Ma, Liang Huang
Link | https://arxiv.org/abs/2010.11247
Comments | 6 pages
[48] On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries
Authors | Tianze Shi, Chen Zhao, Jordan Boyd-Graber, Hal Daumé III, Lillian Lee
Link | https://arxiv.org/abs/2010.11246
Comments | Findings of ACL: EMNLP 2020
[49] Detection of COVID-19 informative tweets using RoBERTa
Authors | Sirigireddy Dhanalaxmi, Rohit Agarwal, Aman Sinha
Link | https://arxiv.org/abs/2010.11238
[50] Autoregressive Modeling is Misspecified for Some Sequence Distributions
Authors | Chu-Cheng Lin, Aaron Jaech, Xin Li, Matt Gormley, Jason Eisner
Link | https://arxiv.org/abs/2010.11939
[51] AdapterDrop: On the Efficiency of Adapters in Transformers
Authors | Andreas Rücklé, Gregor Geigle, Max Glockner, Tilman Beck, Jonas Pfeiffer, Nils Reimers, Iryna Gurevych
Link | https://arxiv.org/abs/2010.11918
[52] Rethinking Evaluation in ASR: Are Our Models Robust Enough?
Authors | Tatiana Likhomanenko, Qiantong Xu, Vineel Pratap, Paden Tomasello, Jacob Kahn, Gilad Avidov, Ronan Collobert, Gabriel Synnaeve
Link | https://arxiv.org/abs/2010.11745
[53] Spatial Attention as an Interface for Image Captioning Models
Authors | Philipp Sadler
Link | https://arxiv.org/abs/2010.11701
Comments | A thesis submitted in fulfillment of the requirements for the degree Master of Science in Cognitive Systems
[54] Similarity Analysis of Self-Supervised Speech Representations
Authors | Yu-An Chung, Yonatan Belinkov, James Glass
Link | https://arxiv.org/abs/2010.11481
[55] Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
Authors | Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman
Link | https://arxiv.org/abs/2010.11428
Comments | Submitted to ICASSP 2021
[56] Distilling Dense Representations for Ranking using Tightly-Coupled Teachers
Authors | Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin
Link | https://arxiv.org/abs/2010.11386
[57] NU-GAN: High resolution neural upsampling with GAN
Authors | Rithesh Kumar, Kundan Kumar, Vicki Anand, Yoshua Bengio, Aaron Courville
Link | https://arxiv.org/abs/2010.11362
[58] N-ODE Transformer: A Depth-Adaptive Variant of the Transformer Using Neural Ordinary Differential Equations
Authors | Aaron Baier-Reinio, Hans De Sterck
Link | https://arxiv.org/abs/2010.11358
[59] Self-Supervised Contrastive Learning for Efficient User Satisfaction Prediction in Conversational Agents
Authors | Mohammad Kachuee, Hao Yuan, Young-Bum Kim, Sungjin Lee
Link | https://arxiv.org/abs/2010.11230