Fault Localization (FL) is an important first step in software debugging and is mostly manual in the current practice. Many methods have been proposed over the years to automate the FL process, including information retrieval (IR)-based techniques. These methods localize the fault based on the similarity of the reported bug report and the source code. Newer variations of IR-based FL (IRFL) techniques also look into the history of bug reports and leverage them during the localization. However, all existing IRFL techniques limit themselves to the current project’s data (local data). In this study, we introduce, which is an IRFL framework consisting of methods that use models pre-trained on the global data (extracted from open-source benchmark projects). In, we investigate two heuristics: (a) the effect of global data on a state-of-the-art IR-FL technique, namely, and (b) the application of a Word Embedding technique (Doc2Vec) together with global data. Our large-scale experiment on 51 software projects shows that using global data improves on average 6.6% and 4.8% in terms of MRR (Mean Reciprocal Rank) and MAP (Mean Average Precision), with over 14% in a majority (64% and 54% in terms of MRR and MAP, respectively) of the cases. This amount of improvement is significant compared to the improvement rates that five other state-of-the-art IRFL tools provide over. In addition, training the models globally is a one-time offline task with no overhead on ’s run-time fault localization. Our study, however, shows that a Word Embedding-based global solution did not further improve the results.
Wed 12 OctDisplayed time zone: Eastern Time (US & Canada) 
| 16:00 - 18:00 | Technical Session 18 - Testing IIResearch Papers / Tool Demonstrations / Journal-first Papers at Banquet A Chair(s): Darko Marinov University of Illinois at Urbana-Champaign | ||
| 16:0010m Demonstration | Shibboleth: Hybrid Patch Correctness Assessment in Automated Program Repair Tool Demonstrations | ||
| 16:1020m Research paper | Auto Off-Target: Enabling Thorough and Scalable Testing for Complex Software Systems Research PapersDOI Pre-print | ||
| 16:3010m Demonstration | Maktub: Lightweight Robot System Test Creation and Automation Tool Demonstrations | ||
| 16:4020m Paper | Cerebro: Static Subsuming Mutant Selection Journal-first Papers Aayush Garg University of Luxembourg, Milos Ojdanic University of Luxembourg, Renzo Degiovanni SnT, University of Luxembourg, Thierry Titcheu Chekam SES S.A. & University of Luxembourg (SnT), Mike Papadakis University of Luxembourg, Luxembourg, Yves Le Traon University of Luxembourg, LuxembourgLink to publication DOI | ||
| 17:0020m Research paper | Natural Test Generation for Precise Testing of Question Answering SoftwareVirtual Research Papers Qingchao Shen Tianjin University, Junjie Chen Tianjin University, Jie M. Zhang King's College London, Haoyu Wang College of Intelligence and Computing, Tianjin University, Shuang Liu Tianjin University, Menghan Tian College of Intelligence and Computing, Tianjin UniversityPre-print | ||
| 17:2020m Paper | GloBug: Using global data in Fault LocalizationVirtual Journal-first Papers Nima Miryeganeh University of Calgary, Sepehr Hashtroudi University of Calgary, Hadi Hemmati University of CalgaryLink to publication DOI | ||
| 17:4020m Research paper | Selectively Combining Multiple Coverage Goals in Search-Based Unit Test GenerationVirtual Research Papers Zhichao Zhou School of Information Science and Technology, ShanghaiTech University, Yuming Zhou Nanjing University, Chunrong Fang Nanjing University, Zhenyu Chen Nanjing University, Yutian Tang ShanghaiTech UniversityDOI Pre-print | ||

