Mining Software Repositories

View My GitHub Profile

Twitter: @msrconf

MSR Foundational Contribution Award

2017 Tim Menzies, North Carolina State University, USA
“For his pioneering and meticulous efforts in the creation and maintenance of the PROMISE data repository. The PROMISE repository has had a tremendous and widely-recognized impact on raising the bar for rigorous and repeatable software engineering research worldwide.”

MSR Early Career Achievement Award

2017 Abram Hindle, University of Alberta, Canada
“To recognize the rigor, fearlessness, and breadth of his MSR-related research, and for establishing a new area of research related to green-mining.”

Most Influential Papers

2017 How Long Will It Take to Fix This Bug?
Cathrin Weiss, Rahul Premraj, Thomas Zimmermann, and Andreas Zeller
“For paving the way of actionable software analytics.”
2016 Mining email social networks (MSR 2006)
Christian Bird, Alex Gourley, Prem Devanbu, Michael Gertz and Anand Swaminathan
“For their fundational influence on studies of socio-technical activities in software projects.”
2015 When do changes induce fixes? (MSR 2005)
Jacek Sliwerski, Thomas Zimmermann, and Andreas Zeller
“Prior software quality research focused on flagging files with bugs, but the SZZ algorithm by Sliwerski et al. was the first work to focus on flagging faulty changes. By flagging bugs before they get into the code, follow-up research has taken a preventive role instead of a catchup role.”
Honorable Mention: Developer Identification Methods for Integrated Data from Various Sources
Gregorio Robles and Jesus M. Gonzalez-Barahona
“For their pioneering efforts in mining social information. Such information has been the catalyst for many research efforts throughout software engineering.”
2014 Preprocessing CVS Data for Fine-Grained Analysis (MSR 2004)
Thomas Zimmermann and Peter Weißgerber
“For clearly and engagingly presenting practices that stood at the core of early MSR approaches, thus lowering the entry barrier for the researchers worldwide to join this emerging field.”

ACM SIGSOFT Distinguished Papers

Starting 2015 the MSR conference recognized outstanding papers with ACM SIGSOFT Distinguished Paper Awards. Since 2016 there is no separate Best Paper award anymore.

2017 Classifying code comments in Java open-source software systems
Luca Pascarella and Alberto Bacchelli

Some From Here, Some From There: Cross-Project Code Reuse in GitHub
Mohammad Gharehyazie, Baishakhi Ray and Vladimir Filkov
2016 Adressing problems with external validity of repository mining studies through a smart data platform
Fabian Trautsch, Steffen Herbold, Philip Makedonski and Jens Grabowski

Studying the Impact of Switching to a Rapid Release Cycle on Integration Delay of Addressed Issues - An Empirical Study of the Mozilla Firefox Project
Daniel Alencar da Costa, Shane McIntosh, Uirá Kulesza and Ahmed E. Hassan
2015 Do Bugs Foreshadow Vulnerabilities? A Study of the Chromium Project
Felivel Camilo, Andrew Meneely and Meiyappan Nagappan

Characterization and prediction of issue-related risks in software projects
Morakot Choetikertikul, Hoa Khanh Dam, Truyen Tran and Aditya Ghose

MSR Distinguished/Best Papers

Until 2015 the MSR conference recognized outstanding papers with MSR Best Paper Awards or in the case of multiple winners with MSR Distinguished Paper Awards. In 2015 the MSR conference awarded both MSR Best Paper and ACM SIGSOFT Distingusihed Paper Awards.

2015 Do Bugs Foreshadow Vulnerabilities? A Study of the Chromium Project
Felivel Camilo, Andrew Meneely and Meiyappan Nagappan
2014 Towards Building a Universal Defect Prediction Model
Feng Zhang, Audris Mockus, Iman Keivanloo, and Ying Zou

The Impact of Code Review Coverage and Code Review Participation on Software Quality: A Case Study of the Qt, VTK, and ITK Projects
Shane Mcintosh, Yasutaka Kamei, Bram Adams, and Ahmed E. Hassan
2013 Automatically Mining Software-based, Semantically-similar Words from Comment-Code Mappings
Matthew J. Howard, Lori Pollock, K. Vijay-Shanker, and Samir Gupta
2012 Think Locally, Act Globally: Improving Defect and Effort Prediction Models
Nicolas Bettenburg, Meiyappan Nagappan, and Ahmed E. Hassan

Green mining: A methodology of relating software change to power consumption
Abram Hindle
2011 How Developers Use the Dynamic Features of Programming Languages: the Case of Smalltalk
Oscar Callaú, Romain Robbes, Éric Tanter, and David Röthlisberger
2010 Clones: What is that Smell?
Foyzur Rahman, Christian Bird, and Premkumar Devanbu
2009 Mining search topics from a code search engine usage log
Sushil Krishna Bajracharya and Cristina Videira Lopes
2008 AMAP: automatically mining abbreviation expansions in programs to enhance software maintenance tools
Emily Hill, Zachary P. Fry, Haley Boyd, Giriprasad Sridhara, Yana Novikova, Lori L. Pollock, and K. Vijay-Shanker
2007 Identifying Changed Source Code Lines from Version Repositories
Gerardo Canfora, Luigi Cerulo, and Massimiliano Di Penta
2006 Mining Large Software Compilations over Time: Another Perspective of Software Evolution
Gregorio Robles, Jesus M. Gonzalez-Barahona, Martin Michlmayr, and Juan Jose Amor

Best Data Showcase Award

2017 A Data Set of OCL Expressions on GitHub
Jeroen F.H. Noten, Josh G.M. Mengerink, Alexander Serebrenik
2016 Data Sets: The Circle of Life in Ruby Hosting, 2003-2015
Megan Squire
2015 A Repository with 44 Years of Unix Evolution
Diomidis Spinellis
2014 A dataset for pull-based development research
Georgios Gousios and Andy Zaidman
2013 The GHTorent Dataset and Tool Suite
Georgios Gousios

Mining Challenge Winners

2017 How Does Contributors’ Involvement Influence the Build Status of an Open-Source Software Project?
Marcel Rebouças, Renato Oliveira Dos Santos, Gustavo Pinto and Fernando Castor
2016 Judging a commit by its cover: Correlating commit message entropy with build status on Travis-CI
Eddie Antonio Santos and Abram Hindle
2015 Mining StackOverflow to Filter out Off-topic IRC Discussion
Shaiful Chowdhury and Abram Hindle
2014 Sentiment Analysis of Commit Messages in GitHub: An Empirical Study
Emitza Guzman, David Azócar, and Yang Li

Honorable mention: Do developers discuss design?
João Brunet, Gail C. Murphy, Ricardo Terra, Jorge Figueiredo, and Dalton Serey

Honorable mention: A Study of External Community Contribution to Open-Source Projects on GitHub
Rohan Padhye, Senthil Mani, and Vibha Singhal Sinha
2013 Encouraging User Behaviour with Achievements: An Empirical Study
Scott Grant and Buddy Betts
2012 Do the stars align? Multidimensional analysis of Android’s layered architecture
Victor Guana, Fabio Rocha, Abram Hindle and Eleni Stroulia
2011 Apples Vs. Oranges? An exploration of the challenges of comparing the source code of two software systems
Daniel M. German and Julius Davies
2010 Cloning and Copying between GNOME Projects
Jens Krinke, Nicolas Gold, Yue Jia, and David Binkley
2009 On the use of Internet Relay Chat (IRC) meeting by developers of the GNOME GTK+ project
Emad Shihab, Zhen Ming Jiang, and Ahmed E. Hassan
2008 A newbie’s guide to Eclipse APIs
Reid Holmes and Robert J. Walker
2007 Mining Eclipse Developer Contributions via Author-Topic Models
Erik Linstead, Paul Rigor, Sushil Bajracharya, Cristina Lopes, and Pierre Baldi
2006 A study of the contributors of PostgreSQL
Daniel M. German