MSR 2025 - Tutorials

Agents for Software Development

Graham Neubig (Carnegie Mellon University)

Software is one of the most powerful tools that we humans have at our disposal; it allows a skilled programmer to interact with the world in complex and profound ways. However, at the same time software systems are complex, fragile, and even dangerous. Can we develop AI agents that help us develop real-world software, particularly in the context of real-world software development tasks in large software repositories, in all their complexity? In this tutorial I will discuss the state-of-the-art in software development agents, including challenges with respect to identifying which files to edit, how to edit them, how to test edits and recover, and how to train and evaluate models. In addition, I will address some challenges beyond simple writing code, such as how to process multimodal data, how to combine web browsing with coding, and how to perform data science tasks with software development models. I will provide examples from OpenHands, an open-source toolkit that implements many of the methods that I discuss: https://github.com/All-Hands-AI/OpenHands

Harmonized Coding with AI: LLMs for Qualitative Analysis in Software Engineering Research

Christoph Treude (Singapore Management University), Youmei Fan (Nara Institute of Science and Technology), Tao Xiao (Nara Institute of Science and Technology), and Hideaki Hata (Shinshu University)

Qualitative bottom-up coding is essential for identifying themes and patterns in complex data. This tutorial demonstrates how LLMs such as ChatGPT can support the qualitative coding process for software engineering research. Participants will walk through an entire coding exercise, learning to identify themes through open coding, consolidate themes by refining and merging codes, and conduct inter-rater agreement by standardizing codebooks and testing agreement between human and AI coders. Using hands-on exercises and real-world examples, this session highlights effective human-AI collaboration and strategies to ensure transparency, trustworthiness, and methodological rigor. Designed for researchers at all levels of experience, this tutorial equips participants with practical techniques to analyze software engineering data.

Time Zone

The program is currently displayed in (GMT-04:00) Eastern Time (US & Canada).

Use conference time zone: (GMT-04:00) Eastern Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

You're viewing the program in a time zone which is different from your device's time zone change time zone

Mon 28 Apr
Displayed time zone: Eastern Time (US & Canada) change

09:00 - 10:30	Plenary: Opening + Joint MSR + ICPC KeynoteProgram / Keynotes at 214 Chair(s): Bram Adams Queen's University, Olga Baysal Carleton University, Michael W. Godfrey University of Waterloo, Canada, Ayushi Rastogi University of Groningen, The Netherlands

09:00 30m Day opening		Official Opening Program
09:30 60m Keynote		Mining BOMs for Improving Supply Chain Efficiency & Resilience Keynotes Kate Stewart Linux Foundation File Attached

11:00 - 12:30	Defects, bugs, and issuesData and Tool Showcase Track / Technical Papers / Registered Reports / Program at 214 Chair(s): Minhaz Zibran Idaho State University

11:00 10m Talk		Learning from Mistakes: Understanding Ad-hoc Logs through Analyzing Accidental Commits Technical Papers Yi-Hung Chou University of California, Irvine, Yiyang Min Amazon, April Wang ETH Zürich, James Jones University of California at Irvine Pre-print
11:10 10m Talk		On the calibration of Just-in-time Defect Prediction Technical Papers Xhulja Shahini paluno - University of Duisburg-Essen, Jone Bartel University of Duisburg-Essen, paluno, Klaus Pohl University of Duisburg-Essen, paluno
11:20 10m Talk		An Empirical Study on Leveraging Images in Automated Bug Report Reproduction Technical Papers Dingbang Wang University of Connecticut, Zhaoxu Zhang University of Southern California, Sidong Feng Monash University, William G.J. Halfond University of Southern California, Tingting Yu University of Connecticut
11:30 10m Talk		It’s About Time: An Empirical Study of Date and Time Bugs in Open-Source Python SoftwareTechnical Track Distinguished Paper Award Technical Papers Shrey Tiwari Carnegie Mellon University, Serena Chen University of California, San Diego, Alexander Joukov Stony Brook University, Peter Vandervelde University of California, Santa Barbara, Ao Li Carnegie Mellon University, Rohan Padhye Carnegie Mellon University Pre-print
11:40 10m Talk		Enhancing Just-In-Time Defect Prediction Models with Developer-Centric Features Technical Papers Emanuela Guglielmi University of Molise, Andrea D'Aguanno University of Molise, Rocco Oliveto University of Molise, Simone Scalabrino University of Molise
11:50 10m Talk		Revisiting Defects4J for Fault Localization in Diverse Development Scenarios Technical Papers Md Nakhla Rafi Concordia University, An Ran Chen University of Alberta, Tse-Hsun (Peter) Chen Concordia University, Shaohua Wang Central University of Finance and Economics
12:00 5m Talk		Mining Bug Repositories for Multi-Fault Programs Data and Tool Showcase Track Dylan Callaghan Stellenbosch University, Bernd Fischer Stellenbosch University
12:05 5m Talk		HaPy-Bug - Human Annotated Python Bug Resolution Dataset Data and Tool Showcase Track Piotr Przymus Nicolaus Copernicus University in Toruń, Poland, Mikołaj Fejzer Nicolaus Copernicus University in Toruń, Jakub Narębski Nicolaus Copernicus University in Toruń, Radosław Woźniak Nicolaus Copernicus University in Toruń, Łukasz Halada University of Wrocław, Poland, Aleksander Kazecki Nicolaus Copernicus University in Toruń, Mykhailo Molchanov Igor Sikorsky Kyiv Polytechnic Institute, Ukraine, Krzysztof Stencel University of Warsaw Pre-print File Attached
12:10 5m Talk		SPRINT: An Assistant for Issue Report Management Data and Tool Showcase Track Ahmed Adnan , Antu Saha William & Mary, Oscar Chaparro William & Mary Pre-print
12:15 5m Talk		Identifying and Replicating Code Patterns Driving Performance Regressions in Software Systems Registered Reports Denivan Campos University of Molise, Luana Martins University of Salerno, Emanuela Guglielmi University of Molise, Michele Tucci University of L'Aquila, Daniele Di Pompeo University of L'Aquila, Simone Scalabrino University of Molise, Vittorio Cortellessa University of L'Aquila, Dario Di Nucci University of Salerno, Rocco Oliveto University of Molise

11:00 - 12:30	Security and legal aspectsIndustry Track / Data and Tool Showcase Track / Technical Papers / Program at 215 Chair(s): Mohammad Ghafari TU Clausthal

11:00 10m Talk		Wolves in the Repository: A Software Engineering Analysis of the XZ Utils Supply Chain Attack Technical Papers Piotr Przymus Nicolaus Copernicus University in Toruń, Poland, Thomas Durieux TU Delft Pre-print
11:10 10m Talk		Software Composition Analysis and Supply Chain Security in Apache Projects: an Empirical Study Technical Papers Sabato Nocera University of Salerno, Sira Vegas Universidad Politecnica de Madrid, Giuseppe Scanniello University of Salerno, Natalia Juristo Universidad Politecnica de Madrid Pre-print
11:20 10m Talk		Good practice versus reality: a landscape analysis of Research Software metadata adoption in European Open Science Clusters Technical Papers Anas El Hounsri Universidad Politécnica de Madrid, Daniel Garijo Universidad Politécnica de Madrid
11:30 10m Talk		Towards Security Commit Message Standardization Technical Papers Sofia Reis Instituto Superior Técnico, U. Lisboa & INESC-ID, Rui Abreu Faculty of Engineering of the University of Porto, Portugal, Corina Pasareanu CMU, NASA, KBR
11:40 10m Talk		From Industrial Practices to Academia: Uncovering the Gap in Vulnerability Research and Practice Technical Papers Zhuang Liu , Xing Hu Zhejiang University, Jiayuan Zhou Queen's University, Xin Xia Huawei
11:50 5m Talk		Patch Me If You Can—Securing the Linux Kernel Industry Track Gunnar Kudrjavets Amazon Web Services, USA Pre-print
11:55 5m Talk		OSS License Identification at Scale: A Comprehensive Dataset Using World of Code Data and Tool Showcase Track Mahmoud Jahanshahi University of Tennessee, David Reid University of Tennessee, Adam McDaniel University of Tennessee Knoxville, Audris Mockus University of Tennessee
12:00 5m Talk		SCRUBD: Smart Contracts Reentrancy and Unhandled Exceptions Vulnerability Dataset Data and Tool Showcase Track Chavhan Sujeet Yashavant Indian Institute of Technology, Kanpur, Mitrajsinh Chavda Indian Institute of Technology Kanpur, India, Saurabh Kumar Indian Institute of Technology Hyderabad, India, Amey Karkare IIT Kanpur, Angshuman Karmakar Indian Institute of Technology Kanpur, India Pre-print
12:05 5m Talk		ICVul: A Well-labeled C/C++ Vulnerability Dataset with Comprehensive Metadata and VCCs Data and Tool Showcase Track Chaomeng Lu DistriNet Group-T, KU Leuven, Tianyu Li DistriNet Group-T, KU Leuven, Toon Dehaene KU Leuven, Bert Lagaisse DistriNet Group-T, KU Leuven Pre-print
12:10 5m Talk		A Dataset of Software Bill of Materials for Evaluating SBOM Consumption Tools Data and Tool Showcase Track Rio Kishimoto Osaka University, Tetsuya Kanda Notre Dame Seishin University, Yuki Manabe The University of Fukuchiyama, Katsuro Inoue Nanzan University, Shi Qiu Toshiba, Yoshiki Higo Osaka University Pre-print
12:15 5m Talk		Wild SBOMs: a Large-scale Dataset of Software Bills of Materials from Public Code Data and Tool Showcase Track Luis Soeiro LTCI, Télécom Paris, Institut Polytechnique de Paris, Thomas Robert LTCI, Télécom Paris, Institut Polytechnique de Paris, Stefano Zacchiroli LTCI, Télécom Paris, Institut Polytechnique de Paris, Palaiseau, France Pre-print
12:20 5m Talk		MaLAware: Automating the Comprehension of Malicious Software Behaviours using Large Language Models (LLMs) Data and Tool Showcase Track BIKASH SAHA Indian Institute of Technology Kanpur, Nanda Rani Indian Institute of Technology Kanpur, Sandeep K. Shukla Indian Institute of Technology Kanpur Pre-print

13:00 - 14:00	MSR Poster (Monday)Data and Tool Showcase Track / Technical Papers / Mining Challenge / Program at Canada Hall 3 Poster Area

13:00 60m Talk		SPRINT: An Assistant for Issue Report Management Data and Tool Showcase Track Ahmed Adnan , Antu Saha William & Mary, Oscar Chaparro William & Mary Pre-print
13:00 60m Talk		Combining Large Language Models with Static Analyzers for Code Review Generation Technical Papers Imen Jaoua DIRO, Université de Montréal, Oussama Ben Sghaier DIRO, Université de Montréal, Houari Sahraoui DIRO, Université de Montréal Pre-print
13:00 60m Talk		Can LLMs Replace Manual Annotation of Software Engineering Artifacts?Technical Track Distinguished Paper Award Technical Papers Toufique Ahmed IBM Research, Prem Devanbu University of California at Davis, Christoph Treude Singapore Management University, Michael Pradel University of Stuttgart Pre-print
13:00 60m Talk		Dependency Update Adoption Patterns in the Maven Software Ecosystem Mining Challenge Baltasar Berretta College of Wooster, Augustus Thomas College of Wooster, Heather Guarnera The College of Wooster
13:00 60m Talk		Popularity and Innovation in Maven Central Mining Challenge Nkiru Ede Victoria University of Wellington, Jens Dietrich Victoria University of Wellington, Ulrich Zülicke Victoria University of Wellington Pre-print
13:00 60m Talk		Chasing the Clock: How Fast Are Vulnerabilities Fixed in the Maven Ecosystem? Mining Challenge Md Fazle Rabbi Idaho State University, Arifa Islam Champa Idaho State University, Rajshakhar Paul Wayne State University, Minhaz F. Zibran Idaho State University Pre-print
13:00 60m Talk		SCRUBD: Smart Contracts Reentrancy and Unhandled Exceptions Vulnerability Dataset Data and Tool Showcase Track Chavhan Sujeet Yashavant Indian Institute of Technology, Kanpur, Mitrajsinh Chavda Indian Institute of Technology Kanpur, India, Saurabh Kumar Indian Institute of Technology Hyderabad, India, Amey Karkare IIT Kanpur, Angshuman Karmakar Indian Institute of Technology Kanpur, India Pre-print
13:00 60m Talk		TerraDS: A Dataset for Terraform HCL Programs Data and Tool Showcase Track Christoph Buehler University of St. Gallen, David Spielmann University of St. Gallen, Roland Meier armasuisse, Guido Salvaneschi University of St. Gallen Pre-print
13:00 60m Talk		Mining a Decade of Contributor Dynamics in Ethereum: A Longitudinal StudyFOSS Award Technical Papers Matteo Vaccargiu University of Cagliari, Sabrina Aufiero University College London (UCL), Cheick Ba Queen Mary University of London, Silvia Bartolucci University College London, Richard Clegg Queen Mary University London, Daniel Graziotin University of Hohenheim, Rumyana Neykova Brunel University London, Roberto Tonelli University of Cagliari, Giuseppe Destefanis Brunel University of London Pre-print
13:00 60m Talk		CoMRAT: Commit Message Rationale Analysis Tool Data and Tool Showcase Track Mouna Dhaouadi University of Montreal, Bentley Oakes Polytechnique Montréal, Michalis Famelis Université de Montréal Pre-print Media Attached File Attached
13:00 60m Talk		A Dataset of Software Bill of Materials for Evaluating SBOM Consumption Tools Data and Tool Showcase Track Rio Kishimoto Osaka University, Tetsuya Kanda Notre Dame Seishin University, Yuki Manabe The University of Fukuchiyama, Katsuro Inoue Nanzan University, Shi Qiu Toshiba, Yoshiki Higo Osaka University Pre-print
13:00 60m Talk		A Dataset of Contributor Activities in the NumFocus Open-Source CommunityData/Tool Track Distinguished Dataset Award Data and Tool Showcase Track Youness Hourri University of Mons, Alexandre Decan University of Mons; F.R.S.-FNRS, Tom Mens University of Mons Pre-print
13:00 60m Talk		Does Functional Package Management Enable Reproducible Builds at Scale? Yes.Technical Track Distinguished Paper Award Technical Papers Julien Malka LTCI, Télécom Paris, Institut Polytechnique de Paris, France, Stefano Zacchiroli LTCI, Télécom Paris, Institut Polytechnique de Paris, Palaiseau, France, Théo Zimmermann Télécom Paris, Polytechnic Institute of Paris Pre-print
13:00 60m Talk		HaPy-Bug - Human Annotated Python Bug Resolution Dataset Data and Tool Showcase Track Piotr Przymus Nicolaus Copernicus University in Toruń, Poland, Mikołaj Fejzer Nicolaus Copernicus University in Toruń, Jakub Narębski Nicolaus Copernicus University in Toruń, Radosław Woźniak Nicolaus Copernicus University in Toruń, Łukasz Halada University of Wrocław, Poland, Aleksander Kazecki Nicolaus Copernicus University in Toruń, Mykhailo Molchanov Igor Sikorsky Kyiv Polytechnic Institute, Ukraine, Krzysztof Stencel University of Warsaw Pre-print File Attached
13:00 60m Talk		Do LLMs Provide Links to Code Similar to what they Generate? A Study with Gemini and Bing CoPilot Technical Papers Daniele Bifolco University of Sannio, Pietro Cassieri University of Salerno, Giuseppe Scanniello University of Salerno, Massimiliano Di Penta University of Sannio, Italy, Fiorella Zampetti University of Sannio, Italy Pre-print
13:00 60m Talk		Out of Sight, Still at Risk: The Lifecycle of Transitive Vulnerabilities in Maven Mining Challenge Piotr Przymus Nicolaus Copernicus University in Toruń, Poland, Mikołaj Fejzer Nicolaus Copernicus University in Toruń, Jakub Narębski Nicolaus Copernicus University in Toruń, Krzysztof Rykaczewski Nicolaus Copernicus University in Toruń, Poland, Krzysztof Stencel University of Warsaw Pre-print
13:00 60m Talk		Refactoring for Dockerfile Quality: A Dive into Developer Practices and Automation Potential Technical Papers Emna Ksontini University of Michigan, Meriem Mastouri University of Michigan, Rania Khalsi University of Michigan - Flint, Wael Kessentini DePaul University
13:00 60m Talk		Cascading Effects: Analyzing Project Failure Impact in the Maven Central Ecosystem Mining Challenge Mina Shehata Belmont University, Saidmakhmud Makhkamjonoov Belmont University, Mahad Syed Belmont University, Esteban Parra Rodriguez Belmont University
13:00 60m Talk		MaLAware: Automating the Comprehension of Malicious Software Behaviours using Large Language Models (LLMs) Data and Tool Showcase Track BIKASH SAHA Indian Institute of Technology Kanpur, Nanda Rani Indian Institute of Technology Kanpur, Sandeep K. Shukla Indian Institute of Technology Kanpur Pre-print
13:00 60m Talk		Investigating the Understandability of Review Comments on Code Change Requests Technical Papers Md Shamimur Rahman University of Saskatchewan, Zadia Codabux University of Saskatchewan, Chanchal K. Roy University of Saskatchewan

14:00 - 15:30	AI for SE (1)Technical Papers / Data and Tool Showcase Track / Registered Reports / Program at 214 Chair(s): Diego Elias Costa Concordia University, Canada

14:00 10m Talk		Combining Large Language Models with Static Analyzers for Code Review Generation Technical Papers Imen Jaoua DIRO, Université de Montréal, Oussama Ben Sghaier DIRO, Université de Montréal, Houari Sahraoui DIRO, Université de Montréal Pre-print
14:10 10m Talk		Harnessing Large Language Models for Curated Code Reviews Technical Papers Oussama Ben Sghaier DIRO, Université de Montréal, Martin Weyssow Singapore Management University, Houari Sahraoui DIRO, Université de Montréal Pre-print
14:20 10m Talk		SMATCH-M-LLM: Semantic Similarity in Metamodel Matching With Large Language Models Technical Papers Nafisa Ahmed Polytechnique Montreal, Hin Chi Kwok Hong Kong Polytechnic University, Mohammad Hamdaqa Polytechnique Montreal, Wesley K.G. Assunção Johannes Kepler University Linz
14:30 10m Talk		How Effective are LLMs for Data Science Coding? A Controlled ExperimentTechnical Track Distinguished Paper Award Technical Papers Nathalia Nascimento Pennsylvania State University, Everton Guimaraes Pennsylvania State University, Sai Sanjna Chintakunta Pennsylvania State University, Santhosh AB Pennsylvania State University Pre-print
14:40 10m Talk		Do LLMs Provide Links to Code Similar to what they Generate? A Study with Gemini and Bing CoPilot Technical Papers Daniele Bifolco University of Sannio, Pietro Cassieri University of Salerno, Giuseppe Scanniello University of Salerno, Massimiliano Di Penta University of Sannio, Italy, Fiorella Zampetti University of Sannio, Italy Pre-print
14:50 10m Talk		Too Noisy To Learn: Enhancing Data Quality for Code Review Comment Generation Technical Papers Chunhua Liu The University of Melbourne, Hong Yi Lin The University of Melbourne, Patanamon Thongtanunam University of Melbourne
15:00 5m Talk		Should Code Models Learn Pedagogically? A Preliminary Evaluation of Curriculum Learning for Real-World Software Engineering Tasks Technical Papers Kyi Shin Khant The University of Melbourne, Hong Yi Lin The University of Melbourne, Patanamon Thongtanunam University of Melbourne
15:05 5m Talk		RepoChat: An LLM-Powered Chatbot for GitHub Repository Question-Answering Data and Tool Showcase Track Samuel Abedu Concordia University, Laurine Menneron CESI Graduate School of Engineering, SayedHassan Khatoonabadi Concordia University, Montreal, Emad Shihab Concordia University, Montreal
15:10 5m Talk		How do Copilot Suggestions Impact Developers' Frustration and Productivity? Registered Reports Emanuela Guglielmi University of Molise, Venera Arnaoudova Washington State University, Gabriele Bavota Software Institute @ Università della Svizzera Italiana, Rocco Oliveto University of Molise, Simone Scalabrino University of Molise
15:15 5m Talk		Exploring the Lifecycle and Maintenance Practices of Pre-Trained Models in Open-Source Software Repositories Registered Reports Matin Koohjani Concordia University, Diego Elias Costa Concordia University, Canada Pre-print

14:00 - 15:30	MSR 2025 Mining ChallengeMining Challenge / Program at 215 Chair(s): Joyce El Haddad Université Paris Dauphine - PSL , Damien Jaime Université Paris Nanterre & LIP6, Pascal Poizat Université Paris Nanterre & LIP6

14:00 4m Talk		Analyzing Dependency Clusters and Security Risks in the Maven Central Repository Mining Challenge George Lake Idaho State University, Minhaz F. Zibran Idaho State University
14:04 4m Talk		Chasing the Clock: How Fast Are Vulnerabilities Fixed in the Maven Ecosystem? Mining Challenge Md Fazle Rabbi Idaho State University, Arifa Islam Champa Idaho State University, Rajshakhar Paul Wayne State University, Minhaz F. Zibran Idaho State University Pre-print
14:08 4m Talk		Decoding Dependency Risks: A Quantitative Study of Vulnerabilities in the Maven Ecosystem Mining Challenge Costain Nachuma Idaho State University, Md Mosharaf Hossan Idaho State University, Asif Kamal Turzo Wayne State University, Minhaz F. Zibran Idaho State University Pre-print
14:12 4m Talk		Faster Releases, Fewer Risks: A Study on Maven Artifact Vulnerabilities and Lifecycle ManagementChallenge Track Best Mining Challenge Paper Mining Challenge Md Shafiullah Shafin Rajshahi University of Engineering & Technology (RUET), Md Fazle Rabbi Idaho State University, S. M. Mahedy Hasan Rajshahi University of Engineering & Technology, Minhaz F. Zibran Idaho State University Pre-print
14:16 4m Talk		Insights into Dependency Maintenance Trends in the Maven Ecosystem Mining Challenge Barisha Chowdhury Rajshahi University of Engineering & Technology, Md Fazle Rabbi Idaho State University, S. M. Mahedy Hasan Rajshahi University of Engineering & Technology, Minhaz F. Zibran Idaho State University Pre-print
14:20 4m Talk		Insights into Vulnerability Trends in Maven Artifacts: Recurrence, Popularity, and User Behavior Mining Challenge Courtney Bodily Idaho State University, Eric Hill Idaho State University, Andreas Kramer Idaho State University, Leslie Kerby Idaho State University, Minhaz F. Zibran Idaho State University
14:24 4m Talk		Understanding Software Vulnerabilities in the Maven Ecosystem: Patterns, Timelines, and Risks Mining Challenge Md Fazle Rabbi Idaho State University, Rajshakhar Paul Wayne State University, Arifa Islam Champa Idaho State University, Minhaz F. Zibran Idaho State University Pre-print
14:28 4m Talk		Dependency Update Adoption Patterns in the Maven Software Ecosystem Mining Challenge Baltasar Berretta College of Wooster, Augustus Thomas College of Wooster, Heather Guarnera The College of Wooster
14:32 4m Talk		Analyzing Vulnerability Overestimation in Software Projects Mining Challenge Taha Draoui University of Michigan-Flint, Faten Jebari University of Michigan-Flint, Chawki Ben Slimen University of Michigan-Flint, Munjaap Uppal University of Michigan-Flint, Mohamed Wiem Mkaouer University of Michigan - Flint
14:36 4m Talk		Dependency Dilemmas: A Comparative Study of Independent and Dependent Artifacts in Maven Ecosystem Mining Challenge Mehedi Hasan Shanto Khulna University, Muhammad Asaduzzaman University of Windsor, Manishankar Mondal Khulna University, Shaiful Chowdhury University of Manitoba Pre-print
14:40 4m Talk		Cascading Effects: Analyzing Project Failure Impact in the Maven Central Ecosystem Mining Challenge Mina Shehata Belmont University, Saidmakhmud Makhkamjonoov Belmont University, Mahad Syed Belmont University, Esteban Parra Rodriguez Belmont University
14:45 4m Talk		Do Developers Depend on Deprecated Library Versions? A Mining Study of Log4j Mining Challenge Haruhiko Yoshioka Nara Institute of Science and Technology, Sila Lertbanjongngam Nara Institute of Science and Technology, Masayuki Inaba Nara Institute of Science and Technology, Youmei Fan Nara Institute of Science and Technology, Takashi Nakano Nara Institute of Science and Technology, Kazumasa Shimari Nara Institute of Science and Technology, Raula Gaikovina Kula The University of Osaka, Kenichi Matsumoto Nara Institute of Science and Technology Pre-print
14:49 4m Talk		Mining for Lags in Updating Critical Security Threats: A Case Study of Log4j Library Mining Challenge Hidetake Tanaka Nara Institute of Science and Technology, Kazuma Yamasaki Nara Institute of Science and Technology, Momoka Hirose Nara Institute of Science and Technology, Takashi Nakano Nara Institute of Science and Technology, Youmei Fan Nara Institute of Science and Technology, Kazumasa Shimari Nara Institute of Science and Technology, Raula Gaikovina Kula The University of Osaka, Kenichi Matsumoto Nara Institute of Science and Technology Pre-print
14:53 4m Talk		On the Evolution of Unused Dependencies in Java Project Releases: An Empirical Study Mining Challenge Nabhan Suwanachote Nara Institute of Science and Technology, Yagut Shakizada Nara Institute of Science and Technology, Yutaro Kashiwa Nara Institute of Science and Technology, Bin Lin Hangzhou Dianzi University, Hajimu Iida Nara Institute of Science and Technology
14:57 4m Talk		Out of Sight, Still at Risk: The Lifecycle of Transitive Vulnerabilities in Maven Mining Challenge Piotr Przymus Nicolaus Copernicus University in Toruń, Poland, Mikołaj Fejzer Nicolaus Copernicus University in Toruń, Jakub Narębski Nicolaus Copernicus University in Toruń, Krzysztof Rykaczewski Nicolaus Copernicus University in Toruń, Poland, Krzysztof Stencel University of Warsaw Pre-print
15:01 4m Talk		Popularity and Innovation in Maven Central Mining Challenge Nkiru Ede Victoria University of Wellington, Jens Dietrich Victoria University of Wellington, Ulrich Zülicke Victoria University of Wellington Pre-print
15:05 4m Talk		Software Bills of Materials in Maven Central Mining Challenge Yogya Gamage Universtité de Montréal, Nadia Gonzalez Fernandez Université de Montréal, Martin Monperrus KTH Royal Institute of Technology, Benoit Baudry Université de Montréal
15:09 4m Talk		The Ripple Effect of Vulnerabilities in Maven Central: Prevalence, Propagation, and Mitigation Challenges Mining Challenge Ehtisham Ul Haq York University, Song Wang York University, Robert S Allison York University
15:13 4m Talk		Tracing Vulnerabilities in Maven: A Study of CVE lifecycles and Dependency Networks Mining Challenge Corey Yang-Smith University of Calgary, Ahmad Abdellatif University of Calgary Pre-print
15:17 4m Talk		Understanding Abandonment and Slowdown Dynamics in the Maven EcosystemChallenge Track Best Student Presentation Award Mining Challenge Kazi Amit Hasan Queen's University, Canada, Jerin Yasmin Queen's University, Canada, Huizi Hao Queen's University, Canada, Yuan Tian Queen's University, Kingston, Ontario, Safwat Hassan University of Toronto, Steven Ding Pre-print
15:21 4m Talk		Characterizing Packages for Vulnerability Prediction Mining Challenge Saviour Owolabi University of Calgary, Francesco Rosati University of Calgary, Ahmad Abdellatif University of Calgary, Lorenzo De Carli University of Calgary, Canada
15:25 4m Talk		Understanding the Popularity of Packages in Maven Ecosystem Mining Challenge Sadman Jashim Sakib University of Windsor, Muhammad Asaduzzaman University of Windsor, Curtis Bright University of Windsor, Cole Morgan University of Windsor Pre-print

16:00 - 17:30	LLMs for CodeTechnical Papers / Data and Tool Showcase Track / Tutorials / Program at 214 Chair(s): Ali Ouni ETS Montreal, University of Quebec

16:00 10m Talk		How Much Do Code Language Models Remember? An Investigation on Data Extraction Attacks before and after Fine-tuningTechnical Track Distinguished Paper Award Technical Papers Fabio Salerno Delft University of Technology, Ali Al-Kaswan Delft University of Technology, Netherlands, Maliheh Izadi Delft University of Technology
16:10 10m Talk		Can LLMs Generate Higher Quality Code Than Humans? An Empirical Study Technical Papers Mohammad Talal Jamil Lahore University of Management Sciences, Shamsa Abid National University of Computer and Emerging Sciences, Shafay Shamail LUMS, DHA, Lahore Pre-print
16:20 10m Talk		Prompt Engineering or Fine-Tuning: An Empirical Assessment of LLMs for Code Technical Papers Jiho Shin York University, Clark Tang , Tahmineh Mohati University of Calgary, Maleknaz Nayebi York University, Song Wang York University, Hadi Hemmati York University
16:30 5m Talk		Drawing Pandas: A Benchmark for LLMs in Generating Plotting Code Data and Tool Showcase Track Timur Galimzyanov JetBrains Research, Sergey Titov JetBrains Research, Yaroslav Golubev JetBrains Research, Egor Bogomolov JetBrains Research Pre-print
16:35 5m Talk		SnipGen: A Mining Repository Framework for Evaluating LLMs for Code Data and Tool Showcase Track Daniel Rodriguez-Cardenas William & Mary, Alejandro Velasco William & Mary, Denys Poshyvanyk William & Mary Pre-print
16:50 40m Tutorial		Harmonized Coding with AI: LLMs for Qualitative Analysis in Software Engineering Research Tutorials Christoph Treude Singapore Management University, Youmei Fan Nara Institute of Science and Technology, Tao Xiao Kyushu University, Hideaki Hata Shinshu University File Attached

16:00 - 17:30	Software evolution and analysisData and Tool Showcase Track / Technical Papers / Industry Track / Program at 215 Chair(s): Mauricio Verano Merino Vrije Universiteit Amsterdam

16:00 10m Talk		50 Years of Programming Language Evolution through the Software Heritage looking glass Technical Papers Adèle Desmazières Sorbonne Unversité, Roberto Di Cosmo Inria, France / University of Paris Diderot, France, Valentin Lorentz Inria Foundation Pre-print
16:10 10m Talk		It Works (only) on My Machine: A Study on Reproducibility Smells in Ansible Scripts Technical Papers Ghazal Sobhani Dalhousie University, Israat Haque Dalhousie University, Tushar Sharma Dalhousie University Pre-print
16:20 10m Talk		Are the Majority of Public Computational Notebooks Pathologically Non-Executable? Technical Papers Waris Gill Virginia Tech, Muhammad Ali Gulzar Virginia Tech, Tien Nguyen Virginia Tech Pre-print
16:30 10m Talk		Understanding Test Deletion in Java Applications Technical Papers Suraj Bhatta North Dakota State University, Frank Kendemah North Dakota State University, Ajay Jha North Dakota State University Pre-print
16:40 10m Talk		A Public Benchmark of REST APIs Technical Papers Alix Decrop University of Namur, Sara Eraso University of Valle, Xavier Devroey University of Namur, Gilles Perrouin Fonds de la Recherche Scientifique - FNRS & University of Namur Pre-print
16:50 5m Talk		What Do Contribution Guidelines Say About Software Testing? Technical Papers Bruna Pereira Falcucci UFMG, Felipe Gomide UFMG, Andre Hora UFMG Pre-print Media Attached
16:55 5m Talk		Measuring InnerSource Value Industry Track Chamindra de Silva Citibank, Daniel Izquierdo-Cortazar Bitergia
17:00 5m Talk		CoUpJava: A Dataset of Code Upgrade Histories in Open-Source Java Repositories Data and Tool Showcase Track Kaihang Jiang University of Waterloo, Bihui Jin University of Waterloo, Pengyu Nie University of Waterloo
17:05 5m Talk		EvoChain: A Framework for Tracking and Visualizing Smart Contract Evolution Data and Tool Showcase Track Ilham Qasse Reykjavik University, Mohammad Hamdaqa Polytechnique Montreal, Björn Þór Jónsson Reykjavik University
17:10 5m Talk		CoDocBench: A Dataset for Code-Documentation Alignment in Software Maintenance Data and Tool Showcase Track Kunal Suresh Pai UC Davis, Prem Devanbu University of California at Davis, Toufique Ahmed IBM Research Pre-print
17:15 5m Talk		RefExpo: Unveiling Software Project Structures through Advanced Dependency Graph Extraction Data and Tool Showcase Track Vahid Haratian Bilkent Univeristy, Pouria Derakhshanfar JetBrains Research, Vladimir Kovalenko JetBrains Research, Eray Tüzün Bilkent University
17:20 5m Talk		HyperAST: Incrementally Mining Large Source Code Repositories Data and Tool Showcase Track Quentin Le Dilavrec TU Delft, Netherlands, Andy Zaidman TU Delft Pre-print

Tue 29 Apr
Displayed time zone: Eastern Time (US & Canada) change

09:00 - 10:30	Plenary: MIP + FCAMIP Award / FOSS Award / Vision and Reflection / Program at 214 Chair(s): Gabriele Bavota Software Institute @ Università della Svizzera Italiana, Jin L.C. Guo McGill University, Audris Mockus The University of Tennessee, Knoxville / Vilnius University, Martin Pinzger Universität Klagenfurt, Romain Robbes CNRS, LaBRI, University of Bordeaux, Patanamon Thongtanunam The University of Melbourne

09:00 30m Awards		MSR 2025 Most Influential Paper Award: Toward deep learning software repositoriesMost Influential Paper Award MIP Award Martin White Syneos Health, Christopher Vendome Miami University, Mario Linares-Vásquez Universidad de los Andes, Denys Poshyvanyk William & Mary
09:30 30m Awards		Myriad People Open Source Software for New Media ArtsFOSS Award (Runner-up) FOSS Award Benoit Baudry Université de Montréal, Erik Natanael Gustafsson Independent artist, Roni Kaufman Independent artist, Maria Kling Independent artist Pre-print
10:00 30m Talk		The Standard of Rigor for MSR Research: A 20-Year Evolution Vision and Reflection Bogdan Vasilescu Raj Reddy Associate Professor of Software and Societal Systems, Carnegie Mellon University, USA Pre-print

11:00 - 12:30	Software ecosystems and humansData and Tool Showcase Track / Technical Papers / Program at 214 Chair(s): Ahmad Abdellatif University of Calgary

11:00 10m Talk		The Ecosystem of Open-Source Music Production Software – A Mining Study on the Development Practices of VST Plugins on GitHub Technical Papers Andrei Bogdan University of Amsterdam, Mauricio Verano Merino Vrije Universiteit Amsterdam, Ivano Malavolta Vrije Universiteit Amsterdam Pre-print Media Attached
11:10 10m Talk		Can LLMs Replace Manual Annotation of Software Engineering Artifacts?Technical Track Distinguished Paper Award Technical Papers Toufique Ahmed IBM Research, Prem Devanbu University of California at Davis, Christoph Treude Singapore Management University, Michael Pradel University of Stuttgart Pre-print
11:20 10m Talk		Investigating the Understandability of Review Comments on Code Change Requests Technical Papers Md Shamimur Rahman University of Saskatchewan, Zadia Codabux University of Saskatchewan, Chanchal K. Roy University of Saskatchewan
11:30 10m Talk		Mining a Decade of Contributor Dynamics in Ethereum: A Longitudinal StudyFOSS Award Technical Papers Matteo Vaccargiu University of Cagliari, Sabrina Aufiero University College London (UCL), Cheick Ba Queen Mary University of London, Silvia Bartolucci University College London, Richard Clegg Queen Mary University London, Daniel Graziotin University of Hohenheim, Rumyana Neykova Brunel University London, Roberto Tonelli University of Cagliari, Giuseppe Destefanis Brunel University of London Pre-print
11:40 10m Talk		Is it Really Fun? Detecting Low Engagement Events in Video Games Technical Papers Emanuela Guglielmi University of Molise, Gabriele Bavota Software Institute @ Università della Svizzera Italiana, Nicole Novielli University of Bari, Rocco Oliveto University of Molise, Simone Scalabrino University of Molise
11:50 5m Talk		A Dataset of Contributor Activities in the NumFocus Open-Source CommunityData/Tool Track Distinguished Dataset Award Data and Tool Showcase Track Youness Hourri University of Mons, Alexandre Decan University of Mons; F.R.S.-FNRS, Tom Mens University of Mons Pre-print
11:55 5m Talk		Jupyter Notebook Activity Dataset Data and Tool Showcase Track Tomoki Nakamaru The University of Tokyo, Tomomasa Matsunaga The University of Tokyo, Tetsuro Yamazaki University of Tokyo
12:00 5m Talk		CoPhi - Mining C/C++ Packages for Conan Ecosystem Analysis Data and Tool Showcase Track Vivek Sarkar University of Washington, Anemone Kampkötter TU Dortmund, Ben Hermann TU Dortmund Pre-print
12:05 5m Talk		MARIN: A Research-Centric Interface for Querying Software Artifacts on Maven Repositories Data and Tool Showcase Track Johannes Düsing TU Dortmund, Jared Chiaramonte Arizona State University, Ben Hermann TU Dortmund Pre-print File Attached
12:10 5m Talk		GitProjectHealth: an Extensible Framework for Git Social Platform Mining Data and Tool Showcase Track Nicolas Hlad Berger-Levrault, Benoit Verhaeghe Berger-Levrault, Kilian Bauvent Berger-levrault
12:15 5m Talk		Myriad People. Open Source Software for New Media ArtsFOSS Award Data and Tool Showcase Track Benoit Baudry Université de Montréal, Erik Natanael Gustafsson Independent artist, Roni Kaufman Independent artist, Maria Kling Independent artist Pre-print
12:20 5m Talk		OpenMent: A Dataset of Mentor-Mentee Interactions in Google Summer of Code Data and Tool Showcase Track Erfan Raoofian University of British Columbia, Fatemeh Hendijani Fard Department of Computer Science, Mathematics, Physics and Statistics, University of British Columbia, Okanagan Campus, Ifeoma Adaji University of British Columbia, Gema Rodríguez-Pérez Department of Computer Science, Mathematics, Physics and Statistics, University of British Columbia, Okanagan Campus
12:25 5m Talk		Under the Blueprints: Parsing Unreal Engine’s Visual Scripting at Scale Data and Tool Showcase Track Kalvin Eng University of Alberta, Abram Hindle University of Alberta Pre-print

11:00 - 12:30	Build systems and DevOpsData and Tool Showcase Track / Technical Papers / Tutorials / Program at 215 Chair(s): Massimiliano Di Penta University of Sannio, Italy

11:00 10m Talk		Build Scripts Need Maintenance Too: A Study on Refactoring and Technical Debt in Build Systems Technical Papers Anwar Ghammam Oakland University, Dhia Elhaq Rzig University of Michigan - Dearborn, Mohamed Almukhtar Oakland University, Rania Khalsi University of Michigan - Flint, Foyzul Hassan University of Michigan at Dearborn, Marouane Kessentini Grand Valley State University
11:10 10m Talk		LLMSecConfig: An LLM-Based Approach for Fixing Software Container Misconfigurations Technical Papers Ziyang Ye The University of Adelaide, Triet Le The University of Adelaide, Muhammad Ali Babar School of Computer Science, The University of Adelaide Pre-print
11:20 10m Talk		How Do Infrastructure-as-Code Practitioners Update Their Dependencies? An Empirical Study on Terraform Module Updates Technical Papers Mahi Begoug , Ali Ouni ETS Montreal, University of Quebec, Moataz Chouchen Department of Electrical and Computer Engineering, Concordia University, Montreal, Canada
11:30 5m Talk		TerraDS: A Dataset for Terraform HCL Programs Data and Tool Showcase Track Christoph Buehler University of St. Gallen, David Spielmann University of St. Gallen, Roland Meier armasuisse, Guido Salvaneschi University of St. Gallen Pre-print
11:35 5m Talk		CARDS: A collection of package, revision, and miscelleneous dependency graphs Data and Tool Showcase Track Euxane TRAN-GIRARD LIGM, CNRS, Université Gustave Eiffel, Laurent BULTEAU LIGM, CNRS, Université Gustave Eiffel, Pierre-Yves DAVID Octobus S.c.o.p. Pre-print
11:40 5m Talk		GHALogs: Large-scale dataset of GitHub Actions runs Data and Tool Showcase Track Florent Moriconi EURECOM, AMADEUS, Thomas Durieux TU Delft, Jean-Rémy Falleri Univ. Bordeaux, CNRS, Bordeaux INP, LaBRI, UMR 5800, Institut Universitaire de France, Raphaël Troncy EURECOM, Aurélien Francillon EURECOM
11:45 5m Talk		OSPtrack: A Labeled Dataset Targeting Simulated Execution of Open-Source Software Data and Tool Showcase Track Zhuoran Tan University of Glasgow, Christos Anagnostopoulos University of Glasgow, Jeremy Singer University of Glasgow
11:50 40m Tutorial		Agents for Software Development Tutorials Graham Neubig Carnegie Mellon University

13:00 - 14:00	MSR Poster (Tuesday)Mining Challenge / Data and Tool Showcase Track / Technical Papers / Program at Canada Hall 3 Poster Area

13:00 60m Talk		Chasing the Clock: How Fast Are Vulnerabilities Fixed in the Maven Ecosystem? Mining Challenge Md Fazle Rabbi Idaho State University, Arifa Islam Champa Idaho State University, Rajshakhar Paul Wayne State University, Minhaz F. Zibran Idaho State University Pre-print
13:00 60m Talk		MaLAware: Automating the Comprehension of Malicious Software Behaviours using Large Language Models (LLMs) Data and Tool Showcase Track BIKASH SAHA Indian Institute of Technology Kanpur, Nanda Rani Indian Institute of Technology Kanpur, Sandeep K. Shukla Indian Institute of Technology Kanpur Pre-print
13:00 60m Talk		A Dataset of Contributor Activities in the NumFocus Open-Source CommunityData/Tool Track Distinguished Dataset Award Data and Tool Showcase Track Youness Hourri University of Mons, Alexandre Decan University of Mons; F.R.S.-FNRS, Tom Mens University of Mons Pre-print
13:00 60m Talk		Popularity and Innovation in Maven Central Mining Challenge Nkiru Ede Victoria University of Wellington, Jens Dietrich Victoria University of Wellington, Ulrich Zülicke Victoria University of Wellington Pre-print
13:00 60m Talk		TerraDS: A Dataset for Terraform HCL Programs Data and Tool Showcase Track Christoph Buehler University of St. Gallen, David Spielmann University of St. Gallen, Roland Meier armasuisse, Guido Salvaneschi University of St. Gallen Pre-print
13:00 60m Talk		SPRINT: An Assistant for Issue Report Management Data and Tool Showcase Track Ahmed Adnan , Antu Saha William & Mary, Oscar Chaparro William & Mary Pre-print
13:00 60m Talk		Does Functional Package Management Enable Reproducible Builds at Scale? Yes.Technical Track Distinguished Paper Award Technical Papers Julien Malka LTCI, Télécom Paris, Institut Polytechnique de Paris, France, Stefano Zacchiroli LTCI, Télécom Paris, Institut Polytechnique de Paris, Palaiseau, France, Théo Zimmermann Télécom Paris, Polytechnic Institute of Paris Pre-print
13:00 60m Talk		Dependency Update Adoption Patterns in the Maven Software Ecosystem Mining Challenge Baltasar Berretta College of Wooster, Augustus Thomas College of Wooster, Heather Guarnera The College of Wooster
13:00 60m Talk		A Dataset of Software Bill of Materials for Evaluating SBOM Consumption Tools Data and Tool Showcase Track Rio Kishimoto Osaka University, Tetsuya Kanda Notre Dame Seishin University, Yuki Manabe The University of Fukuchiyama, Katsuro Inoue Nanzan University, Shi Qiu Toshiba, Yoshiki Higo Osaka University Pre-print
13:00 60m Talk		Investigating the Understandability of Review Comments on Code Change Requests Technical Papers Md Shamimur Rahman University of Saskatchewan, Zadia Codabux University of Saskatchewan, Chanchal K. Roy University of Saskatchewan
13:00 60m Talk		Refactoring for Dockerfile Quality: A Dive into Developer Practices and Automation Potential Technical Papers Emna Ksontini University of Michigan, Meriem Mastouri University of Michigan, Rania Khalsi University of Michigan - Flint, Wael Kessentini DePaul University
13:00 60m Talk		Combining Large Language Models with Static Analyzers for Code Review Generation Technical Papers Imen Jaoua DIRO, Université de Montréal, Oussama Ben Sghaier DIRO, Université de Montréal, Houari Sahraoui DIRO, Université de Montréal Pre-print
13:00 60m Talk		Cascading Effects: Analyzing Project Failure Impact in the Maven Central Ecosystem Mining Challenge Mina Shehata Belmont University, Saidmakhmud Makhkamjonoov Belmont University, Mahad Syed Belmont University, Esteban Parra Rodriguez Belmont University
13:00 60m Talk		CoMRAT: Commit Message Rationale Analysis Tool Data and Tool Showcase Track Mouna Dhaouadi University of Montreal, Bentley Oakes Polytechnique Montréal, Michalis Famelis Université de Montréal Pre-print Media Attached File Attached
13:00 60m Talk		Can LLMs Replace Manual Annotation of Software Engineering Artifacts?Technical Track Distinguished Paper Award Technical Papers Toufique Ahmed IBM Research, Prem Devanbu University of California at Davis, Christoph Treude Singapore Management University, Michael Pradel University of Stuttgart Pre-print
13:00 60m Talk		Do LLMs Provide Links to Code Similar to what they Generate? A Study with Gemini and Bing CoPilot Technical Papers Daniele Bifolco University of Sannio, Pietro Cassieri University of Salerno, Giuseppe Scanniello University of Salerno, Massimiliano Di Penta University of Sannio, Italy, Fiorella Zampetti University of Sannio, Italy Pre-print
13:00 60m Talk		Mining a Decade of Contributor Dynamics in Ethereum: A Longitudinal StudyFOSS Award Technical Papers Matteo Vaccargiu University of Cagliari, Sabrina Aufiero University College London (UCL), Cheick Ba Queen Mary University of London, Silvia Bartolucci University College London, Richard Clegg Queen Mary University London, Daniel Graziotin University of Hohenheim, Rumyana Neykova Brunel University London, Roberto Tonelli University of Cagliari, Giuseppe Destefanis Brunel University of London Pre-print
13:00 60m Talk		SCRUBD: Smart Contracts Reentrancy and Unhandled Exceptions Vulnerability Dataset Data and Tool Showcase Track Chavhan Sujeet Yashavant Indian Institute of Technology, Kanpur, Mitrajsinh Chavda Indian Institute of Technology Kanpur, India, Saurabh Kumar Indian Institute of Technology Hyderabad, India, Amey Karkare IIT Kanpur, Angshuman Karmakar Indian Institute of Technology Kanpur, India Pre-print
13:00 60m Talk		Out of Sight, Still at Risk: The Lifecycle of Transitive Vulnerabilities in Maven Mining Challenge Piotr Przymus Nicolaus Copernicus University in Toruń, Poland, Mikołaj Fejzer Nicolaus Copernicus University in Toruń, Jakub Narębski Nicolaus Copernicus University in Toruń, Krzysztof Rykaczewski Nicolaus Copernicus University in Toruń, Poland, Krzysztof Stencel University of Warsaw Pre-print
13:00 60m Talk		HaPy-Bug - Human Annotated Python Bug Resolution Dataset Data and Tool Showcase Track Piotr Przymus Nicolaus Copernicus University in Toruń, Poland, Mikołaj Fejzer Nicolaus Copernicus University in Toruń, Jakub Narębski Nicolaus Copernicus University in Toruń, Radosław Woźniak Nicolaus Copernicus University in Toruń, Łukasz Halada University of Wrocław, Poland, Aleksander Kazecki Nicolaus Copernicus University in Toruń, Mykhailo Molchanov Igor Sikorsky Kyiv Polytechnic Institute, Ukraine, Krzysztof Stencel University of Warsaw Pre-print File Attached

14:00 - 15:30	AI for SE (2)Technical Papers / Data and Tool Showcase Track / Registered Reports / Industry Track / Program at 214 Chair(s): Giuseppe Destefanis Brunel University of London

14:00 10m Talk		Automatic High-Level Test Case Generation using Large Language Models Technical Papers Navid Bin Hasan Bangladesh University of Engineering and Technology, Md. Ashraful Islam Bangladesh University of Engineering and Technology, Junaed Younus Khan Bangladesh University of Engineering and Technology, Sanjida Senjik Bangladesh University of Engineering and Technology, Anindya Iqbal Bangladesh University of Engineering and Technology Dhaka, Bangladesh
14:10 10m Talk		Prompting in the Wild: An Empirical Study of Prompt Evolution in Software Repositories Technical Papers Mahan Tafreshipour University of California at Irvine, Aaron Imani University of California, Irvine, Eric Huang University of California, Irvine, Eduardo Santana de Almeida Federal University of Bahia, Thomas Zimmermann University of California, Irvine, Iftekhar Ahmed University of California at Irvine Pre-print
14:20 10m Talk		Intelligent Semantic Matching (ISM) for Video Tutorial Search using Transformer Models Technical Papers Ahmad Tayeb , Sonia Haiduc Florida State University
14:30 10m Talk		Language Models in Software Development Tasks: An Experimental Analysis of Energy and Accuracy Technical Papers Negar Alizadeh Universiteit Utrecht, Boris Belchev University of Twente, Nishant Saurabh Utrecht University, Patricia Kelbert Fraunhofer IESE, Fernando Castor University of Twente Pre-print
14:40 10m Talk		TriGraph: A Probabilistic Subgraph-Based Model for Visual Code Completion in Pure Data Technical Papers Anisha Islam Department of Computing Science, University of Alberta, Abram Hindle University of Alberta Pre-print
14:50 5m Talk		Inferring Questions from Programming Screenshots Technical Papers Faiz Ahmed York University, Xuchen Tan York University, Folajinmi Adewole York University, Suprakash Datta York University, Maleknaz Nayebi York University
14:55 5m Talk		Human-In-The-Loop Software Development Agents: Challenges and Future Directions Industry Track Jirat Pasuksmit Atlassian, Wannita Takerngsaksiri Monash University, Patanamon Thongtanunam University of Melbourne, Kla Tantithamthavorn Monash University, Ruixiong Zhang Atlassian, Shiyan Wang Atlassian, Fan Jiang Atlassian, Jing Li Atlassian, Evan Cook Atlassian, Kun Chen Atlassian, Ming Wu Atlassian Pre-print
15:00 5m Talk		FormalSpecCpp: A Dataset of C++ Formal Specifications Created Using LLMs Data and Tool Showcase Track Madhurima Chakraborty University of California, Riverside, Peter Pirkelbauer Lawrence Livermore National Laboratory, Qing Yi Lawrence Livermore National Laboratory
15:05 10m Talk		Towards Detecting Prompt Knowledge Gaps for Improved LLM-guided Issue Resolution Technical Papers Ramtin Ehsani Drexel University, Sakshi Pathak Drexel University, Preetha Chatterjee Drexel University, USA Pre-print
15:15 5m Talk		GENCNIPPET: Automated Generation of Code Snippets for Supporting Programming Questions Registered Reports Saikat Mondal University of Saskatchewan, Chanchal K. Roy University of Saskatchewan Pre-print

14:00 - 15:30	Software qualityTechnical Papers / Data and Tool Showcase Track / Registered Reports / Program at 215 Chair(s): Mohammad Hamdaqa Polytechnique Montreal

14:00 10m Talk		Does Functional Package Management Enable Reproducible Builds at Scale? Yes.Technical Track Distinguished Paper Award Technical Papers Julien Malka LTCI, Télécom Paris, Institut Polytechnique de Paris, France, Stefano Zacchiroli LTCI, Télécom Paris, Institut Polytechnique de Paris, Palaiseau, France, Théo Zimmermann Télécom Paris, Polytechnic Institute of Paris Pre-print
14:10 10m Talk		Refactoring for Dockerfile Quality: A Dive into Developer Practices and Automation Potential Technical Papers Emna Ksontini University of Michigan, Meriem Mastouri University of Michigan, Rania Khalsi University of Michigan - Flint, Wael Kessentini DePaul University
14:20 10m Talk		Smells-sus: Sustainability Smells in IaC Technical Papers Seif Kosbar Polytechnique Montréal, Mohammad Hamdaqa Polytechnique Montreal
14:30 10m Talk		Evidence is All We Need: Do Self-Admitted Technical Debts Impact Method-Level Maintenance? Technical Papers Shaiful Chowdhury University of Manitoba, Hisham Kidwai University of Manitoba, Muhammad Asaduzzaman University of Windsor
14:40 5m Talk		DPy: Code Smells Detection Tool for Python Data and Tool Showcase Track Aryan Boloori Dalhousie university, Tushar Sharma Dalhousie University Pre-print
14:45 5m Talk		CoMRAT: Commit Message Rationale Analysis Tool Data and Tool Showcase Track Mouna Dhaouadi University of Montreal, Bentley Oakes Polytechnique Montréal, Michalis Famelis Université de Montréal Pre-print Media Attached File Attached
14:50 5m Talk		E2EGit: A Dataset of End-to-End Web Tests in Open Source ProjectsData/Tool Track Distinguished Dataset Award Data and Tool Showcase Track Sergio Di Meglio Università degli Studi di Napoli Federico II, Luigi Libero Lucio Starace Università degli Studi di Napoli Federico II, Valeria Pontillo Gran Sasso Science Institute, Ruben Opdebeeck Vrije Universiteit Brussel, Coen De Roover Vrije Universiteit Brussel, Sergio Di Martino Università degli Studi di Napoli Federico II Media Attached
14:55 5m Talk		TestMigrationsInPy: A Dataset of Test Migrations from Unittest to Pytest Data and Tool Showcase Track Altino Alves Júnior UFMG, Andre Hora UFMG Pre-print Media Attached
15:00 5m Talk		pyMethods2Test: A Dataset of Python Tests Mapped to Focal Methods Data and Tool Showcase Track Idriss Abdelmadjid University of Nebraska-Lincoln, Robert Dyer University of Nebraska-Lincoln Pre-print Media Attached
15:05 5m Talk		DataTD: A Dataset of Java Projects Including Test Doubles Data and Tool Showcase Track Mengzhen Li University of Minnesota, Mattia Fazzini University of Minnesota
15:10 5m Talk		JPerfEvo: A Tool for Tracking Method-Level Performance Changes in Java Projects Data and Tool Showcase Track Kaveh Shahedi Polytechnique Montréal, Maxime Lamothe Polytechnique Montreal, Foutse Khomh Polytechnique Montréal, Heng Li Polytechnique Montréal
15:15 10m Talk		PyExamine: A Comprehensive, Un-Opinionated Smell Detection Tool for Python Technical Papers Karthik Shivashankar University of Oslo, Antonio Martini University of Oslo, Norway
15:25 5m Talk		How Do Solidity Versions Affect Vulnerability Detection Tools? An Empirical Study Registered Reports Gerardo Iuliano University of Salerno, Davide Corradini University of Luxembourg, Michele Pasqua University of Verona, Mariano Ceccato University of Verona, Dario Di Nucci University of Salerno

16:00 - 17:30	Plenary: ClosingProgram / Vision and Reflection at 214 Chair(s): Gabriele Bavota Software Institute @ Università della Svizzera Italiana, Jin L.C. Guo McGill University

16:00 30m Talk		Future of AI4SE: From Code Generation to Software Engineering? Vision and Reflection Baishakhi Ray Columbia University
16:30 30m Talk		Reshaping MSR (and SE) empirical evaluations in 2030 Vision and Reflection Massimiliano Di Penta University of Sannio, Italy
17:00 15m Day closing		Closing Session Program Bram Adams Queen's University, Olga Baysal Carleton University, Ayushi Rastogi University of Groningen, The Netherlands
17:15 15m Day closing		MSR 2026 Presentation Program

Accepted Papers

	Title
	Agents for Software Development Tutorials Graham Neubig
	Harmonized Coding with AI: LLMs for Qualitative Analysis in Software Engineering Research Tutorials Christoph Treude, Youmei Fan, Tao Xiao, Hideaki Hata File Attached

Call for Tutorials

The MSR Tutorials track aims to invite experienced software repository miners to give informative tutorials to our broad community, whether newcomers or mining experts. These tutorials will cover various topics related to mining software repositories.

We encourage researchers to submit an abstract outlining a talk, with a maximum length of one page (plus up to one additional page of references). We are soliciting abstracts in two categories:

Research Problem: the talk outlines a single problem in an academic/industrial context that could be addressed using data from software repositories.
Lessons Learnt & Opportunities: the talk outlines a series of actionable implications and provides promising directions for future researchers.

If you have any questions, please contact the co-chairs: Dong Wang (d.wang@ait.kyushu-u.ac.jp) and Ayushi Rastogi (a.rastogi@rug.nl)

Submission

A one-page PDF file is required for submission, including the title, speaker names, affiliations, and the outline of the tutorial talk. Additionally, up to one page of references can be included. We do not require any specific template for the submission stage.

If your submission is accepted, you can have your abstract published in the conference proceedings. In this case, your abstract must conform to the official “ACM Primary Article Template”, which can be obtained from the ACM Proceedings Template page. If you are using LaTeX, please use the “sigconf” option, as well as the “review” option, to produce line numbers for easy reference by the reviewers. To do so, please include the following LaTeX code at the start of your document:

\documentclass\[sigconf,review,anonymous]{acmart}

\acmConference\[ICSE 2024]{46th International Conference on Software Engineering}{April 2024}{Lisbon, Portugal}

If you would like to submit your tutorial for consideration for the 2024 MSR Tutorials, please submit it at https://msr24tutorial.hotcrp.com/

Evaluation Criteria

The submissions will be evaluated by the two tutorial track co-chairs. We will assess the relevance and clarity of the problem description, the lessons learned, and the research opportunities, along with the context, in submissions from researchers.

Accepted Papers

The official publication date is when the proceedings become available in the ACM or IEEE Digital Libraries. Note that this date may be up to two weeks before the first day of ICSE 2024. It’s important to remember that the official publication date may affect the deadline for any patent filings related to published work.
Purchases of additional pages in the proceedings are not allowed.

Important Dates (all dates are in AoE)

Tue 19th of December 2023 (Proposal submission deadline),
Thu 21st of December 2023 (Author Notification),
Sun 28th of January 2024 (Camera-ready version)

TutorialsMSR 2025

Agents for Software Development

Harmonized Coding with AI: LLMs for Qualitative Analysis in Software Engineering Research

Mon 28 Apr
Displayed time zone: Eastern Time (US & Canada) change

Tue 29 Apr
Displayed time zone: Eastern Time (US & Canada) change

Accepted Papers

Call for Tutorials

Submission

Evaluation Criteria

Accepted Papers

Important Dates (all dates are in AoE)

Tracks

TutorialsMSR 2025

Agents for Software Development

Harmonized Coding with AI: LLMs for Qualitative Analysis in Software Engineering Research

Program Display Configuration

Mon 28 AprDisplayed time zone: Eastern Time (US & Canada) change

Tue 29 AprDisplayed time zone: Eastern Time (US & Canada) change

Accepted Papers

Call for Tutorials

Submission

Evaluation Criteria

Accepted Papers

Important Dates (all dates are in AoE)

Mon 28 Apr
Displayed time zone: Eastern Time (US & Canada) change

Tue 29 Apr
Displayed time zone: Eastern Time (US & Canada) change