Computer Science Seminar Series
ABOUT
The Computer Science Seminar Series was created in 2009-2010 and features invited speakers from all areas of computing. In addition to invited speakers, the seminar series also features students and faculty associated with our university's Computer Science graduate programs.
The goal of the seminar series is to promote a culture of collaboration and research excellence at the university. Seminars are (typically) held weekly and are scheduled to take place Thursdays from 2:00 to 3:00 p.m. in the Science Building, Room 4170 or in the Software & Informatics Research Centre (SIRC), Room 4150. The seminar is currently meeting online due to the Covid-19 pandemic. If you have any questions or requests, please contact the Seminar Series Coordinators Khalil El-Khatib, PhD and Ken Pu, PhD.
JOIN THE MAILING LIST
Our university's Computer Science graduate students and faculty automatically receive seminar announcements by email. Other members of the university or members of the public may subscribe to the Computer Science Seminar mailing list.
Upcoming Seminars
Date: Wednesday, October 2, 2024
Time: 1:00 pm – 2:00 pm
Room: UA 4170 (Science Board Room)
Presenter: Dr. Jessie Galasso (Department of Electrical and Computer Engineering at the Faculty of Engineering, McGill University, Canada.)
Host: Dr. Cristiano Politowski (Faculty of Science, Ontario Tech University)
Title: Navigating the challenges of creating large, reproducible software datasets
Abstract: The collection and management of software datasets are fundamental to empirical software engineering. Defining and adopting proper data collection practices is essential to ensure the quality and reproducibility of research findings. In this talk, I will explore the challenges and limitations researchers face when selecting repositories from popular code forges. Specifically, I will address the issues that may hinder the reuse and reproducibility of software datasets. Following this, I will present solutions to support the creation and sharing of large, reproducible datasets of software repositories. These solutions are centered around the concept of dataset fingerprinting and leverage the unique features of the Software Heritage open archive.
Presenter Bio: Jessie Galasso is an Assistant Professor in the Department of Electrical and Computer Engineering at the Faculty of Engineering, McGill University, Canada. She received a Ph.D. in Computer Science in 2018 from the University of Montpellier, France. Her primary research interests focus on issues related to the management of software datasets, including software repository mining, knowledge extraction and representation, as well as variability management and modeling. In recent years, she has served as a program committee member for SANER'24 and ICSR'24, and has co-chaired several tracks and workshops at MODELS and SPLC.