The Center for Education and Research in Information Assurance and Security (CERIAS)

The Center for Education and Research in
Information Assurance and Security (CERIAS)

Reusing Deep Learning Models: Challenges and Directions in Trustworthy Software Engineering

Principal Investigator: Jamie Davis

The development and training of deep learning models have become increasingly costly and complex. Consequently, software engineers are adopting pre-trained models (PTMs) for their downstream applications. The content, dynamics, and effective use of the PTM supply chain remain largely unexplored. This project seeks to characterize the associated engineering processes and artifacts, in order to identify and mitigate failure modes. We then develop tools, e.g. automation, to optimize the reuse of PTMs. We integrate research methods from human factors, mining software repositories, and machine learning. We are focused on major model registries such as HuggingFace and PyTorchHub, and interoperability infrastructure such as the Open Neural Network eXchange (ONNX).

Personnel

Other PIs: Yung-Hsiang Lu (Purdue) George K Thiruvathukal (Loyola University Chicago)

Students: Wenxin Jiang, PhD student Purvish Jajal, PhD student ~20 Purdue undergraduate students from several majors, through the VIP program

Representative Publications

Keywords: machine learning, reproducibility, software engineering