An Exploratory Study on the Platforms of Sharing Reusable Machine Learning Models

dc.contributor.advisorJiang, ZhenMing "Jack"
dc.contributor.authorXiu, Minke
dc.date.accessioned2021-03-08T17:22:44Z
dc.date.available2021-03-08T17:22:44Z
dc.date.copyright2020-11
dc.date.issued2021-03-08
dc.date.updated2021-03-08T17:22:44Z
dc.degree.disciplineElectrical and Computer Engineering
dc.degree.levelMaster's
dc.degree.nameMASc - Master of Applied Science
dc.description.abstractRecent advances in Artificial Intelligence, especially in Machine Learning (ML), have brought applications previously considered as science fiction (e.g., virtual personal assistants and autonomous cars) into the reach of millions of everyday users. Since modern ML technologies like deep learning require considerable technical expertise and resource to build custom models, reusing existing models trained by experts has become essential. Currently the ML models are shared, distributed, or retailed on multiple ML model platforms which can be divided into two categories based on their usage patterns: (1) ML model stores whose models can be deployed and served with the help of cloud infrastructure, and (2) ML package repositories whose models are free but need to be deployed and used (e.g., embedded into users applications as a software component) manually. We conducted an exploratory study on the above two categories of ML model platforms: ML model stores and ML package repositories. We analyzed the structure and the contents of the ML models platforms, as well as functionalities provided by the package managers. The research subjects were three general purpose ML model stores (AWS marketplace, ModelDepot, and Wolfram neural net repository) and two popular ML package repositories (TensorFlow Hub and PyTorch Hub). When studying the structure of ML model platforms and functionalities of package managers, we compared them against their counterparts from traditional software development: ML model stores vs. mobile app stores (e.g., Google Play and Apple App Store), and ML package repositories vs. programming language package repositories (e.g., npm, PyPI, and CRAN). Through our study, we identified special software engineering practices and challenges for sharing, distributing, and retailing ML models. The implications from this thesis will be helpful for stakeholders to make the ML model platforms better serve the users (i.e., software engineers, data scientists and researchers).
dc.identifier.urihttp://hdl.handle.net/10315/38185
dc.languageen
dc.rightsAuthor owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subjectComputer Engineering
dc.subject.keywordsMachine Learning
dc.subject.keywordsSoftware Engineering
dc.subject.keywordsSoftware Engineering for Artificial Intelligence
dc.subject.keywordsML Frameworks
dc.subject.keywordsML Model Platforms
dc.subject.keywordsML Model Stores
dc.subject.keywordsML Package Repositories
dc.subject.keywordsEmpirical Study
dc.subject.keywordsSpecial Practices and Concerns
dc.subject.keywordsReview Policy
dc.subject.keywordsUser Reviews
dc.subject.keywordsUsage Statistics
dc.subject.keywordsPackage Manage
dc.subject.keywordsVersion Control
dc.subject.keywordsSignature Mechanism
dc.subject.keywordsEntrypoint Mechanisms
dc.subject.keywordsProduct Line Architecture
dc.subject.keywordsCross-platform Support
dc.subject.keywordsML Model Security
dc.subject.keywordsDataset Descriptions
dc.subject.keywordsModel Evaluation Approaches
dc.subject.keywordsModel Evaluation Results
dc.titleAn Exploratory Study on the Platforms of Sharing Reusable Machine Learning Models
dc.typeElectronic Thesis or Dissertation

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Minke_Xiu_2020_Masters.pdf
Size:
1.04 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
license.txt
Size:
1.87 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
YorkU_ETDlicense.txt
Size:
3.39 KB
Format:
Plain Text
Description: