dc.contributor.advisor | Idreos, Stratos | |
dc.contributor.author | Mishra, Aakash | |
dc.date.accessioned | 2023-07-06T04:10:55Z | |
dc.date.created | 2023 | |
dc.date.issued | 2023-06-30 | |
dc.date.submitted | 2023 | |
dc.identifier.citation | Mishra, Aakash. 2023. Improving Neural Networks with Generalizable Performance Predictors and Generative Code Language Models. Bachelor's thesis, Harvard College. | |
dc.identifier.other | 30315542 | |
dc.identifier.uri | https://nrs.harvard.edu/URN-3:HUL.INSTREPOS:37376427 | * |
dc.description.abstract | Neural Architecture Search (NAS) is a growing field with many evolving facets of research, from evaluation strategies and search space criteria to architecture optimization strategies and performance prediction. Currently, these spaces are disjoint and constrained due to a lack of generalizability. Structured search spaces restrict algorithms to specific architectures, while performance estimators are fixed to given benchmarks without the ability to conduct zero-shot evaluation. Using advances in generative AI, we present a chimera of the aforementioned methods in a tool called NAS-Assistant. Our methodology consists of a new generalizable GNN-based neural architecture encoder and a clustering, attention-based regression network that predicts model performance with high accuracy and transferability. We also propose a unique method for evaluating the contribution of each layer of a network, combined with zero-cost NAS evaluation. Lastly, we develop a framework for using generative code language models to explore any model search space requested from NAS-Assistant. This thesis aims to demonstrate the first integrated generative AI optimizer for Neural Architecture Search. | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en | |
dash.license | LAA | |
dc.subject | Generative AI | |
dc.subject | Neural Architecture Search | |
dc.subject | Performance Prediction | |
dc.subject | Computer science | |
dc.title | Improving Neural Networks with Generalizable Performance Predictors and Generative Code Language Models | |
dc.type | Thesis or Dissertation | |
dash.depositing.author | Mishra, Aakash | |
dc.date.available | 2023-07-06T04:10:55Z | |
thesis.degree.date | 2023 | |
thesis.degree.grantor | Harvard College | |
thesis.degree.level | Bachelor's | |
thesis.degree.level | Undergraduate | |
thesis.degree.name | AB | |
dc.contributor.committeeMember | Protopapas, Pavlos | |
dc.contributor.committeeMember | Sirin, Utku | |
dc.type.material | text | |
thesis.degree.department | Computer Science | |
dc.identifier.orcid | 0000-0002-5216-5400 | |
dash.author.email | aakamishra1@gmail.com | |