Show simple item record

dc.contributor.advisorIdreos, Stratos
dc.contributor.authorMishra, Aakash
dc.date.accessioned2023-07-06T04:10:55Z
dc.date.created2023
dc.date.issued2023-06-30
dc.date.submitted2023
dc.identifier.citationMishra, Aakash. 2023. Improving Neural Networks with Generalizable Performance Predictors and Generative Code Language Models. Bachelor's thesis, Harvard College.
dc.identifier.other30315542
dc.identifier.urihttps://nrs.harvard.edu/URN-3:HUL.INSTREPOS:37376427*
dc.description.abstractNeural Architecture Search (NAS) is a growing field with many evolving facets of research, from evaluation strategies and search space criterion to architecture optimization strategies and performance prediction. Currently, these spaces are disjoint and constrained due to lack of generalizability. Structured search spaces restrict algorithms to specific architectures, while performance estimators are fixed to given benchmarks without the ability to conduct zero-shot evaluation. Using advances in generative AI, we present a chimera of the aforementioned methods in a tool called NAS-Assistant. Our methodology consists of a new generalizable GNN-based neural architecture encoder and a clustering, attention-based regression network that predicts model performance with high accuracy and transferability. We also propose a unique method for evaluating the contribution of each layer of a network, combined with zero-cost NAS evaluation. Lastly, we develop a framework for using generative code language models to explore any model search space requested from NAS-Assistant. This thesis aims to demonstrate the first integrated generative AI optimizer for Neural Architecture Search.
dc.format.mimetypeapplication/pdf
dc.language.isoen
dash.licenseLAA
dc.subjectGenerative AI
dc.subjectNeural Architecture Search
dc.subjectPerformance Prediction
dc.subjectComputer science
dc.titleImproving Neural Networks with Generalizable Performance Predictors and Generative Code Language Models
dc.typeThesis or Dissertation
dash.depositing.authorMishra, Aakash
dc.date.available2023-07-06T04:10:55Z
thesis.degree.date2023
thesis.degree.grantorHarvard College
thesis.degree.levelBachelor's
thesis.degree.levelUndergraduate
thesis.degree.nameAB
dc.contributor.committeeMemberProtopapas, Pavlos
dc.contributor.committeeMemberSirin, Utku
dc.type.materialtext
thesis.degree.departmentComputer Science
dc.identifier.orcid0000-0002-5216-5400
dash.author.emailaakamishra1@gmail.com


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record