Publication:

Natural Language Search for NASA ADS

Loading...
Thumbnail Image

Date

2024-05-17

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Marsh, Tanner. 2024. Natural Language Search for NASA ADS. Master's thesis, Harvard University Division of Continuing Education.

Abstract

The NASA Astrophysics Data System (ADS) is a critical resource for researchers and students in astronomy, astrophysics, and beyond. ADS indexes a vast collection of papers and scholarly literature that researchers can search through using the ADS website or API. ADS’s database is powered by Apache Solr, enabling users to formulate highly expressive and precise search queries from the more than 50 allowable search fields. However, the sophistication of ADS’s search capabilities comes at the cost of usability, necessitating users to familiarize themselves with Solr and ADS’s documentation to fully exploit its features. This thesis proposes a solution to enhance the accessibility of ADS by creating a chat application where users make requests for papers by asking for them in natural language rather than by constructing Solr queries. This application works by leveraging SOTA transformer-based large language models (LLMs) to translate natural language requests into Solr queries, thereby simplifying user interaction with the ADS database without compromising on the precision of search results. In this work, we use in-context learning (ICL) with retrieval augmented generation (RAG) in order to enhance the translation capabilities of the LLM, leading to significant improvement in translation performance.

Description

Other Available Sources

Research Data

Keywords

Astrophysics Data System (ADS), few-shot learning, in-context learning (ICL), retrieval-augmented generation (RAG), Solr, text-to-sql, Computer science, Artificial intelligence, Astronomy

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories