<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Exploring Advanced Hypothesis Generation in Astronomy Through the Implementation of a Mathematical Model of Linguistic Neural Networks</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Ievgen</forename><surname>Zaitsev</surname></persName>
							<email>zaitsev@i.ua</email>
							<affiliation key="aff0">
								<orgName type="department">Institute of Electrodynamics</orgName>
								<orgName type="institution">National Academy of Sciences of Ukraine</orgName>
								<address>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Oleksandr</forename><surname>Golubenko</surname></persName>
							<email>golubenkooleksandr82@gmail.com</email>
							<affiliation key="aff1">
								<orgName type="institution">Academician Yury Bugai International Scientific and Technical University</orgName>
								<address>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Olha</forename><surname>Tkachenko</surname></persName>
							<affiliation key="aff2">
								<orgName type="institution">Taras Shevchenko Kyiv National University</orgName>
								<address>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Oleksandr</forename><surname>Pidmohylnyi</surname></persName>
							<affiliation key="aff3">
								<orgName type="institution">State University of Information and Communication Technologies</orgName>
								<address>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Artem</forename><surname>Antonenko</surname></persName>
							<email>artem.v.antonenko@gmail.com</email>
							<affiliation key="aff4">
								<orgName type="institution">National University of Life and Environmental Sciences of Ukraine</orgName>
								<address>
									<settlement>Kyiv</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Exploring Advanced Hypothesis Generation in Astronomy Through the Implementation of a Mathematical Model of Linguistic Neural Networks</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">E96E023E513A6396A0D040F24EA6A4FA</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T16:51+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Natural Language Processing (NLP)</term>
					<term>transformer architecture</term>
					<term>large Language Models (LLMs)</term>
					<term>GPT-4</term>
					<term>astronomy</term>
					<term>underlying physical processes</term>
					<term>hypothesis generation</term>
					<term>NASA&apos;s Astrophysics Data System</term>
					<term>in-context and adversarial prompting</term>
					<term>comprehensive corpus</term>
					<term>comprehensive corpus training</term>
					<term>astronomical knowledge</term>
					<term>exploration in astronomy</term>
					<term>astro-GPT</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The current research conducts an exhaustive analysis into the integration of Large Language Models (LLMs), focusing on GPT-4, within the domain of Astronomy. We deploy advanced incontext and adversarial prompting methods, harnessing a database of over 1000 articles from NASA's Astrophysics Data System, aiming to augment the model's performance in domainspecific literature comprehension. This comprehensive inquiry demonstrates a marked enhancement in the model's capability for generating robust and insightful hypotheses in astronomy. Furthermore, the study highlights the role of adversarial prompting in enabling GPT-4 to sift through extensive data to yield meaningful and informed scientific hypotheses, showcasing a pioneering advancement for the implementation of LLMs in Astronomy research.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The field of Natural Language Processing (NLP) has seen monumental progress propelled by advanced attention mechanisms and innovative transformer architecture. This technological evolution has culminated in the inception of Large Language Models (LLMs) like GPT-4, which stand as paragons of computational language understanding. These models unfurl a vast expanse of capabilities in not only understanding and generating human language but also in performing intricate interactions and operations with it. They transcend the boundaries of simple language tasks, venturing into the realm of making profound deductions and establishing intricate relationships across a panorama of diverse contexts.</p><p>The expansive potentials of these models have been underscored by two particular aspects that have fanned the flames of excitement in the research community. The first aspect delves into the LLMs' ability to sample the posterior means of languages. Even though this ability occasionally stumbles into the issues of non-trivial hallucination, a significant leap in their performance has been observed through the employment of in-context prompting <ref type="bibr" target="#b0">[1]</ref><ref type="bibr" target="#b1">[2]</ref><ref type="bibr" target="#b2">[3]</ref>. This strategic enhancement bolsters the models' capacity to grapple with complex, domain-specific tasks, displaying an exemplary augmentation in their functionality <ref type="bibr" target="#b3">[4]</ref>.</p><p>The second facet shines light on the harmonious integration of these models with cutting-edge technologies like Langchain. This integration saturates the LLMs with extensive context, amplifying their operational bandwidth across a myriad of fields. Despite the availability of other methods like the use of adapters <ref type="bibr" target="#b4">[5]</ref>, which can significantly enhance performance for domain-specific tasks by fine-tuning the LLMs, the challenges they pose, especially for institutions constrained by limited resources, cannot be overlooked.</p><p>Venturing into the astronomical domain, this research explores the application of cost-effective incontext prompting <ref type="bibr" target="#b5">[6]</ref>. The field of Astronomy emerges as an exceptional candidate for this study for three pivotal reasons. Firstly, despite its rich literary background, the incorporation of astronomical texts in the comprehensive corpus utilized for training GPT models is presumably limited. This limitation precipitates apparent hallucination problems when deploying naive versions of LLMs <ref type="bibr" target="#b6">[7]</ref>. Secondly, the advancement in astronomy is often a product of synthesizing information across various subfields, a feature accentuated by the universality of underlying physical processes at diverse scales. This characteristic bolsters the assumption that extensive in-context prompting can substantially amplify hypothesis generation, especially when LLMs are preliminarily exposed to a diverse literature spectrum.</p><p>Lastly, the enduring "open sky" policy of astronomy further solidifies its position as a promising field for in-context prompting research. The policy's emphasis on public availability of most datasets, either immediately or after a brief proprietary period <ref type="bibr" target="#b7">[8]</ref>, in conjunction with a comprehensive, meticulously curated literature database, facilitates seamless engagement with the vast reservoir of astronomical knowledge. With the NASA's Astrophysics Data System hosting a staggering excess of 15 million resources, the entirety of the astronomical literature spectrum accessible to researchers is effectively covered <ref type="bibr" target="#b8">[9]</ref>. This extensive accessibility plays a crucial role in enhancing our interaction with the astronomy database, offering a fertile ground for further research and exploration.</p><p>In conclusion, the application of LLMs, especially with the integration of in-context and adversarial prompting, holds the promise of a revolution in various domains, including astronomy. The outlined advancements and integrations not only mitigate the challenges faced by these models but also amplify their capabilities, offering a beacon of hope for future research and applications in diverse fields.</p><p>The procedure begins with the pre-processing and embedding of Galactic Astronomy papers. A similarity search is conducted on the embedded query, and relevant document chunks are retrieved. A further contextual compression is performed to remove irrelevant information from the chunks. These compressed texts serve as input to a GPT-4 instance, which generates an idea. This idea is then critiqued by a second GPT-4 model, and the feedback is moderated by a third GPT-4 model (Fig. <ref type="figure" target="#fig_0">1</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Retrieval and Pre-processing of Literature</head><p>In this research, our scrutiny was centered on Galactic Astronomy. Our domain expertise was leveraged to evaluate the outcomes, selecting Galactic Astronomy for its holistic character that amalgamates knowledge from various sub-disciplines. Exploring galaxy evolution encompasses not only a foundational comprehension of stars and stellar groups, but also considers the impact of extensive cosmological environmental elements <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b10">11]</ref>. This multifaceted approach to understanding galaxy evolution presents both thrilling challenges and plentiful opportunities for harnessing the tacit knowledge ensconced within the expansive literary network.</p><p>For our research, a collection of 2,000 pertinent Galactic Astronomy papers were assembled from the NASA Astrophysics Data System (ADS) is a digital library portal for researchers in astronomy and physics. It is operated by the Smithsonian Astrophysical Observatory (SAO) under a NASA grant. The system is a valuable resource for accessing a wide range of publications related to astrophysics, including research articles, conference proceedings, and other relevant literature.</p><p>The ADS maintains three bibliographic databases containing more than 14 million records:  Astronomy and Astrophysics  Physics and Geophysics  arXiv e-prints. The selection process involved a database query founded on specific criteria like the mention of 'Gaia' in the abstract, publications from the last decade (aligning with the Gaia launch date), peer-reviewed journal articles, and the incorporation of related keywords such as 'galaxy kinematics and dynamics', 'galaxy structure', 'galaxy disk', 'galaxy halo', 'galaxy abundances', and 'galaxy evolution'. The initial inquiry generated over 2,000 papers, prompting us to focus on the most recent contributions.</p><p>Our meticulously compiled collection encompasses various details like the ArxivID, Publication Date, Authors, Title, Abstract, Citation, and Key, forming a thorough dataset for our examination. For ensuring reproducibility, both the complete dataset and the utilized codebase in our analysis are made available here.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Astro-GPT Flow.</head><p>Our investigation harnesses the capabilities of the GPT-4 model by OpenAI. The initial step in in-context prompting involves the pre-processing of 2,000 documents from the Galactic Astronomy corpus utilizing the Langchain library. Each paper, converted from PDF to text format, is then divided into segments or 'chunks' each containing 2,000 tokens. These segments are further embedded using the text-ada-002 embedding model by OpenAI.</p><p>During the retrieval phase, chat history and input query are transformed into a singular input and then embedded. A search for similarity is executed between the embedded query and the vector database. Utilizing Langchain's contextual compression, irrelevant information is filtered out from the individual segments. The final texts, combined with the single input, lay the groundwork for the GPT-4 model, operating with a context window of around 8,000 tokens, to develop ideas.</p><p>To evaluate the model's competence, an adversarial experiment is constructed. This includes a secondary GPT-4 model tasked with analyzing the idea, pinpointing its weaknesses, and proposing possible improvements. This feedback is restructured within a feedback-question format by a third GPT-4 instance and relayed to the initial model.</p><p>In executing our experimental arrangement, we utilize Nk papers, where k is a member of the set {1, 10, 100, 1000}. Each sample is subjected to hypothesis generation by the 'Generation GPT-4' instance (our incontext prompted model on k papers). This is followed by an adversarial response from 'Adversarial GPT-4,' which is then reshaped by a moderator GPT-4 instance and returned to the generator model. This loop, producing three hypotheses and two critiques per trial, is repeated twice for each Nk and replicated five times overall. The same methodology is employed for 1,000 papers, without resampling, amassing a total of 60 hypotheses and 40 critiques.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Human Evaluation.</head><p>Given the inherently qualitative aspect of hypothesis generation, it was crucial to adopt an assessment process that, while subjective, would align with the anticipations of human experts. Recognizing this need, we engaged two domain experts specializing in Galactic Astronomy to scrutinize the quality of the hypotheses generated. Each hypothesis was critically evaluated and rated based on the volume of incorporated papers within the domain-specific context. This method allowed us to compute an average score from these dual human evaluations for each formulated hypothesis.</p><p>The grading rubric was divided into three distinct categoriesscientific accuracy, creativity, and feasibility. The hypotheses were meticulously assessed in these domains, and the average score across these three areas was considered the final score. Additionally, we conducted a comprehensive evaluation of the critiques provided by the AI judge, ensuring it had access to identical contextual information for a fair and balanced assessment. The graphical representation in the left panel of underscores the pivotal role of adversarial prompting in enhancing hypothesis generation substantially. In the absence of adversarial prompting, the quality of hypothesis generation exhibited negligible dependence on the number of consulted papers. This observation underscored that in-context prompting alone, albeit beneficial for reducing hallucination, fell short of providing an exhaustive comprehension of the corpus.</p><p>Nevertheless, the incorporation of adversarial prompting dramatically transformed this scenario. We observed a substantial escalation in the quality of hypothesis generation for both the AI generator and the AI judge. This improvement prevailed even without the need for explicitly aligning the models with human expectations. Most notably, the introduction of adversarial prompting established a robust correlation between the quality of the hypotheses and the quantity of reviewed papers, particularly at a larger context (N = 1000). It also fostered greater consistency in terms of the quality of both the hypotheses and the critiques.</p><p>The mean quality score witnessed a considerable augmentation, escalating from 2.5 (with a context of 10 papers, where a score of 3/5 corresponds to an average hypothesis by a proficient PhD student) to an almost expert level of 4/5 with the inclusion of 1,000 papers in the context. This marked enhancement accentuates the immense potential of employing adversarial prompting to bolster the quality of scientific hypothesis generation. Detailed examples underscoring these observations are elaborated in the Appendix.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Exploration of Embeddings.</head><p>In an endeavor to grasp the profound impact of adversarial prompting, we initiated the process by passing the abstracts of our set of 2,000 astronomy papers through the text-ada-002 embedding model. This was followed by their organization into a 2D TSNE projection, adeptly capturing the contextual disparities and parallels amongst the 2,000 papers.</p><p>For every hypothesis generated, an in-depth determination was conducted to ascertain the specific papers that contributed to its inception by querying the GPT model. Visually presents this 'knowledge footprint' for each hypothesis, depicted as black polygons within a green hull that signifies all the papers GPT-4 had access to. A discernible observation upper panels reveals that with the escalation in the number of papers, GPT notably expands its scope. It begins to assimilate diverse topics to construct interconnected hypotheses. It's pertinent to highlight that even with a modest training pool of 10 papers, our experiment is methodically designed to encompass a broad spectrum of topics. However, GPT-4's lack of adequate context leads to the formulation of more generalized hypotheses (refer to Appendix for further insight).</p><p>Focusing on the scenario with 1,000 papers, the bottom panel in provides an illuminating exploration of the evolution of the knowledge footprint with varying numbers of adversarial attacks. In the initial iteration (bottom left), the judge adeptly pinpoints areas for critique, grounded on the knowledge that the original response inadvertently overlooked. This proactively propels the generator to widen its scope further, as visibly illustrated in the bottom middle and right panels. This expansion aids in appropriately addressing the surfaced criticisms.</p><p>As showcased in some instances in the Appendix, the implementation of adversarial prompting empowers the GPT-4 model to genuinely leverage a voluminous number of contextual inputs. This strategic guidance navigates the model towards a more integrated and comprehensive comprehension of the topic, rather than resulting in a mere aggregation of disjointed facts. The Appendix further elucidates specific examples of generated hypotheses and their corresponding critiques, offering more profound insights into this exploration.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Exemplary Hypotheses and Critiques Generated.</head><p>This portion showcases three kinds of hypotheses, created via our in-context and adversarial prompting methods, coupled with the related critiques offered by our adversarial GPT-4 model. This feedback resulted in notable enhancements in hypothesis formation. The initial example illustrates a hypothesis formed post subjecting the model to in-context prompting through a review of 1000 papers, succeeded by two sequences of adversarial critiques <ref type="bibr" target="#b11">[12]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I suggest undertaking a study to explore the influence of dark matter mini-halos on the formation and progression of globular clusters (GCs) within the Milky Way. This research would merge the detailed astrometric information from Gaia EDR3 and DR3 with spectral data acquired from major terrestrial surveys such as APOGEE, GALAH, SDSS SEGUE, and LAMOST. This combination aims to analyze the movement of stars in the outer regions of GCs. The primary objective is to ascertain if these GCs are situated within dark matter mini-halos, potentially shedding light on their inception. Building on the probabilistic method pioneered by Kuzma et al. (2021) -which evaluates the outer areas of GCs by applying a mixture model in spatial and proper motion space to delineate cluster, extra-tidal, and contaminating stellar groups -this research would incorporate the impact of dark matter mini-halos on star kinematics in the outer regions of GCs. By broadening this methodology, we can evaluate the proposition that dark matter is instrumental in shaping and developing Gcs. Additionally, this analysis would offer a deeper comprehension of the dispersion and attributes of dark matter within the Milky Way, enriching the wider domain of proximate cosmology. Fusing this data with the existing understanding of the Milky Way's hierarchical structure and the significance of swift gas accretion in the inner Galactic disc would afford a holistic perspective on the genesis narrative of our Galaxy and the inherent distribution of dark matter.</head><p>This hypothesis garnered significant endorsement from the human panel, attributed to its extensive contextual comprehension of Galactic Astronomy, a solid thesis proposal, suitable references for conceptual foundations, and its expansion upon established methods and research. On the contrary, the subsequent hypothesis was generated when in-context prompting was confined to 10 papers. Hypotheses formed with such limited literary exposure frequently seem generic, devoid of meaningful insights and offering only trivial enhancements to existing work <ref type="bibr" target="#b12">[13]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I propose conducting research to examine the impact of intermediate-mass gas-rich mergers on the chemical and kinematic development of galaxies similar to the Milky Way, spanning various redshifts. This investigation would expand on the discoveries made by Ciuca et al. (2023), which indicate a significant influence of the Gaia-Sausage-Enceladus (GSE) merger on the creation of the Galactic disk and the shift from the thick disk to the thin disk. The study could incorporate data from extensive spectroscopic surveys like APOGEE, GALAH, and Gaia-ESO, alongside high-resolution cosmological simulations from projects such as the Auriga project. Central goals would encompass the characterization of chemical abundance patterns and kinematic traits of stars in the disk and halo sections during and post-mergers. It would also explore the effect of these mergers on the radial distribution and movement of stars throughout the Galactic disk.</head><p>By centering the research on intermediate-mass gas-rich mergers, it will bridge the existing knowledge void regarding the significance of these events in sculpting the form and progression of Milky Way-like galaxies. The results would further offer essential insights for the enhancement of galaxy formation models, contributing to a more exhaustive comprehension of our Galaxy's intricate history.</p><p>Without sufficient context, the model tends to extrapolate based on existing literature, resulting in incremental, less innovative suggestions. Importantly, despite the exposure to a corpus of 1000 papers, without adversarial attacks, the hypotheses can often seem like a jumble of facts, as shown in the example below:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I suggest a thorough, multi-wavelength investigation into the unexplored areas of the Galactic plane, especially focusing on the South Galactic cap, encompassing regions below a declination of -65°. The endeavor to chart these untouched regions can be realized by integrating existing extensive photometric, spectroscopic, and astrometric surveys like Gaia, LAMOST, APOGEE, and GALAH, along with new highresolution surveys targeting the mapping of these unknown zones of the Milky Way.</head><p>Emphasis should be placed on identifying and characterizing previously undiscovered ultra-faint dwarf galaxies, chemically unusual stars, and stellar streams, coupled with assessing their origins and potential impact on our comprehension of the Milky Way's formation and evolution. This research initiative takes inspiration from the unexplored territories highlighted in Antoja et al. <ref type="bibr">(2015)</ref>, the suggested enhancements by Gorski and Barmby (2020), and the advancements in large-scale surveys explored in studies like Yuan et al. ( <ref type="formula">2014</ref>) and An and Beers (2020).</p><p>Broadening our insight into these uncharted regions of the Galactic plane will greatly augment our understanding of the Milky Way's structure, formation, and evolution. Moreover, it will empower astronomers to uncover and evaluate previously unidentified star populations and their kinematic characteristics, further honing our understanding of the diverse elements and phenomena that mold our Galaxy.</p><p>Demonstrating adversarial prompting is critical to ensure that the GPT can harness the broad context provided in a meaningful way. Finally, an example of high-quality critique provided by the adversarial GPT-4 model is showcased. These adversarial critiques, after being primed with 1000 papers, are highly accurate and can rival the quality of expert human critique <ref type="bibr" target="#b13">[14]</ref>.</p><p>The proposed interdisciplinary method aspires to offer a thorough insight into the Milky Way's gravitational potential by merging photometric data, high-definition simulations, and innovative approaches for phase-space spirals analysis. Despite its promise, several weaknesses and limitations need attention.</p><p>Firstly, the heavy reliance on photometric data, while less susceptible to sampling biases, may encounter uncertainties from dust obscuration and crowding. The precision of photometric metallicity measurements could also fall short when compared to spectroscopic data.</p><p>Secondly, the usage of high-definition simulations for superior modeling of the Milky Way's potential comes with its drawbacks. These simulations, albeit detailed, demand extensive computational resources. The existing mass and spatial resolutions might not meet the requirements for a meticulous analysis. Additionally, the Milky Way's complexity, marked by its non-steady-state nature and the presence of various structures like spiral arms and bars, introduces additional hurdles to the modeling process.</p><p>Thirdly, employing novel techniques for analyzing phase-space spirals could be influenced by various systematic biases. The proposal somewhat overlooks the method of addressing these biases and the possible constraints of these techniques amidst the context of a multifaceted and evolving galaxy.</p><p>Lastly, the proposal inadequately covers the implications of different selection biases in spectroscopic surveys and the complications in incorporating the galaxy's complexity into extensive cosmological models. In summary, while the interdisciplinary strategy holds potential, substantial additional work is crucial to overcome these limitations and weaknesses for a more precise and thorough comprehension of the Milky Way's gravitational potential.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusion and Future Direction</head><p>Our investigation goes beyond just the theoretical framework or limited-scope testing. It is grounded in comprehensive, real-world applications, reflecting an intent to resolve actual challenges faced in the field of Galactic Astronomy. A nuanced understanding of the specifics of in-context and adversarial prompting allows us to refine and adapt our methodologies continuously, ensuring that the resulting enhancements are not just incremental but transformative.</p><p>An essential part of our future work lies in the significant expansion of the dataset employed for incontext prompting. Utilizing a more extensive and diverse collection of documents from the NASA ADS will further hone the precision and relevance of the generated hypotheses, allowing for a richer, more comprehensive exploration of potential research areas. Moreover, the utilization of advanced analytics tools and techniques will enable more in-depth analysis and evaluation, ensuring that the generated hypotheses stand up to rigorous scrutiny from both AI and human evaluators.</p><p>A parallel line of inquiry will be pursued, focusing on the integration of the GPT-4 model with other advanced AI and machine learning technologies. This collaborative approach seeks to harness the collective strengths of multiple AI models and algorithms, enhancing the robustness and reliability of hypothesis generation. This integration will also facilitate a more seamless and efficient evaluation process, with multiple AI models working in tandem to analyze, assess, and refine the generated hypotheses.</p><p>Furthermore, the insights gleaned from our extensive analysis will serve as a robust foundation for the development of more advanced, efficient, and reliable in-context and adversarial prompting mechanisms. These improved mechanisms will not only enhance the quality and reliability of hypothesis generation but will also significantly expedite the entire process. This heightened efficiency is crucial, given the rapidly growing volume and complexity of astronomical research, ensuring that researchers can keep pace with the latest developments and insights in the field.</p><p>In conclusion, this research signifies a substantial stride towards revolutionizing hypothesis generation in Galactic Astronomy through the adept utilization of advanced AI models like GPT-4. By continually refining and enhancing our methodologies and approaches, we aim to make a meaningful and lasting contribution to the field of astronomical research, driving forward the frontier of knowledge and discovery in this critical scientific domain. Our commitment to excellence, innovation, and continuous improvement will ensure that our research remains at the cutting edge, providing valuable insights, tools, and methodologies for the global astronomical research community.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Broader Impact.</head><p>This research primarily centered on in-context prompting, a conscious choice motivated by the objective to broaden the access and application of Large Language Models (LLMs) for scientific exploration. The current versions of GPT models, with their colossal parameter counts, often make the process of fine-tuning a nonviable endeavor. In this era, characterized by the proliferation of LLMs, a vital question arises: Can all academic entities, irrespective of their scale or computational capabilities, stay abreast with these swiftly progressing technologies?</p><p>This pressing question stands at the heart of our investigation, with the results shedding a beam of optimism on the scenario. Our research underlines that employing the right strategies and involving human experts in the loop can substantially diminish the obstacles in unleashing the full potential of these sophisticated LLMs. The insights gleaned from the study propel us towards a future where institutions, regardless of their size or resources, are not only contributors to but also beneficiaries of the rapid strides in AI. This harmonious and inclusive advancement bolsters the collective mission of unearthing scientific discoveries. In addition to establishing a framework that enables wider access to LLMs, our study also contributes to the understanding of their effective implementation. It highlights the significance of human experts' involvement, underscoring the invaluable contribution of human insight in enhancing AI capabilities. This confluence of human and AI expertise stands to fuel innovative solutions and unexpected discoveries in diverse scientific domains.</p><p>Our voyage into this novel realm of LLMs is in its nascent stage, and it holds the allure of a fascinating journey teeming with unanticipated knowledge and transformative milestones. As we navigate this path, we anticipate contributing to and witnessing the democratization of AI technologies, a development that holds the potential to revolutionize diverse facets of scientific research and discovery globally <ref type="bibr" target="#b14">[15]</ref>.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 .</head><label>1</label><figDesc>Figure 1. This figure illustrates the adversarial in-context prompting workflow using OpenAI's GPT-4 model.</figDesc><graphic coords="3,72.00,150.26,465.00,249.00" type="bitmap" /></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Self-consistency improves chain of thought reasoning in language models</title>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Schuurmans</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Le</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Chi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Zhou</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2203.11171</idno>
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Chain of thought prompting elicits reasoning in large language models</title>
		<author>
			<persName><forename type="first">J</forename><surname>Wei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Schuurmans</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bosma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Chi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Le</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Zhou</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2201.11903</idno>
		<imprint>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Automatic chain of thought prompting in large</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Smola</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.2210.03493</idno>
		<imprint>
			<date type="published" when="2022-10">October 2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title/>
		<author>
			<persName><forename type="first">A</forename><surname>Accomazzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Eichhorn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Kurtz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ASP Conf. Proc</title>
		<imprint>
			<biblScope unit="volume">377</biblScope>
			<biblScope unit="page">69</biblScope>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title/>
		<author>
			<persName><forename type="first">A</forename><surname>Allen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Duprie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Berriman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ASP Conf. Proc</title>
		<imprint>
			<biblScope unit="volume">475</biblScope>
			<biblScope unit="page">387</biblScope>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">An explanation of in-context learning as implicit bayesian inference</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M</forename><surname>Xie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Raghunathan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Liang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Ma</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2111.02080</idno>
		<imprint>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">S</forename><surname>Grant</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Thompson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Chyla</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1406.4542</idno>
		<title level="m">These Proceedings Henneken</title>
				<editor>
			<persName><forename type="first">A</forename><forename type="middle">A</forename><surname>Accomazzi</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Kurtz</surname></persName>
		</editor>
		<imprint>
			<date type="published" when="2014">2014. 2014. 2014</date>
		</imprint>
	</monogr>
	<note>These Proceedings Egret</note>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The Eighteenth Data Release of the Sloan Digital Sky Surveys: Targeting and First Spectra from SDSS-V</title>
		<author>
			<persName><forename type="first">A</forename><surname>Almeida</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">F</forename><surname>Anderson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Argudo-Fernández</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Badenes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Barger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">K</forename><surname>Barrera-Ballesteros</surname></persName>
		</author>
		<idno type="DOI">10.3847/1538-4365/acda98</idno>
	</analytic>
	<monogr>
		<title level="j">Astrophysical Journal, Supplement Series</title>
		<imprint>
			<biblScope unit="volume">267</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page">44</biblScope>
			<date type="published" when="2023-08-01">2023 Aug 1</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">ADS: The Next Generation Search Platform</title>
		<author>
			<persName><forename type="first">A</forename><surname>Accomazzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Kurtz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">A</forename><surname>Henneken</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Chyla</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Luker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">S</forename><surname>Grant</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Thompson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Holachek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Dave</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">S</forename><surname>Murray</surname></persName>
		</author>
		<idno type="DOI">10.48550/arXiv.1503.04194</idno>
	</analytic>
	<monogr>
		<title level="m">Open Science at the Frontiers of Librarianship</title>
		<title level="s">Astronomical Society of the Pacific Conference Series</title>
		<editor>
			<persName><forename type="first">A</forename><surname>Holl</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Lesteven</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Dietrich</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Gasperini</surname></persName>
		</editor>
		<imprint>
			<date type="published" when="2015-04">April 2015</date>
			<biblScope unit="volume">492</biblScope>
			<biblScope unit="page">189</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Study of galaxies on large-scale filaments in simulations</title>
		<author>
			<persName><forename type="first">A</forename><surname>Singh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mahajan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">S</forename><surname>Bagla</surname></persName>
		</author>
		<author>
			<persName><surname>Vig</surname></persName>
		</author>
		<idno type="DOI">10.1093/mnras/staa1913</idno>
		<idno type="arXiv">arXiv:1904.02679</idno>
	</analytic>
	<monogr>
		<title level="j">Monthly Notices of the Royal Astronomical Society</title>
		<imprint>
			<biblScope unit="volume">497</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="2265" to="2275" />
			<date type="published" when="2019">jul 2020. 2019</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
	<note>, J. Visualizing attention in transformer-based language representation models</note>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Galaxy evolution in all five CANDELS fields and IllustrisTNG: Morphological, structural, and the major merger evolution to z ∼ 3</title>
		<author>
			<persName><forename type="first">A</forename><surname>Whitney</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Ferreira</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">J</forename><surname>Conselice</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Duncan</surname></persName>
		</author>
		<idno type="DOI">10.3847/1538-4357/ac1422</idno>
	</analytic>
	<monogr>
		<title level="j">The Astrophysical Journal</title>
		<imprint>
			<biblScope unit="volume">919</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page">139</biblScope>
			<date type="published" when="2021-10">oct 2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<author>
			<persName><forename type="first">J</forename><surname>Holmquist</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Kurtz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Eichhorn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Accomazzi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">These Proceedings</title>
				<imprint>
			<date type="published" when="2000">2014. 2000</date>
			<biblScope unit="volume">143</biblScope>
			<biblScope unit="page">41</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Kurtz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Eichhorn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Accomazzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">S</forename><surname>Grant</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">S</forename><surname>Murray</surname></persName>
		</author>
		<title level="m">SPIE Conf.Proc</title>
				<imprint>
			<date type="published" when="2002">2002</date>
			<biblScope unit="volume">4847</biblScope>
			<biblScope unit="page">238</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title/>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Kurtz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Eichhorn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Accomazzi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of the American Society for Information Science and Technology</title>
		<imprint>
			<biblScope unit="volume">56</biblScope>
			<biblScope unit="page">2</biblScope>
			<date type="published" when="2000">2005. 2000</date>
		</imprint>
	</monogr>
	<note>A&amp;AS</note>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">The Role of Innovation in Economic Growth: Information and Analytical Aspect</title>
		<author>
			<persName><forename type="first">O</forename><surname>Kalivoshko</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Kraevsky</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Burdeha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Lyutyy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Kiktev</surname></persName>
		</author>
		<idno type="DOI">10.1109/PICST54195.2021.9772201</idno>
	</analytic>
	<monogr>
		<title level="m">IEEE 8th International Conference on Problems of Infocommunications, Science and Technology (PIC S&amp;T)</title>
				<meeting><address><addrLine>Kharkiv, Ukraine</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2021">2021. 2021</date>
			<biblScope unit="page" from="120" to="124" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
