Assessing the Emergent Symbolic Reasoning Abilities of Llama Large Language Models

Petruzzellis, F.; Testolin, A.; Sperduti, A.

doi:10.1007/978-3-031-72344-5_18

Large Language Models (LLMs) achieve impressive performance in a wide range of tasks, even if they are often trained with the only objective of chatting fluently with users. Among other skills, LLMs show emergent abilities in mathematical reasoning benchmarks, which can be elicited with appropriate prompting methods. In this work, we systematically investigate the capabilities and limitations of popular open-source LLMs on different symbolic reasoning tasks. We evaluate three models of the Llama 2 family on two datasets that require solving mathematical formulas of varying degrees of difficulty. We test a generalist LLM (Llama 2 Chat) as well as two fine-tuned versions of Llama 2 (MAmmoTH and MetaMath) specifically designed to tackle mathematical problems. We observe that both increasing the scale of the model and fine-tuning it on relevant tasks lead to significant performance gains. Furthermore, using fine-grained evaluation measures, we find that such performance gains are mostly observed with mathematical formulas of low complexity, which nevertheless often remain challenging even for the largest fine-tuned models.

Assessing the Emergent Symbolic Reasoning Abilities of Llama Large Language Models

Petruzzellis F.;Testolin A.;Sperduti A.

2024

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Titolo del Libro
	
				LECTURE NOTES IN COMPUTER SCIENCE
			
	Collana/serie monografica
	
				LECTURE NOTES IN COMPUTER SCIENCE
			
	Titolo convegno
	
				International Conference on Artificial Neural Networks
			
	Codice DOI
	
				https://dx.doi.org/10.1007/978-3-031-72344-5_18
			
	Codice WOS
	
				WOS:001331889300018
			
	Codice Scopus
	
				2-s2.0-85205298938
			
	Codice OpenAlex
	
				W4402798551
			
	Codice ISBN
	
				9783031723438
9783031723445
			
	Appare nelle tipologie:
	
				04.01 - Contributo in atti di convegno

File in questo prodotto:

File	Dimensione	Formato
2406.06588v1.pdf accesso aperto Descrizione: main article file Tipologia: Preprint (AM - Author's Manuscript - submitted) Licenza: Altro Dimensione 442.12 kB Formato Adobe PDF Visualizza/Apri	442.12 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3540732

Assessing the Emergent Symbolic Reasoning Abilities of Llama Large Language Models

Petruzzellis F.;Testolin A.;Sperduti A.

2024

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Pubblicazioni consigliate

Citazioni

social impact

Assessing the Emergent Symbolic Reasoning Abilities of Llama Large Language Models

Petruzzellis F.;Testolin A.;Sperduti A.

2024

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)