View a PDF of the paper titled Next-Token Prediction Task Assumes Optimal Data Ordering for LLM Training in Proof Generation, by Chenyang An and 10 other authors
View PDF
HTML (experimental)
Abstract:In the field of large language model (LLM)-based proof generation, despite extensive training on large datasets such as ArXiv, LLMs still exhibit only modest performance on proving tasks of moderate difficulty. We believe that this is partly due to the widespread presence of suboptimal ordering within the data for each proof used in training. For example, published proofs often follow a purely logical order, where each step logically proceeds from the previous steps based on the deductive rules. This order is designed to facilitate the verification of the proof’s soundness, rather than to help people and models learn the discovery process of the proof. In proof generation, we argue that the optimal order for one training data sample occurs when the relevant intermediate supervision for a particular proof step in the proof is always positioned to the left of that proof step. We call such order the intuitively sequential order. We validate our claims using two tasks: intuitionistic propositional logic theorem-proving and digit multiplication. Our experiments verify the order effect and provide support for our explanations. We demonstrate that training is most effective when the proof is in the intuitively sequential order. Moreover, the order effect and the performance gap between models trained on different data orders can be substantial — with an 11 percent improvement in proof success rate observed in the propositional logic theorem-proving task, between models trained on the optimal order compared to the worst order. Lastly, we define a common type of order issue in advanced math proofs and find that 17.3 percent of theorems with nontrivial proofs in the first two chapters of a widely used graduate-level mathematics textbook suffer from this issue. A detailed list of those proofs is provided in the appendix.
Submission history
From: Chenyang An [view email]
[v1]
Wed, 30 Oct 2024 18:00:04 UTC (308 KB)
[v2]
Thu, 3 Jul 2025 15:14:51 UTC (271 KB)
13 Comments
карго из китая в россию доставка карго
русское порно бесплатно смотреть русское порно
Want to have fun? porno bangladesh melbet Watch porn, buy heroin or ecstasy. Pick up whores or buy marijuana. Come in, we’re waiting
Новые актуальные промокод iherb для выгодных покупок! Скидки на витамины, БАДы, косметику и товары для здоровья. Экономьте до 30% на заказах, используйте проверенные купоны и наслаждайтесь выгодным шопингом.
решение курсовых заказать курсовую в москве
займ онлайн быстрый займ онлайн
займ на карту онлайн мгновенно займы онлайн на карту без проверок
нотариус перевод документов услуги бюро переводов
buy drugs in prague cocaine prague
buy coke in telegram cocain in prague from columbia
prague drugstore cocain in prague fishscale
plug in prague cocain in prague from columbia
joszaki regisztracio https://joszaki.hu/