Browsing: arXiv AI
[Submitted on 11 Aug 2024 (v1), last revised 28 May 2025 (this version, v2)] Authors:Chunyu Qiang, Wang Geng, Yi Zhao,…
arXiv:2505.21887v1 Announce Type: new Abstract: Robust routing under uncertainty is central to real-world logistics, yet most benchmarks assume static, idealized…
[Submitted on 17 Mar 2025 (v1), last revised 28 May 2025 (this version, v3)] View a PDF of the paper…
[Submitted on 2 Feb 2025 (v1), last revised 27 May 2025 (this version, v2)] View a PDF of the paper…
arXiv:2505.22425v1 Announce Type: cross Abstract: Large language models (LLMs) have made significant advances in complex reasoning tasks, yet they remain…
[Submitted on 25 May 2025 (v1), last revised 28 May 2025 (this version, v2)] View a PDF of the paper…
arXiv:2505.22617v1 Announce Type: cross Abstract: This paper aims to overcome a major obstacle in scaling RL for reasoning with LLMs,…
[Submitted on 28 May 2025] View a PDF of the paper titled Training RL Agents for Multi-Objective Network Defense Tasks,…
arXiv:2505.20313v1 Announce Type: new Abstract: Knowledge representation and reasoning in neural networks have been a long-standing endeavor which has attracted…
arXiv:2505.20316v1 Announce Type: new Abstract: Large Language Models (LLMs) have been widely adopted in ranking systems such as information retrieval…