Browsing: arXiv AI
arXiv:2505.17005v1 Announce Type: cross Abstract: Large Language Models (LLMs) are powerful but prone to hallucinations due to static knowledge. Retrieval-Augmented…
arXiv:2505.16927v1 Announce Type: cross Abstract: When language model (LM) users aim to improve the quality of its generations, it is…
arXiv:2505.14970v1 Announce Type: new Abstract: Reinforcement learning (RL) has proven effective for fine-tuning large language models (LLMs), significantly enhancing their…
arXiv:2505.15068v1 Announce Type: new Abstract: Recent progress in large language models (LLMs) has enabled substantial advances in solving mathematical problems.…
arXiv:2505.15011v1 Announce Type: new Abstract: Our society is governed by a set of norms which together bring about the values…
[Submitted on 3 May 2025 (v1), last revised 21 May 2025 (this version, v3)] View a PDF of the paper…
[Submitted on 11 Feb 2025 (v1), last revised 20 May 2025 (this version, v3)] View a PDF of the paper…
[Submitted on 21 May 2025] View a PDF of the paper titled Improving planning and MBRL with temporally-extended actions, by…
arXiv:2505.15647v1 Announce Type: cross Abstract: We investigate the problem of finding second-order stationary points (SOSP) in differentially private (DP) stochastic…
[Submitted on 29 Apr 2025 (v1), last revised 21 May 2025 (this version, v2)] View a PDF of the paper…