Browsing: Hugging Face
Large Language Model (LLM) safety is one of the most pressing challenges for enabling wide-scale deployment. While most studies and…
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain – Takara TLDR
The relationship between computing systems and the brain has served as motivation for pioneering theoreticians since John von Neumann and…
We introduce OceanGym, the first comprehensive benchmark for ocean underwater embodied agents, designed to advance AI in one of the…
Developing autonomous agents that effectively interact with Graphic User Interfaces (GUIs) remains a challenging open problem, especially for small on-device…
Voice Evaluation of Reasoning Ability: Diagnosing the Modality-Induced Performance Gap – Takara TLDR
We present Voice Evaluation of Reasoning Ability (VERA), a benchmark for evaluating reasoning ability in voice-interactive systems under real-time conversational…
Recent advances in video generation have enabled high-fidelity video synthesis from user provided prompts. However, existing models and benchmarks fail…
While large language models (LLMs) with reasoning capabilities are progressing rapidly on high-school math competitions and coding, can they reason…
While previous AI Scientist systems can generate novel findings, they often lack the focus to produce scientifically valuable contributions that…
Panorama has a full FoV (360$^\circ\times$180$^\circ$), offering a more complete visual description than perspective images. Thanks to this characteristic, panoramic…
Large Language Models (LLMs), despite being trained on text alone, surprisingly develop rich visual priors. These priors allow latent visual…