Browsing: Hugging Face
We introduce the 🤗 MigrationBench dataset, a benchmark tailored for repository-level code migration, specifically targeting Java 8 to 17 or…
Intelligent game creation represents a transformative advancement in game development, utilizing generative artificial intelligence to dynamically generate and enhance game…
Sparse mixture of experts (SMoE) offers an appealing solution for scaling up model complexity beyond the means of increasing…
Despite their remarkable success and deployment across diverse workflows, language models sometimes produce untruthful responses. Our limited understanding of how…
This research offers a unique evaluation of how AI systems interpret the digital language of Generation Alpha (Gen Alpha, born…
Despite significant advances in large language models (LLMs), their knowledge memorization capabilities remain underexplored, due to the lack of standardized…
Tokenization is the first – and often underappreciated – layer of computation in language models. While Chain-of-Thought (CoT) prompting enables…
Image generation models are now widely applied. For instance, the TarFlow model combines the transformer architecture with Normalizing Flow…
While Multimodal Large Language Models (MLLMs) have achieved impressive progress in vision-language understanding, they still struggle with complex multi-step reasoning,…
Reasoning Language Models, capable of extended chain-of-thought reasoning, have demonstrated remarkable performance on tasks requiring complex logical inference. However, applying…