arXiv AI

ChartCards: A Chart-Metadata Generation Framework for Multi-Task Chart Understanding

By Advanced AI Editor, May 23, 2025

[Submitted on 21 May 2025 (v1), last revised 22 May 2025 (this version, v2)]

Paper: ChartCards: A Chart-Metadata Generation Framework for Multi-Task Chart Understanding, by Yifan Wu and 5 other authors

Abstract: The emergence of Multi-modal Large Language Models (MLLMs) presents new opportunities for chart understanding. However, due to the fine-grained nature of these tasks, applying MLLMs typically requires large, high-quality datasets for task-specific fine-tuning, leading to high data collection and training costs. To address this, we propose ChartCards, a unified chart-metadata generation framework for multi-task chart understanding. ChartCards systematically synthesizes various chart information, including data tables, visualization code, visual elements, and multi-dimensional semantic captions. By structuring this information into organized metadata, ChartCards enables a single chart to support multiple downstream tasks, such as text-to-chart retrieval, chart summarization, chart-to-table conversion, chart description, and chart question answering. Using ChartCards, we further construct MetaChart, a large-scale high-quality dataset containing 10,862 data tables, 85K charts, and 170K high-quality chart captions. We validate the dataset through qualitative crowdsourcing evaluations and quantitative fine-tuning experiments across various chart understanding tasks. Fine-tuning six different models on MetaChart resulted in an average performance improvement of 5% across all tasks. The most notable improvements are seen in text-to-chart retrieval and chart-to-table tasks, with Long-CLIP and Llama 3.2-11B achieving improvements of 17% and 28%, respectively.
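
To make the "one chart, many tasks" idea concrete, here is a minimal sketch of how a single metadata record could be fanned out into training samples for several tasks. It is an illustration only: the `ChartCard` class, its field names, the `to_task_samples` helper, and the image naming are assumptions made for this example, not the schema or code used by the paper's authors.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ChartCard:
    """Hypothetical metadata record for one chart (field names are assumptions)."""
    chart_id: str
    data_table: list[dict[str, object]]  # rows of the underlying data table
    vis_code: str                        # code that renders the chart
    visual_elements: dict[str, str]      # e.g. chart type, axis titles
    # One caption per semantic dimension (dimension names are made up here).
    captions: dict[str, str] = field(default_factory=dict)


def to_task_samples(card: ChartCard) -> dict[str, list[dict[str, object]]]:
    """Derive (input, target) pairs for several downstream tasks from one card."""
    image_ref = f"{card.chart_id}.png"  # assumed image naming, for illustration
    return {
        # Chart image in, data table out.
        "chart_to_table": [{"input": image_ref, "target": card.data_table}],
        # Caption text in, matching chart image out.
        "text_to_chart_retrieval": [
            {"input": text, "target": image_ref} for text in card.captions.values()
        ],
        # Chart image in, caption text out.
        "chart_summarization": [
            {"input": image_ref, "target": text} for text in card.captions.values()
        ],
    }


card = ChartCard(
    chart_id="demo-001",
    data_table=[{"year": 2023, "sales": 10}, {"year": 2024, "sales": 14}],
    vis_code="plt.bar([2023, 2024], [10, 14])",
    visual_elements={"chart_type": "bar", "x_title": "year", "y_title": "sales"},
    captions={
        "trend": "Sales rose from 10 in 2023 to 14 in 2024.",
        "structure": "A bar chart with one bar per year.",
    },
)
samples = to_task_samples(card)
```

In this sketch, each caption yields one retrieval sample and one summarization sample, while the data table yields a single chart-to-table sample, so the same record feeds three tasks without any extra annotation.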

Submission history

From: Lutao Yan
[v1] Wed, 21 May 2025 03:07:47 UTC (2,894 KB)
[v2] Thu, 22 May 2025 15:16:47 UTC (2,895 KB)
