BlackBoxToBlueprint: Extracting Interpretable Logic from Legacy Systems using Reinforcement Learning and Counterfactual Analysis

arXiv:2507.00180v1 Announce Type: new
Abstract: Modernizing legacy software systems is a critical but challenging task, often hampered by a lack of documentation and understanding of the original system’s intricate decision logic. Traditional approaches like behavioral cloning merely replicate input-output behavior without capturing the underlying intent. This paper proposes a novel pipeline to automatically extract interpretable decision logic from legacy systems treated as black boxes. The approach uses a Reinforcement Learning (RL) agent to explore the input space and identify critical decision boundaries by rewarding actions that cause meaningful changes in the system’s output. These counterfactual state transitions, where the output changes, are collected and clustered using K-Means. Decision trees are then trained on these clusters to extract human-readable rules that approximate the system’s decision logic near the identified boundaries. I demonstrated the pipeline’s effectiveness on three dummy legacy systems with varying complexity, including threshold-based, combined-conditional, and non-linear range logic. Results show that the RL agent successfully focuses exploration on relevant boundary regions, and the extracted rules accurately reflect the core logic of the underlying dummy systems, providing a promising foundation for generating specifications and test cases during legacy migration.

Source link

What's Hot

AI job predictions become corporate America’s newest competitive sport

5000 Fellow Scholars Special! | Two Minute Papers

Google’s Launches Gemma 3n to Deliver Smarter, Offline AI to Mobile Devices and Laptops

BlackBoxToBlueprint: Extracting Interpretable Logic from Legacy Systems using Reinforcement Learning and Counterfactual Analysis

A Comparative Study of Whisper and Wav2Vec-BERT on Bangla

SEZ-HARN: Self-Explainable Zero-shot Human Activity Recognition Network

Thinking About Thinking: SAGE-nano's Inverse Reasoning for Self-Aware Language Models

Khaled Sabsabi Reinstated as Australia’s Venice Biennale Artist

Peter Phillips, British Pop Art Originator, Dies at 86

Hundreds of Ancient Ceramics Found In Preserved Shipwreck in Turkey

Canaletto Auction Record Smashed at Christie’s London

AI job predictions become corporate America’s newest competitive sport

5000 Fellow Scholars Special! | Two Minute Papers

Google’s Launches Gemma 3n to Deliver Smarter, Offline AI to Mobile Devices and Laptops

What's Hot

BlackBoxToBlueprint: Extracting Interpretable Logic from Legacy Systems using Reinforcement Learning and Counterfactual Analysis

Related Posts

Subscribe to Updates