Causal Inference in Neural Language Generation Models Through Interventional Probing and Counterfactual Evaluation
Abstract
Understanding the causal mechanisms underlying neural language generation models (NLGMs) is essential for improving model interpretability and controllability. This paper explores causal inference within large-scale transformer-based language models using interventional probing and counterfactual evaluation. We propose a framework that uses synthetic interventions to disentangle the causal contributions of internal representations to linguistic output, and we assess model behavior across counterfactual scenarios. Our empirical results on GPT-2 and BART demonstrate that causal traces in hidden layers correspond to syntactic and semantic decision points. This study contributes to a growing body of literature integrating causal inference with deep learning interpretability.
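To make the notion of an interventional probe concrete, the following is a minimal sketch, not the paper's exact procedure: it patches the hidden state at one GPT-2 layer with activations computed from a counterfactual prompt and compares next-token predictions. It assumes the HuggingFace transformers library; the layer index, prompts, and token-aligned patching strategy are illustrative choices.

```python
# Minimal sketch of an interventional (activation-patching) probe on GPT-2.
# Assumes HuggingFace `transformers`; layer, prompts, and patching scheme are
# illustrative assumptions, not the exact setup described in the paper.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

LAYER = 6  # hypothetical intervention site

def get_hidden(prompt, layer):
    """Return the hidden state emitted by transformer block `layer`."""
    ids = tokenizer(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        out = model(ids, output_hidden_states=True)
    # hidden_states[0] is the embedding output, so block L is index L + 1.
    return out.hidden_states[layer + 1]

def next_token_logits(prompt, patch=None, layer=LAYER):
    """Next-token logits, optionally splicing `patch` into `layer`'s output."""
    ids = tokenizer(prompt, return_tensors="pt").input_ids
    handle = None
    if patch is not None:
        def hook(module, inputs, output):
            # GPT2Block returns a tuple; element 0 is the hidden state.
            hidden = output[0].clone()
            n = min(hidden.shape[1], patch.shape[1])
            hidden[:, :n, :] = patch[:, :n, :]  # counterfactual activations
            return (hidden,) + output[1:]
        handle = model.transformer.h[layer].register_forward_hook(hook)
    try:
        with torch.no_grad():
            logits = model(ids).logits[0, -1]
    finally:
        if handle is not None:
            handle.remove()
    return logits

factual = "The nurse said that she"
counterfactual = "The doctor said that he"

baseline = next_token_logits(factual)
patched = next_token_logits(factual, patch=get_hidden(counterfactual, LAYER))
print("Top token (baseline):", tokenizer.decode([baseline.argmax().item()]))
print("Top token (patched): ", tokenizer.decode([patched.argmax().item()]))
```

A shift in the predicted continuation under the patched run, relative to the baseline, would indicate that the chosen layer carries causally relevant information for that decision point; the counterfactual evaluation described in the abstract generalizes this comparison across systematically varied inputs.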