Academic

Noise Steering for Controlled Text Generation: Improving Diversity and Reading-Level Fidelity in Arabic Educational Story Generation

arXiv:2604.03380v1 Announce Type: new Abstract: Generating diverse, pedagogically valid stories for Arabic early-grade reading assessments requires balancing tight constraints on vocabulary, reading level, and narrative structure against the need to avoid repetitive plots that undermine assessment validity. We investigate noise steering, injecting calibrated Gaussian perturbations into the internal representations of transformer models at inference time, as a training-free diversity method evaluated across five small Arabic-centric language models (7-9B parameters). We compare four injection strategies against high-temperature sampling baselines, measuring diversity, quality, constraint adherence, and reading grade level. Residual stream noise consistently improves narrative diversity with minimal quality or constraint cost and preserves early-grade reading level across all models. Attention entropy noise injection (AENI) stabilizes the otherwise unreliable attention-logit noise while

arXiv:2604.03380v1 Announce Type: new Abstract: Generating diverse, pedagogically valid stories for Arabic early-grade reading assessments requires balancing tight constraints on vocabulary, reading level, and narrative structure against the need to avoid repetitive plots that undermine assessment validity. We investigate noise steering, injecting calibrated Gaussian perturbations into the internal representations of transformer models at inference time, as a training-free diversity method evaluated across five small Arabic-centric language models (7-9B parameters). We compare four injection strategies against high-temperature sampling baselines, measuring diversity, quality, constraint adherence, and reading grade level. Residual stream noise consistently improves narrative diversity with minimal quality or constraint cost and preserves early-grade reading level across all models. Attention entropy noise injection (AENI) stabilizes the otherwise unreliable attention-logit noise while recovering quality. High-temperature sampling inflates reading grade level and causes catastrophic collapse on several models. We find internal representation-level perturbation to be a more suitable diversity strategy than output-level stochasticity for constrained educational content generation.

Executive Summary

This study examines the use of noise steering to enhance diversity and reading-level fidelity in Arabic educational story generation. The researchers employ four different noise injection strategies and compare them to high-temperature sampling baselines. The results indicate that residual stream noise and attention entropy noise injection are effective in improving narrative diversity while maintaining quality and reading grade level. The study highlights the potential of internal representation-level perturbation as a suitable diversity strategy for constrained educational content generation. The findings have implications for the development of AI-powered educational tools and assessments.

Key Points

  • Noise steering is a training-free diversity method that can improve narrative diversity in Arabic educational story generation.
  • Residual stream noise and attention entropy noise injection are effective noise injection strategies for enhancing diversity without compromising quality or reading grade level.
  • Internal representation-level perturbation is a more suitable diversity strategy than output-level stochasticity for constrained educational content generation.

Merits

Strength in Methodology

The study employs a rigorous experimental design, including the use of five small Arabic-centric language models and multiple noise injection strategies, to evaluate the effectiveness of noise steering in enhancing diversity and reading-level fidelity.

Contributions to the Field

The study provides valuable insights into the use of noise steering for constrained educational content generation, highlighting its potential as a training-free diversity method for enhancing narrative diversity and reading grade level.

Demerits

Limitation in Generalizability

The study focuses on small Arabic-centric language models, which may limit the generalizability of the findings to larger language models or other languages.

Need for Further Research

The study suggests that further research is needed to explore the use of noise steering in other educational contexts and to develop more effective noise injection strategies.

Expert Commentary

The study provides a valuable contribution to the field of natural language processing, highlighting the potential of noise steering as a training-free diversity method for enhancing narrative diversity and reading grade level in Arabic educational story generation. The study's use of a rigorous experimental design and multiple noise injection strategies provides a robust evaluation of the effectiveness of noise steering. However, the study's focus on small Arabic-centric language models may limit the generalizability of the findings. Furthermore, the study suggests that further research is needed to explore the use of noise steering in other educational contexts and to develop more effective noise injection strategies.

Recommendations

  • Future studies should investigate the use of noise steering in other educational contexts, such as language learning or mathematics education.
  • Developers of AI-powered educational tools and assessments should consider the use of noise steering as a training-free diversity method for enhancing narrative diversity and reading grade level.

Sources

Original: arXiv - cs.CL