Academic

TradeFM: A Generative Foundation Model for Trade-flow and Market Microstructure

arXiv:2602.23784v1 Announce Type: new Abstract: Foundation models have transformed domains from language to genomics by learning general-purpose representations from large-scale, heterogeneous data. We introduce TradeFM, a 524M-parameter generative Transformer that brings this paradigm to market microstructure, learning directly from billions of trade events across >9K equities. To enable cross-asset generalization, we develop scale-invariant features and a universal tokenization scheme that map the heterogeneous, multi-modal event stream of order flow into a unified discrete sequence -- eliminating asset-specific calibration. Integrated with a deterministic market simulator, TradeFM-generated rollouts reproduce key stylized facts of financial returns, including heavy tails, volatility clustering, and absence of return autocorrelation. Quantitatively, TradeFM achieves 2-3x lower distributional error than Compound Hawkes baselines and generalizes zero-shot to geographically out-of-dist

arXiv:2602.23784v1 Announce Type: new Abstract: Foundation models have transformed domains from language to genomics by learning general-purpose representations from large-scale, heterogeneous data. We introduce TradeFM, a 524M-parameter generative Transformer that brings this paradigm to market microstructure, learning directly from billions of trade events across >9K equities. To enable cross-asset generalization, we develop scale-invariant features and a universal tokenization scheme that map the heterogeneous, multi-modal event stream of order flow into a unified discrete sequence -- eliminating asset-specific calibration. Integrated with a deterministic market simulator, TradeFM-generated rollouts reproduce key stylized facts of financial returns, including heavy tails, volatility clustering, and absence of return autocorrelation. Quantitatively, TradeFM achieves 2-3x lower distributional error than Compound Hawkes baselines and generalizes zero-shot to geographically out-of-distribution APAC markets with moderate perplexity degradation. Together, these results suggest that scale-invariant trade representations capture transferable structure in market microstructure, opening a path toward synthetic data generation, stress testing, and learning-based trading agents.

Executive Summary

This article introduces TradeFM, a generative Transformer model that learns from billions of trade events to capture market microstructure. TradeFM achieves state-of-the-art results in reproducing key stylized facts of financial returns and generalizes to geographically out-of-distribution markets. The model's scale-invariant features and universal tokenization scheme enable cross-asset generalization, paving the way for synthetic data generation, stress testing, and learning-based trading agents. The authors' use of a deterministic market simulator and Compound Hawkes baseline further strengthens their findings. Overall, TradeFM represents a significant advancement in the field of market microstructure and has far-reaching implications for the finance industry.

Key Points

  • TradeFM is a generative Transformer model that learns from billions of trade events.
  • The model achieves state-of-the-art results in reproducing key stylized facts of financial returns.
  • TradeFM generalizes to geographically out-of-distribution markets with moderate perplexity degradation.

Merits

Strength in Interpretable Results

The authors' use of a deterministic market simulator and Compound Hawkes baseline provides strong evidence for the effectiveness of TradeFM in capturing market microstructure.

Potential for Synthetic Data Generation

TradeFM's ability to generate synthetic data has the potential to revolutionize the finance industry by providing a more efficient and cost-effective alternative to traditional data collection methods.

Demerits

Limited Generalizability to Extreme Events

The authors acknowledge that TradeFM's performance may degrade in extreme events, such as market crashes or economic downturns, which could limit its practical applications.

Dependence on High-Quality Training Data

TradeFM's performance relies heavily on the quality of the training data, which may not be readily available or accurate for all markets or assets.

Expert Commentary

The introduction of TradeFM represents a significant breakthrough in the field of market microstructure. The model's ability to capture the complex dynamics of financial markets and generate synthetic data has far-reaching implications for the finance industry. However, the authors acknowledge several limitations, including the potential for performance degradation in extreme events and dependence on high-quality training data. Despite these limitations, TradeFM has the potential to transform the finance industry by providing a more efficient and cost-effective alternative to traditional data collection methods.

Recommendations

  • Further research is needed to explore the potential applications of TradeFM in high-frequency trading strategies and risk management practices.
  • The authors should investigate methods to improve the model's performance in extreme events and reduce its dependence on high-quality training data.

Sources