# Xin DU
Assistant Professor, School of Fundamental Science and Engineering
Waseda University, Tokyo, Japan
I work with Professor Kumiko Tanaka-Ishii on complex-systems approaches to understanding natural language and autoregressive language models. I’m also investigating financial econometrics and risk modeling from a complex-system perspective.
[Lab homepage](https://ml-waseda.jp ) [研究室HP](https://ja.ml-waseda.jp)
# Selected Publications
**Language and Complexity**
- Xin Du and Kumiko Tanaka-Ishii. Correlation Dimension of Autoregressive Large Language Models. _NeurIPS 2025_
- Xin Du and Kumiko Tanaka-Ishii. Correlation Dimension of Natural Language in A Statistical Manifold. _Physical Review Research. 2024_
- Xin Du and Kumiko Tanaka-Ishii. FIRE: Semantic Field of Words Represented as Nonlinear Functions. _NeurIPS 2022_
**Retrieval, Clustering**
- Xin Du and Kumiko Tanaka-Ishii. Information-Theoretic Generative Clustering of Documents. _AAAI 2025_
- Xin Du, Lixin Xiu, and Kumiko Tanaka-Ishii. Bottleneck-Minimal Indexing for Generative Document Retrieval. _ICML 2024 Oral_
**Finance and Language**
- Xin Du and Kumiko Tanaka-Ishii. Stock embeddings acquired from news articles and price history, and an application to portfolio optimization. _ACL 2020_
- Xin Du and Kumiko Tanaka-Ishii. Stock portfolio selection balancing variance and tail risk via stock vector representation acquired from price data and texts. _Knowledge-Based Systems_