Hi, Iโ€™m Haonan (Eric) Gao ๐Ÿ˜Š.

Welcome

I am currently a second-year Masterโ€™s student in Biostatistics (Data Science track) at Yale University, working at the intersection of machine learning, statistics, and practical decision systems.

Machine Learning Statistical Modeling High-dimensional and large-scale data analysis NLP / LLM Optimization Data Science
๐Ÿ’ป Applied Machine Learning

I build and deploy machine learning models for real-world applications, covering the full workflow from data preparation and feature engineering to model training, evaluation, and production deployment.

๐Ÿ“Š Data Science & Statistical Modeling

I apply statistical modeling and data science techniques to extract insights from complex and high-dimensional datasets, with an emphasis on robust analysis and data-driven decision making.

โš™๏ธ AI Systems & Scalable Data

I design scalable data and AI systems for large datasets, working with distributed data processing, modern ML infrastructure, and efficient pipelines for large-scale machine learning workflows.

๐Ÿงฌ Current Work on Brain SC-FC Connectivity

With Prof. Yize Zhao, I develop statistical/computational methods for structural-functional brain network modeling, including low-rank + sparse factorization, proximal coordinate methods, and synthetic pipelines for identifiability studies.

๐Ÿ“ˆ Current Work on Financial GenAI Work

With Prof. Song Ma and Prof. Allen Hu, I build a RAG system over 2TB+ of financial text data, reaching ~85% forecasting accuracy while reducing token usage by ~30% through memory optimization.