Who am I?

A curious data miner

who cannot fall asleep upon new findings — from trillion-scale trading signals to higher-order patterns in complex networks.

A creative programmer

who loves building tools — from award-winning network visualization software to smart home automation and custom financial news readers.

A critical thinker

who leads teams by asking the right questions, from data methodology standards to firm-wide research strategy.

Research Interest

Data mining and network science

Applications

Using network models and data science to solve interdisciplinary problems in complex systems such as financial market (e.g., information diffusion and trading behavior), social network (e.g., online / mobile phone social interactions), and biology (e.g., species invasions via global shipping) in close collaboration with domain experts.

Theory

Network representation of various types of data; in particular, methods and influences of embedding rich information such as higher-order dependencies into networks.

What's new

Aug 2025
KDD 2025 — Sponsoring Finance Day
Toronto, Canada
Citadel sponsors the Finance Day at the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining
Jul–Aug 2025
ACL 2025
Vienna, Austria
Attending the 63rd Annual Meeting of the Association for Computational Linguistics
Jul 2025
ICML 2025
Vancouver, Canada
Attending the 42nd International Conference on Machine Learning
2025
RISE AI — University of Notre Dame
Notre Dame, IN
Participating in RISE AI, Notre Dame's initiative at the intersection of AI and society
2021–2022
Guest Lecturer — University of Illinois Urbana-Champaign
Invited talk on the alternative data landscape for the graduate class "Portfolio Management" and undergraduate class "Financial Innovation"
2022
Guest Lecturer — University of Notre Dame
Invited talk on alternative data for fixed income for the MBA class "Fixed Income Securities"
Jul 5, 2017
Joined Citadel as data scientist
Data Strategies Group
Apr 18 and Jun 20, 2017
Present higher-order network at IoTDI (CPSWeek) and HONS (NetSci)
IoTDI: ACM/IEEE International Conference on Internet-of-Things Design and Implementation, at Pittsburgh, PA
14:30 - 15:00, White River 103, JW Marriot Indianapolis
May 19, 2017
Successful Ph.D. Defense: Representing Big Data as Networks: New Methods and Insights
Dec 7, 2016
Presentation at Northwestern University
Network Science Collaborative Technology Alliance (NS CTA)
For project "Harvesting Social Signals in Multi-genre Networks to Detect Threatening Emergent Phenomena (I4)"
Aug 2015 and Aug 2016
Network Science Collaborative Technology Alliance (NS CTA)
On project "Discovering Network Processes in Time-evolving Networks (C2)"
Jun 24 - 26, 2016
For higher-order network (HON), and the coauthored paper "Structural diversity and homophily: A study across more than one hundred large-scale networks" with Yuxiao Dong, Reid Johnson, and Nitesh Chawla
May 20, 2016
In the top 5% of all research outputs scored by Altmetric, 97th percentile attention score compared to outputs of the same age, featured in 8 news outlets

Professional Experience

2025 – Present
Head, Data Strategies Group
Citadel LLC, New York, NY
Manage and scale the Data Strategies Group, the firm's central alternative data and ML/AI research team. Oversee alternative data research initiatives delivering alpha-generating features from trillion-scale datasets; team's signals directly impact trading decisions across equities, fixed income, quant strategies, commodities, and more.
2021 – 2025
Quantitative Data Research Lead
Citadel LLC, New York, NY
Pioneered research on the firm's most important alternative data sets; defined firm-level methodology and standards for representative panel construction, bias correction, and cohort analysis. Recognized internally as the expert on alternative data; leads biweekly data roundtable since 2018.
2017 – 2021
Data Scientist
Citadel LLC, Chicago, IL → New York, NY
Architected and launched the Alternative Data Observatory, the hedge fund's one-stop-shop for alternative data signals, used weekly by >30% of investment professionals.
Aug 2015 and 2016
Research Intern
US Army Research Lab, Adelphi MD, USA
May 2016
Research Intern
Purdue University, West Lafayette, USA
May 2014 - Aug 2014
Research Intern
IBM Research, Dublin, Ireland
Nov 2009 - Jul 2012
Research Assistant
Adaptive Networks and Control Lab
Fudan University, Shanghai, China

Publications

Representing higher-order dependencies in networks

Science Advances

J. Xu, T.L. Wickramarathne, N.V. Chawla

In the top 5% of all research outputs scored by Altmetric, 97th percentile attention score compared to outputs of the same age, featured in 8 news outlets

Improving management of aquatic invasions by integrating shipping network, ecological, and environmental data: data mining for social good

KDD 2014

J. Xu, T.L. Wickramarathne, N.V. Chawla, E.K. Grey, K. Steinhaeuser, R.P. Keller, J.M. Drake, D.M. Lodge

Catching fire: an anatomy of information diffusion using Retweets

Northern Finance Association Conference 2014

Research in Behavioral Finance Conference 2014

10th Annual Central Bank Workshop on the Microstructure of Financial Markets 2014

N.V. Chawla, Z. Da, J. Xu, M. Ye

Human interactive patterns in temporal networks

IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans

Y.Q. Zhang, X. Li, J. Xu, A.V.Vasilakos

Mining Features Associated with Effective Tweets

International Conference on Advances in Social Networks Analysis and Mining (ASONAM) 2017

J. Xu, N.V. Chawla

Efficient modeling of higher-order dependencies in networks: from algorithm to application for anomaly detection

EPJ Data Science, 2020

M. Saebi, J. Xu, L.M. Kaplan, B. Ribeiro, N.V. Chawla

Network analysis of ballast-mediated species transfer reveals important introduction and dispersal patterns in the arctic

Scientific Reports, 2020

M. Saebi, J. Xu, S.R. Curasi, E.K. Grey, N.V. Chawla, D.M. Lodge

Higher-order patterns of aquatic species spread through the global shipping network

PLOS ONE, 2020

M. Saebi, J. Xu, E.K. Grey, D.M. Lodge, J.J. Corbett, N. Chawla

Structural Diversity and Homophily: A Study Across More Than One Hundred Big Networks

ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) 2017

J. Xu, N.V. Chawla

Skills

Programming

Python and SQL for data mining; C# for graphical interface; C and Common Lisp for high performance computing

Big Data

Snowflake, BigQuery, Vertica, and Spark for trillion-scale database queries; Airflow and Linux shell scripts for managing parallelized jobs on Amazon S3 and distributed systems.

Tools

Tableau and Gephi for visualization; Bloomberg Terminal for trading; ArcGIS for geographical information system; NetworkX for network analysis; JavaScript and Flask for web app development; Requests and Selenium for web scraping.

Education

2012 – 2017
Ph.D. in Computer Science and Engineering
University of Notre Dame, USA
GPA: 4.00/4.00
2008 - 2012
B.Sc. in Electronics Engineering
Fudan University, Shanghai, China
(Top 5 in China)

Honors & Awards

Jun 2017
Advanced Teaching Scholar Certificate
Notre Dame
Nov 2016
Striving for Excellence in Teaching Certificate
Notre Dame
Nov 2015
Outstanding Research Poster Award – Faculty Vote
Notre Dame
May 2013
1st Prize, Schurz Innovation Award on Data Mining
Schurz Communications Inc.
Jun 2012
Outstanding Bachelor Thesis Award
Fudan University
Best in the Department of Electronics Engineering
May 2011
National University Student of the Year in 2010
Chinese Ministry of Education
Only 10 students in China
2011
2nd Prize, National Undergraduate Electronic Design Contest, Shanghai site
As team leader
2010
Excellent Student in Fudan University
Fudan University
Top 5% in the university
2010
Top Ten Students in the School of Information Technology
Out of 1200 students in the school
2010
People's Scholarship
Top 10% in the department. Awarded twice.
2006
1st Prize, National Olympiad in Informatics in Provinces

Hobbies

Flute

Amateur Lv. 9 "Excellent"

Trivia

DIY projects

Instead of using smart speakers like Alexa and subscribe to podcasts, I built my own "radio alarm", which everyday at 8AM, it starts by telling my girlfriend how much I love her (with a different phrase everyday), plays our favorite music, reads the weather forecast, and reads the top financial and tech news from my RSS feeds with a TTS engine.

I also enjoy automating my home, from auto feeders for cats and fish, to the power switches behind my TV.

Higher-order Network
Interdisciplinary paper / Science Advances
HON visualization
Software package / IEEE PacificVis
Aquatic invasion
data mining
Data mining paper / KDD 2014
Tweet diffusion
Finance paper / Under review
Temporal motifs
Networks paper / IEEE SMCA
Effective Tweeting
Paper / Software
Leadership &
media coverage
Media coverage
Lecture Room 5023
Video series

Contact me



Curriculum Vitae

Let's keep in touch!