Open source·Free·Fully reproducible

Bridging academic rigor and quantitative trading.

An independent research lab publishing, in the open, what actually works in systematic trading and what doesn't, tested at scale. Built to contribute to both worlds at once: rigorous enough for academics, honest enough for practitioners.

Everything here is free, forever. No paywalls, no sign-ups, nothing to sell. Every paper, model, and dataset is open and reproducible.

INITIALIZING WEBGL…

Open source

104,351

lines of research code

across 17 public repositories on GitHub, plus the Research Review and Topology toolkit codebases (release pending), counted 2026-06-18

  • Python87.9%· 91,756
  • Rust10.8%· 11,259
  • R0.8%· 882
  • Shell0.4%· 454

Open by default, fork it, cite it, build on it. Support the research

By language

Python · 87.9%Rust · 10.8%R · 0.8%Shell · 0.4%104,351LINES
Lines per project

Every public repository featured on this site, the paper reproducibility packages, the article companion code, the single-project repos, and the walk-forward backtesting framework, plus the self-contained toolkit core of the in-flight Topology program. Counts are source lines only.

  • Research-ReviewRelease pending28,753 lines

  • topology-toolkitRelease pending19,210 lines

    The self-contained core of the Topology of Strategy-Space program, the importable Python package (I/O, strategy-space, topology, evaluation, the baseline-model zoo, clustering, the CLI apps, the figure theme). Open-sources at program completion; the program's ~43k further lines of experiment scaffolding, figure scripts and tests aren't counted here.

  • Monte-Carlo-paper10,000 lines

    Reproducibility package for the Monte-Carlo filter-evaluation paper, analysis scripts (Python / Rust / R), corrected aggregate tables, and a self-contained reproducer of the floating-point summation-order pitfall documented in the May 2026 revision.

  • The walk-forward backtesting & robustness framework, the original Python build.

  • The Rust port of the backtesting framework, the production build.

  • Pre-registered study of parameter-surface stability, smooth ridges vs brittle spikes.

  • Empirical variance decomposition of in-sample → out-of-sample Sharpe degradation.

  • Geometry of strategy space, dimensionality and manifold structure.

  • Latent regime classification on market data, then regime-conditional strategy selection.

  • tail-evt403 lines

    Extreme-value analysis of cross-asset tail dependence, peaks-over-threshold with GPD fits.

  • Robust portfolio construction over a strategy universe.

  • Population-level statistics over a universe of algorithmic trading strategies.

  • strategy-rmt208 lines

    Random-matrix analysis of strategy correlation matrices, noise vs signal decomposition.

  • Reproducible synthetic demos behind the article “The signal is collective”.

  • hc-knockoffs184 lines

    Higher Criticism under dependence + Model-X knockoffs for FDR-controlled strategy selection.

  • strategy-tda181 lines

    Persistent homology of strategy space, topological data analysis on strategy populations.

  • Cross-asset correlation cube, pairwise correlations between asset return series.

Source lines only, READMEs, methodology notes, data files and configs aren't counted. 56,388 lines sit in the 17 public repos. Two further codebases open-source at program completion: the López-de-Prado Research Review pipelines (28,753 lines) and the Topology toolkit core (19,210 lines). The Topology program's ~43,000 further lines of experiment scaffolding, figure scripts and tests, and its Rust accelerator crates, are not counted here.

Working papers

Peer-facing research with full reproducibility packages. Three studies on the boundaries of statistical edge, tested at scale across asset classes.

SSRN 6955879 · cross-market study

The Reach of the World Cup Distraction Effect

The whole planet watches the World Cup, yet the global markets that carry most of the trading barely flinch. Across 11 instruments, 5 market structures, 26 countries and 7 World Cups, 158 statistical tests turn up nothing that survives correction, and the headline effects that seem to travel are artifacts of how they are measured.

Scope: 11 instruments, 5 market structures, 26 countries, 7 World Cups
158 statistical tests, 0 survive multiple-testing correction
A planet-scale attention shock the markets do not price
Effects that seem to 'travel' are measurement artifacts
Read the study
BrazilArgentinaFranceGermanySpainEnglandItalyNetherlandsPortugalCroatiaUruguayMexicoJapanSouth KoreaMoroccoUnited StatesBelgiumSwitzerlandPolandDenmarkAustraliaCanada
26 countries · 7 World Cups158 tests · 0 survive

Paper · SSRN 6636018

Does a popular statistical filter actually pick winners?

A simple question, tested at scale: does acting on a widely-used permutation test help you choose winning strategies ahead of time? The answer is no, and a famous result claiming it hurts turns out to be a rounding bug.

Scope: 437,911 strategies, 9 markets, 3 asset classes
Over 6 billion permutation tests run
The filter's edge: indistinguishable from zero
A well-known negative result traced to floating-point rounding
Read the study
0
bootstrap lift, centered on zero

Paper · SSRN 6858778 · Published

Can you profit from the smallest coins after real costs?

Can you make money in coins that only trade on decentralized exchanges, once realistic trading costs are paid? We tested it across 27 chains, with demos you can poke at in the browser.

Scope: 27 chains, DEX-only coins
Long-only, with realistic costs applied
Any raw edge is eaten by the cost of trading
Explore it yourself with in-browser demos
Read the study
cost drag
27 chains, long-only net of costs

Reproducible essays

Long-form articles

All articles

Interactive, figure-driven write-ups of the central empirical claims, each reproducible from a public data table.

01 · Article · Flagship

Edge is in the Process

How we turned an unprofitable pile of half a million strategies into a portfolio that actually makes money, without touching the strategies themselves. The whole trick is which ones you choose to keep.

Scope: 533,638 strategies across 12 markets
Held all together: the pool loses money
Selected by past performance: profitable on 8 of 12
The edge is in the selection, not the strategy
Read the article
break-even
pool loses, selection clears break-even

02 · Article · Pre-registered

The partition does the work

A smarter way to build a portfolio out of a pool of strategies, held to a strict statistical bar and pitted against the standard benchmarks. It beats all of them.

Scope: 26 markets, ~1.24 million strategies
Beats the usual 'rank by Sharpe' approach
Clears a pre-registered significance bar
Standard benchmarks left far behind
Read the article
FWER 5%PartitionRank-SharpeHRPEqual-wt
partition beats rank-by-Sharpe (Holm 5%)

03 · Article · The corpus

What “1.6 million strategies” actually means

‘1.6 million strategies’ sounds like data-mining. Here is why it is not, shown with the actual numbers behind the corpus.

Scope: 1.6 million strategies across 30 markets
The strategies are nearly independent of each other
Only ~3% are closely correlated
Every result is walk-forward, never in-sample
Read the article
01|ρ|=0.5mean 0.123
pairwise |ρ|, peaked near zero

Open-source models

Twelve reference implementations of the lab's M-models, random matrix theory, Higher Criticism + knockoffs, topological data analysis, tail-EVT, each with an explainer, a live demo, and code.

λ₊
(a) eigenspectrum · MP null
M/01

Eigenspectrum of the strategy correlation matrix

Marchenko–Pastur and parallel-analysis eigenspectrum of strategy correlation matrices. Reference implementation of the firm's M/01 model.

τ̂
W = |Z| − |Z̃| · FDR threshold τ
M/02

Sparse signal detection with FDR control

Higher Criticism plus Model-X knockoffs for FDR-controlled strategy selection. Reference implementation of the firm's M/02 model.

ε=0ε=1
H₀ persistence barcode
M/03

Persistence barcodes on strategy structure

H0 persistence barcode under correlation-distance Vietoris–Rips on strategy populations. Reference implementation of the firm's M/03 model.

u
POT-GPD tail · log P(X > x)
M/04

Peaks-over-threshold and pairwise tail-coupling

Peaks-over-threshold GPD fits and pairwise tail-coupling χ on cross-asset returns. Reference implementation of the firm's M/04 model.

OOSISsel. biasparam σskill ≈ 0
IS−OOS gap · variance decomposition
M/05

Decomposing the IS-OOS Sharpe gap

Variance decomposition of the in-sample / out-of-sample Sharpe gap into selection bias, parameter-choice noise, and residual skill across 10 deep-WFO crypto assets.

ATRMACD
smooth basin · spiky peak
M/09

Does in-sample smoothness predict out-of-sample skill?

Pre-registered empirical test of whether in-sample Sharpe-surface smoothness under a fixed five-perturbation suite predicts out-of-sample skill across SOL / DOGE / BTC walk-forward partitions.

2-D embedding · robust islands vs fragile mass
M/06

PCA + UMAP geometry of the strategy population

PCA + UMAP embedding of large strategy populations from a 90-feature metric vector, with connectivity-based separation of robust vs fragile strategies.

K=4 HMM regimes · posterior γ_t
M/07

Hidden-Markov regime segmentation

Gaussian HMM regime segmentation on (logret, volatility, trend) features with K selected by BIC; cross-asset 4-state preference across crypto majors.

σ_oos2549kLWHuber
OOS vol · universe-saturation curve
M/08

Universe-saturation of minimum-variance portfolios

Universe-saturation analysis for minimum-variance portfolios drawn from large strategy pools, comparing Ledoit–Wolf shrinkage, Huber-style robust, and sample covariance estimators.

LOADING CORPUS STATISTICS…

Public output to date

3
Working papers
on SSRN: World Cup markets, permutation testing, DEX tradeability
3
Long-form articles
interactive sims and figures, fully reproducible
12
Open-source models
Python · Rust · R
23M+
Strategy-windows backtested
full corpus, all 30 partitions
30
Asset / timeframe combos
crypto majors, large, mid-cap, FX
5+
Years coding systematic strategies
Python · Rust · R · TypeScript

The author

Daniel Gatto

  • Independent researcher
  • Economic Sciences candidate, UNIP
  • Quantitative Systems Consultant (NDA)

Daru Finance is the public output of one researcher. The work here is what gets done after-hours, in code, with receipts.

I write systematic-trading research with a strong reproducibility bias, Python for analysis, Rust for compute, R for verification. My current work focuses on the statistical structure of strategy populations: random-matrix bounds on correlation eigenspectra, Higher-Criticism + knockoffs for FDR-controlled selection, topological indicators for cluster stability, and extreme-value theory for tail co-movement. The throughline is that the unit of analysis is the population, not the individual strategy. From the research and the twelve open-source models down to this site itself, design, code, charts, every word, it’s all a one-person build.

Read the full bio

Background, in numbers

9+ yrs
In crypto markets
since 2017
6+ yrs
Quantitative research
systematic methods
5+
Coding languages
Python · Rust · R · TS · C++
8k+
Following on X
@DaruTrading
2K+
SSRN downloads
working papers, to date

Get in touch

Replications, corrections, methodological questions.

Academic correspondence is always welcome, replications, corrections, methodology questions.

Daru Finance: Independent quantitative research