Open source·Free·Fully reproducible

Bridging academic rigor and quantitative trading.

An independent research lab publishing, in the open, what actually works in systematic trading and what doesn't, tested at scale. Built to contribute to both worlds at once: rigorous enough for academics, honest enough for practitioners.

Everything here is free, forever. No paywalls, no sign-ups, nothing to sell. Every paper, model, and dataset is open and reproducible.

Read the papers Long-form articles Browse the lab

INITIALIZING WEBGL…

Open source

104,351

lines of research code

across 17 public repositories on GitHub, plus the Research Review and Topology toolkit codebases (release pending), counted 2026-06-18

Python87.9%· 91,756
Rust10.8%· 11,259
R0.8%· 882
Shell0.4%· 454

Open by default, fork it, cite it, build on it. Support the research

By language

Lines per project

Every public repository featured on this site, the paper reproducibility packages, the article companion code, the single-project repos, and the walk-forward backtesting framework, plus the self-contained toolkit core of the in-flight Topology program. Counts are source lines only.

Research-ReviewRelease pending28,753 lines
topology-toolkitRelease pending19,210 lines
The self-contained core of the Topology of Strategy-Space program, the importable Python package (I/O, strategy-space, topology, evaluation, the baseline-model zoo, clustering, the CLI apps, the figure theme). Open-sources at program completion; the program's ~43k further lines of experiment scaffolding, figure scripts and tests aren't counted here.
dex-tradeability-study18,908 lines
Monte-Carlo-paper10,000 lines
Reproducibility package for the Monte-Carlo filter-evaluation paper, analysis scripts (Python / Rust / R), corrected aggregate tables, and a self-contained reproducer of the floating-point summation-order pitfall documented in the May 2026 revision.
worldcup-attention-markets9,941 lines
quant-research-framework4,697 lines
The walk-forward backtesting & robustness framework, the original Python build.
quant-research-framework-rs8,114 lines
The Rust port of the backtesting framework, the production build.
quant-surface-stability1,180 lines
Pre-registered study of parameter-surface stability, smooth ridges vs brittle spikes.
strategy-overfitting662 lines
Empirical variance decomposition of in-sample → out-of-sample Sharpe degradation.
strategy-manifold460 lines
Geometry of strategy space, dimensionality and manifold structure.
strategy-regime427 lines
Latent regime classification on market data, then regime-conditional strategy selection.
tail-evt403 lines
Extreme-value analysis of cross-asset tail dependence, peaks-over-threshold with GPD fits.
strategy-robust-portfolio396 lines
Robust portfolio construction over a strategy universe.
strategy-stats246 lines
Population-level statistics over a universe of algorithmic trading strategies.
strategy-rmt208 lines
Random-matrix analysis of strategy correlation matrices, noise vs signal decomposition.
signal-is-collective204 lines
Reproducible synthetic demos behind the article “The signal is collective”.
hc-knockoffs184 lines
Higher Criticism under dependence + Model-X knockoffs for FDR-controlled strategy selection.
strategy-tda181 lines
Persistent homology of strategy space, topological data analysis on strategy populations.
strategy-corrcube177 lines
Cross-asset correlation cube, pairwise correlations between asset return series.

Source lines only, READMEs, methodology notes, data files and configs aren't counted. 56,388 lines sit in the 17 public repos. Two further codebases open-source at program completion: the López-de-Prado Research Review pipelines (28,753 lines) and the Topology toolkit core (19,210 lines). The Topology program's ~43,000 further lines of experiment scaffolding, figure scripts and tests, and its Rust accelerator crates, are not counted here.

Working papers

Papers

All research

Peer-facing research with full reproducibility packages. Three studies on the boundaries of statistical edge, tested at scale across asset classes.

SSRN 6955879 · cross-market study

The Reach of the World Cup Distraction Effect

The whole planet watches the World Cup, yet the global markets that carry most of the trading barely flinch. Across 11 instruments, 5 market structures, 26 countries and 7 World Cups, 158 statistical tests turn up nothing that survives correction, and the headline effects that seem to travel are artifacts of how they are measured.

Scope: 11 instruments, 5 market structures, 26 countries, 7 World Cups

158 statistical tests, 0 survive multiple-testing correction

A planet-scale attention shock the markets do not price

Effects that seem to 'travel' are measurement artifacts

Read the study

26 countries · 7 World Cups158 tests · 0 survive

Paper · SSRN 6636018

Does a popular statistical filter actually pick winners?

A simple question, tested at scale: does acting on a widely-used permutation test help you choose winning strategies ahead of time? The answer is no, and a famous result claiming it hurts turns out to be a rounding bug.

Scope: 437,911 strategies, 9 markets, 3 asset classes

Over 6 billion permutation tests run

The filter's edge: indistinguishable from zero

A well-known negative result traced to floating-point rounding

Read the study

bootstrap lift, centered on zero

Paper · SSRN 6858778 · Published

Can you profit from the smallest coins after real costs?

Can you make money in coins that only trade on decentralized exchanges, once realistic trading costs are paid? We tested it across 27 chains, with demos you can poke at in the browser.

Scope: 27 chains, DEX-only coins

Long-only, with realistic costs applied

Any raw edge is eaten by the cost of trading

Explore it yourself with in-browser demos

Read the study

27 chains, long-only net of costs

Reproducible essays

Long-form articles

All articles

Interactive, figure-driven write-ups of the central empirical claims, each reproducible from a public data table.

01 · Article · Flagship

Edge is in the Process

How we turned an unprofitable pile of half a million strategies into a portfolio that actually makes money, without touching the strategies themselves. The whole trick is which ones you choose to keep.

Scope: 533,638 strategies across 12 markets

Held all together: the pool loses money

Selected by past performance: profitable on 8 of 12

The edge is in the selection, not the strategy

Read the article

pool loses, selection clears break-even

02 · Article · Pre-registered

The partition does the work

A smarter way to build a portfolio out of a pool of strategies, held to a strict statistical bar and pitted against the standard benchmarks. It beats all of them.

Scope: 26 markets, ~1.24 million strategies

Beats the usual 'rank by Sharpe' approach

Clears a pre-registered significance bar

Standard benchmarks left far behind

Read the article

partition beats rank-by-Sharpe (Holm 5%)

03 · Article · The corpus

What “1.6 million strategies” actually means

‘1.6 million strategies’ sounds like data-mining. Here is why it is not, shown with the actual numbers behind the corpus.

Scope: 1.6 million strategies across 30 markets

The strategies are nearly independent of each other

Only ~3% are closely correlated

Every result is walk-forward, never in-sample

Read the article

pairwise |ρ|, peaked near zero

Open-source models

Projects

Open the lab

Twelve reference implementations of the lab's M-models, random matrix theory, Higher Criticism + knockoffs, topological data analysis, tail-EVT, each with an explainer, a live demo, and code.

(a) eigenspectrum · MP null

M/01

Eigenspectrum of the strategy correlation matrix

Marchenko–Pastur and parallel-analysis eigenspectrum of strategy correlation matrices. Reference implementation of the firm's M/01 model.

W = |Z| − |Z̃| · FDR threshold τ

M/02

Sparse signal detection with FDR control

Higher Criticism plus Model-X knockoffs for FDR-controlled strategy selection. Reference implementation of the firm's M/02 model.

H₀ persistence barcode

M/03

Persistence barcodes on strategy structure

H0 persistence barcode under correlation-distance Vietoris–Rips on strategy populations. Reference implementation of the firm's M/03 model.

POT-GPD tail · log P(X > x)

M/04

Peaks-over-threshold and pairwise tail-coupling

Peaks-over-threshold GPD fits and pairwise tail-coupling χ on cross-asset returns. Reference implementation of the firm's M/04 model.

IS−OOS gap · variance decomposition

M/05

Decomposing the IS-OOS Sharpe gap

Variance decomposition of the in-sample / out-of-sample Sharpe gap into selection bias, parameter-choice noise, and residual skill across 10 deep-WFO crypto assets.

smooth basin · spiky peak

M/09

Does in-sample smoothness predict out-of-sample skill?

Pre-registered empirical test of whether in-sample Sharpe-surface smoothness under a fixed five-perturbation suite predicts out-of-sample skill across SOL / DOGE / BTC walk-forward partitions.

2-D embedding · robust islands vs fragile mass

M/06

PCA + UMAP geometry of the strategy population

PCA + UMAP embedding of large strategy populations from a 90-feature metric vector, with connectivity-based separation of robust vs fragile strategies.

K=4 HMM regimes · posterior γ_t

M/07

Hidden-Markov regime segmentation

Gaussian HMM regime segmentation on (logret, volatility, trend) features with K selected by BIC; cross-asset 4-state preference across crypto majors.

OOS vol · universe-saturation curve

M/08

Universe-saturation of minimum-variance portfolios

Universe-saturation analysis for minimum-variance portfolios drawn from large strategy pools, comparing Ledoit–Wolf shrinkage, Huber-style robust, and sample covariance estimators.

LOADING CORPUS STATISTICS…

Public output to date

Working papers

on SSRN: World Cup markets, permutation testing, DEX tradeability

Long-form articles

interactive sims and figures, fully reproducible

Open-source models

Python · Rust · R

23M+

Strategy-windows backtested

full corpus, all 30 partitions

Asset / timeframe combos

crypto majors, large, mid-cap, FX

Years coding systematic strategies

Python · Rust · R · TypeScript

The author

Daniel Gatto

Independent researcher
Economic Sciences candidate, UNIP
Quantitative Systems Consultant (NDA)

Daru Finance is the public output of one researcher. The work here is what gets done after-hours, in code, with receipts.

I write systematic-trading research with a strong reproducibility bias, Python for analysis, Rust for compute, R for verification. My current work focuses on the statistical structure of strategy populations: random-matrix bounds on correlation eigenspectra, Higher-Criticism + knockoffs for FDR-controlled selection, topological indicators for cluster stability, and extreme-value theory for tail co-movement. The throughline is that the unit of analysis is the population, not the individual strategy. From the research and the twelve open-source models down to this site itself, design, code, charts, every word, it’s all a one-person build.

Read the full bio

Background, in numbers

9+ yrs

In crypto markets

since 2017

6+ yrs

Quantitative research

systematic methods

Coding languages

Python · Rust · R · TS · C++

8k+

Following on X