Hayaiti Research

Internal R&D · cross-sectional equities

Quant TradingSoftwareInternal R&D — anonymized

Hayaiti Research · cross-sectional momentum harness

Stand up a reproducible backtest harness for a momentum strategy on equities.

Internal research project: a cross-sectional momentum strategy on a liquid US equity universe. The deliverable was less the strategy itself and more the harness — a reproducible, parameter-swept backtest framework that could be used for the next 50 ideas.

Source code is yoursRefund if late

Cross

● live

Throughput

794/s

p95

62ms

Errors

0.03%

Python-5956

33m ago

Pandas-5957

34m ago

NumPy-5958

35m ago

DuckDB-5959

36m ago

Industry: Quant Trading
Timeline: 3 weeks
Team: 2
Service: Software
Project tier: Internal R&D — anonymized

The Problem

What was broken.

Earlier strategy research had been scattered across notebooks with hardcoded paths and inconsistent universe definitions. Two researchers couldn't replicate each other's PnL even when running the 'same' code. We needed a harness, not another notebook.

Our Approach

How we framed it.

Built a small research framework: a universe loader (point-in-time, no survivorship bias), a feature library (zero-mean, point-in-time, no leakage), a portfolio constructor with explicit constraints, and a backtester that records every parameter sweep into MLflow. The momentum strategy itself became a 200-line config on top of that harness.

Capability proof

What this case demonstrates.

This case makes the hidden work visible: strategy, architecture, delivery control, quality evidence, and handoff.

01 / Product judgment

Problem framed before UI

02 / Technical depth

8 stack decisions

Python, Pandas, NumPy, DuckDB, Parquet, Polars

03 / Delivery discipline

3 delivery checkpoints

Universe + data lake / Feature library + harness / Sweep + writeup

04 / Handoff quality

5 shipped artifacts

Point-in-time universe loader + price adjustment pipeline / Feature library (momentum, vol-scaling, simple risk model) / Backtester with explicit constraint handling

Production artifacts

Inspect the work behind the visible result.

Each case exposes the surfaces, systems, evidence, and handoff package that make the shipped product usable after launch.

Experience layer

Buyer or user surface

UXResponsiveA11y

Production engine with backtest → paper → live progression, real-time PnL, and risk circuit breakers.

Proof 01

Stand up a reproducible backtest harness for a momentum strategy on equities.

Proof 02

Built point-in-time universe and price-adjustment pipeline. Stored as partitioned Parquet, queried via DuckDB.

Proof 03

MLflow experiment tracker with sweep helpers

Before / after · product UI mockup

Industry · Quant Trading

term://hayaiti-research/quant-cli

Before

$ python run_strategy.py --symbol SPY --window 20

2026-04-23 09:32:11 loaded 4 strategies from ./strats/

2026-04-23 09:32:11 fetching bars · SPY · 5m · 60d ... ok (4,392)

2026-04-23 09:32:12 computing momentum · window=20 ... ok

2026-04-23 09:32:12 computing vol target · 15% ann ... ok

2026-04-23 09:32:13 WARN signal_engine: missing data 2026-03-08 09:30..09:45

2026-04-23 09:32:13 generated 12 signals (long=8, short=4)

2026-04-23 09:32:14 backtest · sharpe=1.42 maxdd=-8.2% winrate=67%

2026-04-23 09:32:14 PnL summary:

date pnl cum_pnl trades win_rate

2026-04-21 +$1,204 +$42,118 3 100%

2026-04-22 -$487 +$41,631 2 50%

2026-04-23 +$2,107 +$43,738 4 75%

$ tail -f ./logs/strategy.log

14:32:18 signal SPY LONG 0.42 @ 478.12 size=200

14:32:18 order routed → IBKR · ack 142ms

14:32:19 fill SPY 200 @ 478.13 · slip 1bp

█

research.hayaiti.com/strategies

After

Hayaiti Research · Momentum-20

live · 142ms

Signals/hr

Win rate

67%

PnL · day

+$48,210

Sharpe

1.42

Cumulative PnL · 60d+$43,738

14:32:18SPY LONG 0.42 @ $478.13IBKR · ack

14:31:42QQQ SHORT 0.31 @ $412.91IBKR · ack

14:30:11SPY EXIT +$1,204 · slip 1bpIBKR · ack

14:28:54IWM LONG 0.28 @ $214.22IBKR · ack

Before:Strategy lived in a Jupyter notebook; entries triggered manually after end-of-day backtest.

After:Production engine with backtest → paper → live progression, real-time PnL, and risk circuit breakers.

How the engagement ran.

3steps · scroll →

01Week 1
Universe + data lake
Built point-in-time universe and price-adjustment pipeline. Stored as partitioned Parquet, queried via DuckDB.
02Week 2
Feature library + harness
Implemented signed momentum, vol-scaling, and a simple risk model. All features unit-tested against handpicked dates.
03Week 3
Sweep + writeup
Ran a parameter sweep over lookback, holding period, and universe size. Wrote the results into a 12-page internal memo with charts.

1
Week 1
Universe + data lake
Built point-in-time universe and price-adjustment pipeline. Stored as partitioned Parquet, queried via DuckDB.
2
Week 2
Feature library + harness
Implemented signed momentum, vol-scaling, and a simple risk model. All features unit-tested against handpicked dates.
3
Week 3
Sweep + writeup
Ran a parameter sweep over lookback, holding period, and universe size. Wrote the results into a 12-page internal memo with charts.

Deliverables

What we shipped.

✓Point-in-time universe loader + price adjustment pipeline
✓Feature library (momentum, vol-scaling, simple risk model)
✓Backtester with explicit constraint handling
✓MLflow experiment tracker with sweep helpers
✓12-page internal research memo with charts and caveats

Outcomes.

delivered outcomes

Plan: reproducible: same config + same data lake = same PnL, every time

Plan: point-in-time universe and features (no look-ahead bias)

Plan: mLflow-tracked sweeps with run hashes per config

Plan: duckDB query layer over Parquet — fast on a laptop, no infra needed

Plan: strategy config readable in 10 minutes by a new researcher

Honest challenges

What we got wrong (or almost wrong).

The pretty version of any case study skips this part. We don't.

01
Survivorship bias is the easy trap — caught one feature that quietly used today's universe to compute historical signals.
02
DuckDB query plans got expensive on naive joins; rewrote a couple of them and dropped sweep time by an order of magnitude.
03
MLflow's local SQLite backend was fine for one researcher and miserable for two — switched to a shared Postgres backend on day 14.

In our own words

Internal note: the harness mattered more than the strategy. We re-used it three times in the next two months and never had to argue about reproducibility again.

From the Hayaiti team

Engineering · design · security

Technical blueprint

How the work holds together.

Buyers should see that the visual layer is backed by architecture, quality gates, and operational ownership.

Experience

Interfaces, flows, states, accessibility.

Application

Business logic, API contracts, workflow orchestration.

Data

Schema design, storage, data movement, retention.

Operations

Deploy, monitoring, alerts, rollback, runbooks.

Security

Threat model, controls, evidence, remediation.

Stack used

8 technologies

PythonPandasNumPyDuckDBParquetPolarsJupyterMLflow (experiment tracking)

Other cases like this.

B2B SaaS

Northwind Studios

Onboarding rebuild · Series A SaaS

Cut signup-to-aha-moment from 9 minutes to under 90 seconds.

Fintech

Foundry Capital

Broker API + nightly reconciliation

Stand up a reliable broker integration with nightly reconciliation in three weeks.

Climate / ESG

Sentinel Operating

Scope 1-2-3 emissions dashboard

Produce defensible Scope 1-2-3 emissions reports without an army of consultants.

Want a case study like this?

Want this level of production quality on your project?

Send a short brief and we'll reply with scope, timeline, price direction, and the first technical recommendation.

Get a build plan See pricing

Source code is yoursPay 50/50Pause anytime25% refund if lateNDA on day 1

Hayaiti Research · cross-sectional momentum harness

Stand up a reproducible backtest harness for a momentum strategy on equities.

Source code is yoursRefund if late

How we framed it.

What we got wrong (or almost wrong).

The pretty version of any case study skips this part. We don't.

Survivorship bias is the easy trap — caught one feature that quietly used today's universe to compute historical signals.

DuckDB query plans got expensive on naive joins; rewrote a couple of them and dropped sweep time by an order of magnitude.

MLflow's local SQLite backend was fine for one researcher and miserable for two — switched to a shared Postgres backend on day 14.