Computational methods for predictive genomics.
Heru develops the analytical methods behind Haeckel. Every analyzer is calibrated against reference samples, every prediction is backed by confidence intervals, and every method is validated before it ships.
Ancestry inference
MLE-EM with spatial thinning, bootstrap Wald tests, and in-silico AMR deconvolution. 26,000 independent AIMs across 62 sub-populations from 1000 Genomes and HGDP. Significance-gated: ancestry below the confidence threshold is returned as Unassigned.
Polygenic risk scoring
Six PRS methods in production: Basic PRS with palindromic skip, PRS-CS (Bayesian MCMC), C+T clumping, LDpred2 (four models), Lassosum (coordinate descent), SBayesR (four-component mixture). Ensemble weighting with R-hat convergence diagnostics.
Haplogroup resolution
Recursive phylogenetic traversal across 140 nodes with back-mutation handling. Bayesian confidence scoring against random expectation. Y-DNA and mtDNA trees annotated with historical figures and migration context.
Archaic introgression
S* statistic for deep coalescence detection, ABBA-BABA D-statistics with block jackknife standard errors, and a 4-state Viterbi HMM for introgression tract detection with coalescent dating. 53 Neanderthal and Denisovan markers.
Phenotype prediction
HIrisPlex-S models for eye, hair, and skin color. GIANT consortium height prediction. Per-SNP contribution tracking for explainability. 67% coverage threshold to prevent systematic bias from missing MC1R variants.
Pharmacogenomics & safety
Star allele calling for 12 CPIC Level-A genes (CYP2D6, CYP2C19, DPYD, SLCO1B1, HLA-B, and more). 10 FDA Black Box interactions surfaced as critical alerts. Activity score modeling for metabolizer classification.
Whole-genome offspring modeling
The whole-genome inheritance calculator combines polygenic scores with Mendelian predictions across traits. Partner compatibility scoring, offspring trait simulation, and embryo comparison across complex phenotypes.
Quality & validation
Data quality control on every upload: call rate, heterozygosity, Ti/Tv ratio, per-chromosome statistics. Every analyzer is validated against held-out 1000 Genomes and HGDP reference samples before it ships.
Validated before it ships.
Every analyzer is validated against held-out 1000 Genomes and HGDP reference samples before it ships. Confidence intervals, convergence diagnostics, and per-SNP contribution tracking are on by default, and calibration results are published openly inside the product. Where a method has limits, where data is missing, or where a population is underrepresented, we say so.
Open to collaboration.
If you work in genomics, bioinformatics, population genetics, or reproductive medicine and want to collaborate on methods, data, or validation, reach out.
Get in touch