-
Cross River
- New York
- cosentino.me
- @lcosent
Pinned Loading
-
agent-debate
agent-debate PublicMulti-agent advocate/skeptic/judge debate for any consequential decision. Hiring, product, policy, architecture, M&A. Surfaces uncalled risks.
Python
-
bio-bench
bio-bench PublicNarrow eval harness for LLMs on biological reasoning. Five task families, 80 examples, deterministic scoring.
Python
-
clinvar-classify
clinvar-classify PublicResearch probe: LLM-assisted clinical variant classification with retrieval bundles. 60 ClinVar variants, no training.
Python
-
life-scenarios
life-scenarios PublicScenario engine for major life-event decisions. Career breaks, relocations, education funding, large transactions. Numbers, risks, and recommended actions.
Python
-
prompt-eval-arena
prompt-eval-arena PublicRigorous A/B harness for prompt iteration. Both deterministic and LLM-as-judge scoring. Paired Wilcoxon, effect sizes, replicate-run variance.
Python
-
stablecoin-policy
stablecoin-policy PublicSelective-disclosure policy engine for stablecoin payments. Per-role views, predicate attestations, declarative policy language. TypeScript.
TypeScript
If the problem persists, check the GitHub status page or contact support.


