ReasonMath-15

ReasonMath-15 is an official BenchLocal Bench Pack for practical reasoning and math. It measures whether a model can solve grounded reasoning tasks, perform careful calculations, and avoid plausible but incorrect shortcuts.

A Bench Pack is an installable benchmark package that runs inside the BenchLocal desktop app. BenchLocal provides the shared app experience for provider setup, model selection, sampling controls, run histories, and side-by-side comparison across benchmark packs.

This repository contains the benchmark source: reasoning scenarios, scoring logic, methodology, a BenchLocal adapter, and a CLI runner for local development.
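As a rough illustration of what "scoring logic" for a grounded numeric task can look like, the sketch below checks a model reply against an expected value. Everything in it is an assumption made for illustration: the `NumericScenario` shape, the `extractFinalNumber` and `scoreReply` helpers, and the "last number in the reply is the final answer" rule are hypothetical and are not taken from this pack's `lib/` code. See METHODOLOGY.md for how scoring is actually defined.

```typescript
// Hypothetical scenario shape for illustration only; not the pack's real schema.
interface NumericScenario {
  id: string;
  prompt: string;
  expected: number;
  tolerance: number; // absolute tolerance allowed around `expected`
}

// Assumption: treat the last number in the reply as the model's final answer.
// Thousands separators are stripped so "1,250" parses as 1250.
function extractFinalNumber(reply: string): number | null {
  const matches = reply.replace(/,/g, "").match(/-?\d+(?:\.\d+)?/g);
  if (!matches) return null;
  return Number(matches[matches.length - 1]);
}

// A reply passes only if a number was found and it lies within tolerance.
function scoreReply(scenario: NumericScenario, reply: string): boolean {
  const answer = extractFinalNumber(reply);
  return answer !== null && Math.abs(answer - scenario.expected) <= scenario.tolerance;
}

// A classic shortcut trap: the intuitive answer (0.10) is wrong.
const batAndBall: NumericScenario = {
  id: "example-bat-ball",
  prompt: "A bat and a ball cost $1.10 in total. The bat costs $1.00 more than the ball. How much does the ball cost, in dollars?",
  expected: 0.05,
  tolerance: 0.001,
};

console.log(scoreReply(batAndBall, "The ball costs $0.10."));               // false
console.log(scoreReply(batAndBall, "Let x be the ball. 2x = 0.10, so $0.05.")); // true
```

A checker this simple misreads replies that end with an unrelated number, which is one reason a real benchmark usually asks for a fixed answer format.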

Run With BenchLocal

1. Download BenchLocal from the latest BenchLocal release.
2. Open BenchLocal and install ReasonMath-15 from the official Bench Pack registry.
3. Add one or more models, select ReasonMath-15, and start a run.

Bench Pack Structure

lib/                    Benchmark core, scoring, and model runtime
benchlocal/             Thin BenchLocal SDK adapter
cli/                    Non-UI runner
benchlocal.pack.json    Static install/discovery manifest
METHODOLOGY.md          Published benchmark methodology

BenchLocal Adapter

benchlocal/index.ts is the only place that imports @benchlocal/sdk.
lib/ stays framework-agnostic and is shared by the CLI and BenchLocal.
benchlocal.pack.json is the canonical Bench Pack metadata manifest used for install, inspection, and runtime metadata.
Per-pack default sampling belongs on the manifest, not on global host settings.
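The last point can be pictured as a small layering rule. The sketch below is a minimal, framework-agnostic illustration and does not use `@benchlocal/sdk` or the real manifest schema. The `SamplingParams` fields, the `HOST_FALLBACK` values, the `resolveSampling` helper, and the precedence order (host fallback, then pack defaults, then per-run overrides) are all assumptions made for this example.

```typescript
// Hypothetical sampling shape; the real manifest fields may differ.
interface SamplingParams {
  temperature: number;
  topP: number;
  maxTokens: number;
}

// Assumed host-wide fallback, used only for fields no other layer sets.
const HOST_FALLBACK: SamplingParams = { temperature: 0.7, topP: 1, maxTokens: 1024 };

// Merge layers left to right; later layers win. Undefined fields are skipped
// so a partially filled layer never erases a value set by an earlier one.
function resolveSampling(...layers: Partial<SamplingParams>[]): SamplingParams {
  const out: SamplingParams = { ...HOST_FALLBACK };
  for (const layer of layers) {
    for (const key of Object.keys(layer) as (keyof SamplingParams)[]) {
      const value = layer[key];
      if (value !== undefined) out[key] = value;
    }
  }
  return out;
}

// Pack defaults as they might be read from the manifest (values are made up).
const packDefaults: Partial<SamplingParams> = { temperature: 0, maxTokens: 2048 };

// A per-run override supplied by the user for a single field.
const resolved = resolveSampling(packDefaults, { maxTokens: 4096 });
console.log(resolved); // { temperature: 0, topP: 1, maxTokens: 4096 }
```

Keeping the pack's defaults in its own manifest means the same pack runs with the same baseline sampling whichever host settings happen to be active.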

Development

BenchLocal build: npm run build:benchlocal
CLI runner: npm run cli
Methodology: METHODOLOGY.md

Validation

npm run typecheck
npm run build:benchlocal
