Verified publication track

Publish claims with executable evidence.

Verified is an interactives track for papers whose core contribution can be tested. Authors submit scoped claims, runnable evidence, and a pinned environment; interactives returns a public record of what ran, what passed, and what remains out of scope.

Authorship is not the gate. The verification record is.


Pilot protocol

Verified submissions are being prepared. This page defines the track; it is not an open intake form.

verified.yaml (claim manifest)

claim: "sortpartial returns a sorted array under the stated comparison budget"
evidence: tests/claim_01.py
env: python:3.12-slim
network: disabled
budget: 2 vCPU / 512 MB / 90 s
gate.execution: passed
gate.faithfulness: passed
record: publish_with_verification

Execution: passed
Faithfulness: passed
Record: attached
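A manifest like the one above could be checked for completeness before any run is scheduled. The following is a minimal sketch, assuming the manifest has already been parsed into a dictionary. The function name `missing_fields` and the choice of required keys are illustrative assumptions, not part of the Verified protocol.

```python
# Hypothetical pre-flight check for a parsed claim manifest.
# Key names mirror the example manifest; treating these five as
# required is an assumption made for illustration.
REQUIRED_KEYS = ("claim", "evidence", "env", "network", "budget")


def missing_fields(manifest: dict) -> list[str]:
    """Return required keys that are absent or blank, in a stable order."""
    missing = []
    for key in REQUIRED_KEYS:
        value = manifest.get(key)
        if value is None or not str(value).strip():
            missing.append(key)
    return missing


example = {
    "claim": "sortpartial returns a sorted array under the stated comparison budget",
    "evidence": "tests/claim_01.py",
    "env": "python:3.12-slim",
    "network": "disabled",
    "budget": "2 vCPU / 512 MB / 90 s",
}

print(missing_fields(example))         # []
print(missing_fields({"claim": "x"}))  # ['evidence', 'env', 'network', 'budget']
```

The gate and record fields are left out of the required set because, in this sketch, they are assumed to be written by the verification run rather than by the author.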

Contract

The badge is claim-scoped.

Verified does not certify an entire paper. It certifies named claims, checked under named conditions, with the evidence and limits attached.

publication.contract (scope rules)

Authorship-neutral

The author's identity, reputation, or writing process is not the gate. The declared claim is.

Claim-scoped

A Verified badge applies to specific claims, not to every sentence in the paper.

Evidence-run

Tests, fixtures, seeds, data references, and environment specs carry the evidence.

Protocol

What authors submit. What readers see.

The workflow turns a paper's testable contributions into a record a reader can inspect: claim, evidence, environment, run, verdict, limits.

verification.protocol (claim -> record)

01. Declare: State one contribution as a specific, testable claim.
02. Package: Attach tests, fixtures, seeds, baselines, data references, and a pinned environment.
03. Run: Execute the package in a network-isolated CPU sandbox with fixed resource limits.
04. Assess: Check whether the evidence supports the stated claim, not merely whether the tests pass.
05. Publish: Attach the verification record to claims that pass both gates.
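The Run, Assess, and Publish steps can be read as a small decision procedure. The sketch below is one possible reading, not the Verified implementation: `Record` and `verify` are hypothetical names, the two callables stand in for the sandbox run and the faithfulness assessment, and skipping the assessment after a failed run is an assumption.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Record:
    claim: str
    execution: str     # "passed" or "failed"
    faithfulness: str  # "passed", "failed", or "not assessed"
    published: bool


def verify(claim: str, run: Callable[[], bool], assess: Callable[[], bool]) -> Record:
    """Apply Run -> Assess -> Publish to one declared claim.

    `run` and `assess` are caller-supplied stand-ins that return True on success.
    """
    if not run():
        # Assumption: a failed run is not assessed for faithfulness.
        return Record(claim, "failed", "not assessed", published=False)
    faithful = assess()
    # The record is attached only when both gates pass.
    return Record(claim, "passed", "passed" if faithful else "failed", published=faithful)


print(verify("C1", run=lambda: True, assess=lambda: True).published)   # True
print(verify("C1", run=lambda: True, assess=lambda: False).published)  # False
```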

Two gates

Passing tests are not enough.

Gate 1 asks whether the evidence ran. Gate 2 asks whether that evidence is adequate for the claim. The second gate is where the badge earns its value.

gate 1.execution (run record)

Gate 1: Execution

Did the evidence run?

The submitted package runs in a pinned, deterministic, network-isolated environment. This gate produces the raw execution record: logs, exit status, resource use, and reproducibility signals.

Outputs

exit status
logs
resource use
re-run signal
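One way the four outputs above could be assembled is sketched below. It is a sketch under stated assumptions, not the Verified sandbox: `run_once` and `execution_record` are hypothetical names, resource use is reduced to wall-clock time, and the re-run signal is approximated by comparing log digests across two runs. A plain subprocess provides neither network isolation nor CPU or memory limits; those would need a container or a similar runtime.

```python
import hashlib
import subprocess
import sys
import time


def run_once(cmd, timeout_s):
    """Run a command once and collect exit status, logs, and wall time."""
    start = time.perf_counter()
    try:
        proc = subprocess.run(cmd, capture_output=True, text=True, timeout=timeout_s)
        exit_status, logs = proc.returncode, proc.stdout + proc.stderr
    except subprocess.TimeoutExpired:
        # No exit status is available when the time limit is hit.
        exit_status, logs = None, "timed out"
    return {
        "exit_status": exit_status,
        "logs": logs,
        "wall_seconds": time.perf_counter() - start,
        "digest": hashlib.sha256(logs.encode()).hexdigest(),
    }


def execution_record(cmd, timeout_s=90):
    """Run the command twice; report whether both runs logged the same bytes."""
    first = run_once(cmd, timeout_s)
    second = run_once(cmd, timeout_s)
    first["rerun_matches"] = first["digest"] == second["digest"]
    return first


record = execution_record([sys.executable, "-c", "print('ok')"])
print(record["exit_status"], record["rerun_matches"])
```

Comparing digests treats any change in the logs as a reproducibility failure, so evidence that prints timestamps or durations would need those lines filtered out first.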

gate 2.faithfulness (badge value)

Gate 2: Faithfulness

Did it support the claim?

The faithfulness gate checks for tautological tests, cherry-picked fixtures, overfit answers, and manuscript claims that exceed the evidence.

Outputs

claim coverage
adequacy notes
scope limits
verdict
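The source does not say how tautological tests are detected. One plausible probe, sketched below, runs each test against deliberately broken implementations and flags any test that still passes. All names here (`passes`, `is_tautological`, the probe functions) are hypothetical, and the technique is a simple mutation-style check, not the Verified faithfulness gate.

```python
def passes(test, impl):
    """Return True if test(impl) raises no AssertionError."""
    try:
        test(impl)
    except AssertionError:
        return False
    return True


def is_tautological(test, broken_impls):
    """Flag a test that every deliberately broken implementation still passes."""
    return all(passes(test, impl) for impl in broken_impls)


# Two deliberately wrong "sorts" used as probes.
def identity(xs):
    return list(xs)


def reverse_sorted(xs):
    return sorted(xs, reverse=True)


def weak_test(sort):
    # Checks only the length, so it cannot tell a sort from a non-sort.
    assert len(sort([3, 1, 2])) == 3


def oracle_test(sort):
    # Compares against a trusted oracle, as the evidence file does.
    assert sort([3, 1, 2]) == sorted([3, 1, 2])


probes = [identity, reverse_sorted]
print(is_tautological(weak_test, probes))    # True
print(is_tautological(oracle_test, probes))  # False
```

A probe set this small catches only the crudest cases. It says nothing about cherry-picked fixtures or claims that exceed the evidence, which need a reviewer's judgment.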

Verified result

The result is a record, not a seal.

Passing submissions are published with the claim, environment, outputs, verdict, and limits attached. Failed runs still return their logs and costs, but they do not receive the Verified publication outcome.

Claim C1 (paper excerpt)

sortpartial returns a sorted array and stays under the stated comparison budget on the pinned workload described in C1.

tests/claim_01.py (sandbox input)

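from math import log2

# sortpartial, random_ints, and count_comparisons are not defined in this
# excerpt; they are assumed to come from the submitted package.

def test_matches_oracle():
    # Compare against Python's built-in sort on 10,000 seeded inputs.
    for seed in range(10_000):
        xs = random_ints(seed, n=1_000)
        assert sortpartial(xs) == sorted(xs)

def test_comparison_budget():
    # Budget: at most 3 * n * log2(n) comparisons at n = 100,000.
    n = 100_000
    ops = count_comparisons(sortpartial, n=n)
    assert ops <= 3 * n * log2(n)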
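The test file above calls `random_ints` and `count_comparisons` without showing them. The sketch below is one way such helpers might look, matching the call signatures used in the tests. The bodies are assumptions: the value range, the default seed, and the counting wrapper are illustrative, and Python's built-in `sorted` stands in for `sortpartial`, which the source does not define.

```python
import functools
import random
from math import log2


def random_ints(seed, n):
    """Deterministic list of n integers for a given seed."""
    rng = random.Random(seed)
    return [rng.randrange(1_000_000) for _ in range(n)]


def count_comparisons(sort_fn, n, seed=0):
    """Count the less-than comparisons sort_fn makes on a seeded input of length n."""
    counter = 0

    @functools.total_ordering
    class Counted:
        def __init__(self, value):
            self.value = value

        def __eq__(self, other):
            return self.value == other.value

        def __lt__(self, other):
            nonlocal counter
            counter += 1  # every ordering comparison is routed through __lt__
            return self.value < other.value

    sort_fn([Counted(x) for x in random_ints(seed, n)])
    return counter


# Stand-in usage: the built-in sort at a small n, against the same budget formula.
n = 1_000
ops = count_comparisons(sorted, n=n)
print(ops <= 3 * n * log2(n))  # True
```

Wrapping the elements rather than instrumenting the algorithm keeps the count independent of how the sort is implemented, at the cost of counting only comparisons that go through the wrapper.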

verification.receipt (public record)

Paper: Sorting with partial comparisons
Claim: C1
Scope: C1 only; no paper-wide truth claim
Execution: Passed
Faithfulness: Passed
Environment: python:3.12-slim
Network: Disabled
Budget: 2 vCPU / 512 MB / 90 s
Compute: 0.8 CPU·min / $0.04
Record: Published with verification record
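The Budget and Compute lines in the receipt can be cross-checked with simple arithmetic. The sketch below assumes that compute is billed linearly per CPU-minute. The receipt does not state a rate, so the implied rate and the maximum charge are derived figures under that assumption, not published prices.

```python
# Figures copied from the receipt above.
VCPUS = 2
WALL_LIMIT_S = 90
USED_CPU_MIN = 0.8
CHARGED_USD = 0.04

# Ceiling: every vCPU busy for the full wall-clock limit.
ceiling_cpu_min = VCPUS * WALL_LIMIT_S / 60

# Assumption: linear billing, so rate = charge / usage.
implied_rate = CHARGED_USD / USED_CPU_MIN
max_charge = ceiling_cpu_min * implied_rate

print(f"ceiling: {ceiling_cpu_min:.1f} CPU-min")      # ceiling: 3.0 CPU-min
print(f"implied rate: ${implied_rate:.2f}/CPU-min")   # implied rate: $0.05/CPU-min
print(f"max charge at ceiling: ${max_charge:.2f}")    # max charge at ceiling: $0.15
print(USED_CPU_MIN <= ceiling_cpu_min)                # True
```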

Scope and policy

Less, precisely. Not more, vaguely.

Verified starts with claims that machines can actually check. The same boundary applies commercially: authors pay for verification work, never for acceptance.

commercial.policy (compute ledger)

Billed

Sandbox execution, verification work, and metered compute.

Not sold

Acceptance, badge status, or publication outcome.

Public record

The claim, environment, run result, verdict, and limits.

claim.scope (v1 boundary)

In scope for v1

Deterministic algorithmic correctness
CPU-only benchmarks on pinned data
Exact output reproduction
Machine-checkable proofs or assertions
Reproducible data pipelines for figures and tables

Out of scope for v1

Human-subjects or field claims
Novelty claims, e.g. “first to...”
Qualitative or design contributions
Non-mechanizable theory
GPU-dependent or statistically nondeterministic ML claims

Executable publishing

Narrow claims. Explicit evidence. Public records.

verified.record (publication outcome)

For authors, Verified is a submission path for runnable evidence. For readers, it is a record of what was checked, how it ran, and what was not claimed.

Claim scoped
Evidence executed
Record attached


!nteractives