Verified publication track

Publish claims with executable evidence.

Verified is an interactives track for papers whose core contribution can be tested. Authors submit scoped claims, runnable evidence, and a pinned environment; interactives returns a public record of what ran, what passed, and what remains out of scope.

Authorship is not the gate. The verification record is.

Pilot protocol

Verified submissions are being prepared; this page defines the track, not an open intake form.

verified.yamlclaim manifest
claim:
"sortpartial returns a sorted array under the stated comparison budget"
evidence:
tests/claim_01.py
env:
python:3.12-slim
network:
disabled
budget:
2 vCPU / 512 MB / 90 s
gate.execution:
passed
gate.faithfulness:
passed
record:
publish_with_verification
Execution: passedFaithfulness: passedRecord: attached

Contract

The badge is claim-scoped.

Verified does not certify an entire paper. It certifies named claims, checked under named conditions, with the evidence and limits attached.

publication.contractscope rules

01

Authorship-neutral

The author's identity, reputation, or writing process is not the gate. The declared claim is.

02

Claim-scoped

A Verified badge applies to specific claims, not to every sentence in the paper.

03

Evidence-run

Tests, fixtures, seeds, data references, and environment specs carry the evidence.

Protocol

What authors submit.What readers see.

The workflow turns a paper's testable contributions into a record a reader can inspect: claim, evidence, environment, run, verdict, limits.

verification.protocolclaim -> record
  1. 01

    Declare

    State one contribution as a specific, testable claim.

  2. 02

    Package

    Attach tests, fixtures, seeds, baselines, data references, and a pinned environment.

  3. 03

    Run

    Execute the package in a network-isolated CPU sandbox with fixed resource limits.

  4. 04

    Assess

    Check whether the evidence supports the stated claim, not merely whether the tests pass.

  5. 05

    Publish

    Attach the verification record to claims that pass both gates.

Two gates

Passing tests are not enough.

Gate 1 asks whether the evidence ran. Gate 2 asks whether that evidence is adequate for the claim. The second gate is where the badge earns its value.

gate 1.executionrun record
Gate 1Execution

Did the evidence run?

The submitted package runs in a pinned, deterministic, network-isolated environment. This gate produces the raw execution record: logs, exit status, resource use, and reproducibility signals.

Outputs

  • exit status
  • logs
  • resource use
  • re-run signal
gate 2.faithfulnessbadge value
Gate 2Faithfulness

Did it support the claim?

Passing tests are not enough. The faithfulness gate checks for tautological tests, cherry-picked fixtures, overfit answers, and manuscript claims that exceed the evidence.

Outputs

  • claim coverage
  • adequacy notes
  • scope limits
  • verdict

Verified result

The result is a record, not a seal.

Passing submissions are published with the claim, environment, outputs, verdict, and limits attached. Failed runs return logs and costs, but not the Verified publication outcome.

Claim C1paper excerpt

sortpartial returns a sorted array and stays under the stated comparison budget on the pinned workload described in C1.

tests/claim_01.pysandbox input
def test_matches_oracle():
    for seed in range(10_000):
        xs = random_ints(seed, n=1_000)
        assert sortpartial(xs) == sorted(xs)

def test_comparison_budget():
    ops = count_comparisons(sortpartial, n=100_000)
    assert ops <= 3 * 100_000 * log2(100_000)
verification.receiptpublic record
Paper
Sorting with partial comparisons
Claim
C1
Scope
C1 only; no paper-wide truth claim
Execution
Passed
Faithfulness
Passed
Environment
python:3.12-slim
Network
Disabled
Budget
2 vCPU / 512 MB / 90 s
Compute
0.8 CPU·min / $0.04
Record
Published with verification record

Scope and policy

Less, precisely.Not more, vaguely.

Verified starts with claims that machines can actually check. The same boundary applies commercially: authors pay for verification work, never for acceptance.

commercial.policycompute ledger

Billed

Sandbox execution, verification work, and metered compute.

Not sold

Acceptance, badge status, or publication outcome.

Public record

The claim, environment, run result, verdict, and limits.

claim.scopev1 boundary
In scope for v1
  • Deterministic algorithmic correctness
  • CPU-only benchmarks on pinned data
  • Exact output reproduction
  • Machine-checkable proofs or assertions
  • Reproducible data pipelines for figures and tables
Out of scope for v1
  • Human-subjects or field claims
  • Novelty claims, e.g. “first to...”
  • Qualitative or design contributions
  • Non-mechanizable theory
  • GPU-dependent or statistically nondeterministic ML claims

Executable publishing

Narrow claims.Explicit evidence.Public records.

verified.recordpublication outcome

For authors, Verified is a submission path for runnable evidence. For readers, it is a record of what was checked, how it ran, and what was not claimed.

Claim scopedEvidence executedRecord attached