Glossary

Terms used across CLI, HTTP, Python, and methodology docs.

Scores

| Term | Meaning |
| --- | --- |
| Overall disclosure risk score | Weighted headline score (0–100) from nine deterministic components. Higher = more disclosure risk / deterioration. See Understanding Scores for the scale. |
| Component score | One of nine headline-weighted blended signals (e.g. risk_factor_intensity_score, liquidity_stress_score). See Score Catalog. |

Component scores

Nine deterministic components feed the headline overall_disclosure_risk_score (weights in methodology/aggregation). specificity_quality_score is also returned but is excluded from the headline weights. Its direction is inverted relative to the risk components (higher = better); see the score scale in Understanding Scores.

| Plain English | JSON field | Primary section(s) |
| --- | --- | --- |
| Risk-factor tone & volatility | risk_factor_intensity_score | Item 1A |
| Year-over-year disclosure change | disclosure_change_score | Item 1A, MD&A |
| MD&A uncertainty & demand stress | mdna_uncertainty_score | Item 7 (10-K) / Item 2 (10-Q) |
| Legal & regulatory risk language | legal_regulatory_risk_score | Item 1A (+ flags) |
| Liquidity & covenant stress | liquidity_stress_score | MD&A (+ flags) |
| Boilerplate & vague risk language | boilerplate_risk_score | Item 1A |
| Internal controls weakness signals | internal_controls_risk_score | Controls disclosure + Item 1A |
| Material event severity (diff-only) | event_severity_score | Item 1A |
| Cross-section negative tone | tone_negativity_score | Item 1A + MD&A |
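The weighted headline aggregation can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the product's implementation: the equal weights are placeholders (the real weights are documented in methodology/aggregation), and renormalizing over the non-null components is an assumed rule for handling missing components. The names headline_score and PLACEHOLDER_WEIGHTS are hypothetical.

```python
from typing import Dict, Optional

# The nine headline components, using the JSON field names listed above.
HEADLINE_COMPONENTS = (
    "risk_factor_intensity_score",
    "disclosure_change_score",
    "mdna_uncertainty_score",
    "legal_regulatory_risk_score",
    "liquidity_stress_score",
    "boilerplate_risk_score",
    "internal_controls_risk_score",
    "event_severity_score",
    "tone_negativity_score",
)

# ASSUMPTION: equal placeholder weights, NOT the documented methodology weights.
PLACEHOLDER_WEIGHTS: Dict[str, float] = {
    name: 1.0 / len(HEADLINE_COMPONENTS) for name in HEADLINE_COMPONENTS
}


def headline_score(
    components: Dict[str, Optional[float]],
    weights: Dict[str, float] = PLACEHOLDER_WEIGHTS,
) -> Optional[float]:
    """Weighted mean of the non-null headline components.

    ASSUMPTION: missing (None) components are skipped and the remaining
    weights are renormalized. Keys outside HEADLINE_COMPONENTS, such as
    specificity_quality_score, are ignored.
    """
    available = {
        name: components[name]
        for name in HEADLINE_COMPONENTS
        if components.get(name) is not None
    }
    if not available:
        return None
    total_weight = sum(weights[name] for name in available)
    return sum(weights[name] * value for name, value in available.items()) / total_weight
```

Passing the weights as a parameter keeps the sketch usable with the documented weights once they are looked up; specificity_quality_score can be present in the input without affecting the result.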

Scores (continued)

| Term | Meaning |
| --- | --- |
| Specificity quality score | Higher = better specificity (inverse of most other components). |
| Score coverage ratio | Fraction of headline components that were computed (non-null). Lower when required sections are missing. |
| Confidence score | 0–1 blend of extraction quality, diff confidence, and coverage. |
| Missing components | Component names that could not be computed (usually missing sections or no prior filing). |
| Provenance | Per-component breakdown of inputs used in aggregation (include=provenance on HTTP matrix). |
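To make the coverage ratio and missing-components definitions concrete, here is a small sketch. It assumes the nine headline component names from the Component scores table and treats a value of None (or an absent key) as "not computed". The function name coverage_and_missing is hypothetical, and the example input is made up for illustration.

```python
from typing import Dict, List, Optional, Tuple

HEADLINE_COMPONENTS = (
    "risk_factor_intensity_score",
    "disclosure_change_score",
    "mdna_uncertainty_score",
    "legal_regulatory_risk_score",
    "liquidity_stress_score",
    "boilerplate_risk_score",
    "internal_controls_risk_score",
    "event_severity_score",
    "tone_negativity_score",
)


def coverage_and_missing(
    components: Dict[str, Optional[float]],
) -> Tuple[float, List[str]]:
    """Return (coverage ratio, names of headline components that are null)."""
    missing = [
        name for name in HEADLINE_COMPONENTS if components.get(name) is None
    ]
    computed = len(HEADLINE_COMPONENTS) - len(missing)
    return computed / len(HEADLINE_COMPONENTS), missing


# Illustrative input: two components are null, as might happen
# when there is no prior filing to diff against.
scores = {name: 40.0 for name in HEADLINE_COMPONENTS}
scores["disclosure_change_score"] = None
scores["event_severity_score"] = None

ratio, missing = coverage_and_missing(scores)
print(f"coverage={ratio:.3f}, missing={missing}")
```

With seven of nine components computed, the ratio is 7/9, and the two null names are returned in table order.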

Pipeline

| Term | Meaning |
| --- | --- |
| Deterministic scoring | Scores from word lists, diffs, flags, and densities only; no LLM. |
| Prior filing | Earlier comparable filing (same form type) used for section diffs. |
| Language delta | Change in a tone ratio vs the prior section (percentage points). |
| Boilerplate | Fixed-phrase repetition proxy in Item 1A (see Research Foundation). |
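As a worked illustration of a language delta, the sketch below assumes a tone ratio is expressed as a fraction between 0 and 1 (for example, tone-word count divided by total words). That representation, and the function name language_delta_pp, are assumptions for this example rather than documented behavior.

```python
def language_delta_pp(current_ratio: float, prior_ratio: float) -> float:
    """Change in a tone ratio versus the prior section, in percentage points.

    ASSUMPTION: both ratios are fractions in [0, 1], so the difference
    is multiplied by 100 to express it in percentage points.
    """
    return (current_ratio - prior_ratio) * 100.0


# A section whose tone ratio moves from 4.0% to 5.5% has a delta of
# about +1.5 percentage points (not +37.5%, which would be the relative change).
delta = language_delta_pp(current_ratio=0.055, prior_ratio=0.040)
print(f"{delta:+.1f} pp")
```

Reporting the change in percentage points rather than as a relative percentage keeps small prior ratios from producing inflated deltas.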

Sections

| Term | Meaning |
| --- | --- |
| Section name | Stable ID such as item_1a_risk_factors, item_7_mdna. See Section Taxonomy. |
| Required sections | Sections needed for full coverage on a form (10-K: Item 1A + Item 7 MD&A). |
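A required-sections check for a 10-K might look like the sketch below. It uses the two section IDs quoted above. The REQUIRED_SECTIONS mapping and the missing_required_sections helper are hypothetical names, and only the 10-K entry is filled in because that is the only form spelled out here.

```python
from typing import Dict, FrozenSet, Iterable, List

# Hypothetical mapping; only the 10-K requirement is stated in this glossary.
REQUIRED_SECTIONS: Dict[str, FrozenSet[str]] = {
    "10-K": frozenset({"item_1a_risk_factors", "item_7_mdna"}),
}


def missing_required_sections(form: str, extracted: Iterable[str]) -> List[str]:
    """Return the required section IDs absent from `extracted`, sorted by name."""
    if form not in REQUIRED_SECTIONS:
        raise ValueError(f"no required-section list known for form {form!r}")
    return sorted(REQUIRED_SECTIONS[form] - set(extracted))


# A 10-K where only Item 1A was extracted is missing the MD&A section,
# which is the situation that lowers the score coverage ratio.
print(missing_required_sections("10-K", ["item_1a_risk_factors"]))
```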

Versions

| Artifact | Current value |
| --- | --- |
| Parser | section_extractor_v1 |
| Metrics engine | text_metrics_v3 |
| Scoring model | deterministic_scoring_v2 |
| Dictionary | built_in_dictionaries_v3 |

See Changelog for history.