71.9M
financial-modelled awards
(fin_awards)
12.3M
investable awards — linked
to a tradable security
25.0M
open tenders
(forward pipeline)
3.3M
subcontract graph edges
(sub → prime linkage)
Global Conventions
- Country codes — ISO 3166-1 alpha-2 (US, GB, DE).
- Currency — ISO 4217 in the
currency column; award_value is in that currency.
- Point-in-time integrity —
ticker_as_of is the ticker as of award_date, populated only from a genuine dated listing window; otherwise NULL (pit_confirmed flags genuineness). Never a current value dressed as as-of.
- Entity join —
supplier_entity_id → entity_dim.entity_id (LEI-bridged).
- Subcontracts —
is_subcontract=TRUE, tier_level>0; linked_prime_id → the prime award; full chain in subcontract_graph.
Compliance / Cleaning (applied to the feed)
- Email PII removed (verified 0). Business-contact phones retained as published procurement data.
- CUI markings redacted; radioactive/CUI rows excluded from the standard feed.
- China excluded (ccgp_* sources + buyer_country='CN'). Russia/Belarus excluded on/after 2022-02-24.
award_value: -1 / 0 stub values normalized to NULL.
- IDIQ/framework ceiling values flagged
value_is_ceiling=TRUE (top-0.1%-of-currency value repeated across ≥10 distinct suppliers) — exclude for real obligated-value analysis.
fin_awards — financial-modelled awards
71,888,513 rows · 12,345,679 investable · the tradeable/analytic award table
| Column | Type | Description |
| award_id | BIGINT | Unique award id (PK). |
| buyer_name | VARCHAR | Contracting public buyer. |
| buyer_country | VARCHAR | Buyer country (ISO-2). |
| supplier_name | VARCHAR | Awarded supplier (email-PII scrubbed). |
| supplier_country | VARCHAR | Supplier country (ISO-2). |
| award_date | DATE | Award date (the point-in-time anchor). |
| award_value | DOUBLE | Obligated value in currency (stub values → NULL). |
| value_type | VARCHAR | award (value>0) / modification (value<-1, de-obligation) / undisclosed (NULL). |
| value_is_ceiling | BOOLEAN | TRUE = IDIQ/framework ceiling, not obligated — exclude for real value analysis. |
| currency | VARCHAR | ISO-4217 currency of award_value. |
| cpv_primary | VARCHAR | Primary CPV procurement category. |
| vpv_code | VARCHAR | Valan harmonised procurement vocabulary code (cross-source category). |
| naics_codes | VARCHAR | NAICS industry code(s). |
| estimated_annual_value | DOUBLE | Modelled annual value (frameworks/IDIQ). |
| financial_quality_score | INTEGER | Data-quality score for financial fields (0–100). |
| is_framework | BOOLEAN | TRUE = framework / IDIQ vehicle. |
| linked_tender_id | BIGINT | FK → originating tender (fin_tenders.tender_id). |
| supplier_entity_id | VARCHAR | Resolved entity key → entity_dim.entity_id (LEI-bridged). |
| lei | VARCHAR | Supplier Legal Entity Identifier. |
| isin | VARCHAR | ISIN of the supplier's primary listing. |
| ticker_as_of | VARCHAR | Point-in-time ticker as of award_date (NULL if no real dated window). |
| ticker_current | VARCHAR | Current/latest ticker. |
| ticker_exchange | VARCHAR | Listing exchange. |
| ticker_mic | VARCHAR | Market Identifier Code (ISO-10383). |
| ticker_confidence | DOUBLE | Confidence of the deterministic name→ticker resolution (0–1). |
| pit_confirmed | BOOLEAN | TRUE = ticker_as_of came from a genuine dated window (0 forward-bias). |
| ultimate_parent_ticker | VARCHAR | Tradable ticker of the ultimate parent (for subsidiaries of listed parents). |
| ultimate_parent_lei | VARCHAR | LEI of the ultimate parent. |
| investable_flag | BOOLEAN | TRUE = a tradable ticker is present (own or ultimate-parent rollup). |
| is_subcontract | BOOLEAN | TRUE = subcontract award. |
| tier_level | SMALLINT | Subcontract tier (0 = prime). |
| linked_prime_id | BIGINT | FK → prime award (subcontracts); see subcontract_graph. |
master_awards — descriptive awards
71,857,265 rows · 4,787,331 investable · broader, text-rich award table (titles/descriptions/URLs)
Same identity/ticker block as fin_awards; award_value / award_date are raw VARCHAR.
| Column | Type | Description |
| award_id | VARCHAR | Unique award id. |
| source | VARCHAR | Origin portal/source system. |
| detail_url | VARCHAR | Link to the source notice. |
| buyer_name / buyer_country / buyer_city | VARCHAR | Buyer org, country (ISO-2), city. |
| supplier_name / supplier_country | VARCHAR | Supplier (PII-scrubbed), country (ISO-2). |
| award_date | VARCHAR | Award date (raw text). |
| award_value | VARCHAR | Award value (raw; stub values → NULL). |
| value_type / value_is_ceiling / currency | MIXED | As fin_awards. |
| cpv_primary / vpv_code / naics_codes | VARCHAR | Category codes. |
| title / short_description / full_description | VARCHAR | Notice text. |
| data_quality_score | VARCHAR | Overall record quality score. |
| [identity + PIT ticker block] | MIXED | supplier_entity_id, lei, isin, ticker_as_of, ticker_current, ticker_exchange, ticker_mic, pit_confirmed, ultimate_parent_ticker, investable_flag, is_subcontract, tier_level — see fin_awards. |
entity_dim — company dimension
312,156 rows · join target for all supplier_entity_id keys
| Column | Type | Description |
| entity_id | VARCHAR | Resolved entity key (PK) ← *.supplier_entity_id. |
| legal_name | VARCHAR | Legal name. |
| country | VARCHAR | Domicile (ISO-2). |
| lei | VARCHAR | Legal Entity Identifier. |
| isin | VARCHAR | Primary listing ISIN. |
| primary_ticker | VARCHAR | Primary ticker. |
| exchange | VARCHAR | Primary exchange. |
| mic | VARCHAR | Market Identifier Code. |
| ultimate_parent_entity_id | VARCHAR | Ultimate parent entity (self-join). |
| parent_ticker | VARCHAR | Ultimate parent's ticker. |
| is_public | BOOLEAN | Publicly listed. |
| is_sanctioned | BOOLEAN | Flagged on a recognised sanctions list. |
subcontract_graph — sub→prime linkage
3,311,541 rows
| Column | Type | Description |
| sub_award_id | BIGINT | Subcontract award (fin_awards.award_id). |
| prime_award_id | BIGINT | The prime award it sits under. |
| tier_level | SMALLINT | Tier depth (1 = first-tier sub). |
| sub_supplier_entity_id | VARCHAR | Subcontractor entity. |
| prime_supplier_name | VARCHAR | Prime supplier name. |
| prime_supplier_entity_id | VARCHAR | Prime entity. |
| prime_ultimate_parent_ticker | VARCHAR | Tradable ticker the prime ultimately rolls up to. |
fin_tenders / master_tenders — forward pipeline
24,984,433 / 24,274,179 rows · open solicitations
| Column | Type | Description |
| tender_id | BIGINT | Unique tender id (PK). |
| source | VARCHAR | Origin portal. |
| buyer_name / buyer_country / buyer_region / buyer_city | VARCHAR | Buyer org / country (ISO-2) / region / city. |
| published_date | DATE/TZ | Publication date. |
| deadline | DATE/TZ | Submission deadline. |
| currency | VARCHAR | Tender currency. |
| contract_type / procurement_method | VARCHAR | Vehicle / method (master_tenders). |
| cpv_primary / vpv_code / naics_codes | VARCHAR | Category codes. |
| title / short_description | VARCHAR | Tender text. |
| tender_status | VARCHAR | Lifecycle status. |
| financial_quality_score / data_quality_score | INTEGER | Quality score. |
company_god — global resolved entity master
72,351,281 rows · internal reference — not part of the standard client feed
The full global entity resolution master (LEI/ISIN/MIC/ticker for all entities, is_sanctioned flagged). entity_dim is the award-touched, client-facing projection of it.
Generated 2026-06-04 from the 2026-05-30 feed manifest and parquet schemas. SHA256 checksums per table are available in the manifest. For data access or institutional enquiries: john@valan.io