# Valan — Procurement Intelligence · Public Sample Bundle

**Public evaluation sample · Feed snapshot 2026-05-30 · Valan Technologies Limited**

A point-in-time, entity-resolved slice of Valan's global government-procurement
dataset: awards linked to the awarded company's tradable security, with the
point-in-time ticker and compliance rules applied. Schema reference:
[https://valan.io/data-dictionary](https://valan.io/data-dictionary) → the **`fin_awards`** table.

---

## Files

| File | Notes |
|---|---|
| `valan_sample_1k_fin_awards.parquet` / `.csv` | 1,000-row US-heavy investable slice. Identical content in both formats. |
| `valan_sample_loader.py` | Reference loader. Applies point-in-time (PIT) and compliance rules; keeps PIT and current tickers strictly separate. |
| [Data dictionary](https://valan.io/data-dictionary) | Full column reference. |

Run: `python valan_sample_loader.py <path>` · requires `pandas`, `pyarrow` (`duckdb` optional).

---

## Scope — read first

This is a **curated investable-only slice** (`investable_flag = True` on every
row). In the full universe only **~17%** of awards (12.3M of 71.9M) carry a
tradable ticker, so **do not extrapolate coverage from this file.** It is built
to show the schema, the identity resolution, and the point-in-time mechanics —
not the live hit-rate.
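
As a quick sanity check on the scope note above, the sketch below computes the investable share of a frame and contrasts it with the full-universe figure quoted in this README. The three-row frame is synthetic and merely stands in for the sample file, and `check_investable_only` is a hypothetical helper name, not part of the loader.

```python
import pandas as pd


def check_investable_only(df: pd.DataFrame) -> float:
    """Return the share of rows flagged investable (expected 1.0 for this slice)."""
    return float(df["investable_flag"].mean())


# Synthetic stand-in for the sample file (illustrative values only).
sample = pd.DataFrame({"investable_flag": [True, True, True]})
sample_share = check_investable_only(sample)

# Full-universe share quoted in this README: 12.3M of 71.9M awards.
universe_share = 12.3 / 71.9

print(f"sample: {sample_share:.0%}, full universe: {universe_share:.1%}")
```

On the real file, the same helper can be applied to the frame returned by `pd.read_parquet(...)`; any value below 1.0 would indicate the file is not the curated slice described here.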

## Point-in-time / forward-bias — read before backtesting

Two distinct tradability concepts; do not conflate them:

- **`ticker_as_of` with `pit_confirmed = True`** — the ticker **as of the award
  date**, sourced from a genuine dated listing window. **Forward-bias-free.
  Use this for backtests.**
- **`ultimate_parent_ticker`** (parent rollup) — the supplier's **current**
  ownership link (who owns it *today*), **not** as of the award date. Useful
  for screening, but it carries look-ahead bias, so keep it out of backtests.
  The loader surfaces this split directly rather than hiding it inside the
  rollup.
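
The split described above can be sketched as two separate selections. This is a minimal illustration, not the loader's actual code: `backtest_universe` and `screening_universe` are hypothetical helper names, the rows are synthetic, and `pit_confirmed` is assumed to be a plain boolean column.

```python
import pandas as pd


def backtest_universe(df: pd.DataFrame) -> pd.DataFrame:
    """Rows with a ticker confirmed as of the award date (forward-bias-free)."""
    mask = df["pit_confirmed"].eq(True) & df["ticker_as_of"].notna()
    # The current parent rollup is deliberately left out of this view.
    return df.loc[mask, ["award_date", "ticker_as_of"]]


def screening_universe(df: pd.DataFrame) -> pd.DataFrame:
    """Rows with a current parent ticker (look-ahead present; screening only)."""
    mask = df["ultimate_parent_ticker"].notna()
    return df.loc[mask, ["award_date", "ultimate_parent_ticker"]]


# Synthetic rows for illustration only; tickers are made up.
awards = pd.DataFrame(
    {
        "award_date": pd.to_datetime(["2021-03-01", "2019-06-15", "2023-01-10"]),
        "ticker_as_of": ["AAA", None, "CCC"],
        "pit_confirmed": [True, False, True],
        "ultimate_parent_ticker": ["AAA", "BBB", None],
    }
)

bt = backtest_universe(awards)
sc = screening_universe(awards)
print(bt)
print(sc)
```

Returning two frames with disjoint ticker columns makes it harder to join a current ownership link into a historical signal by accident.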

---

## Other usage rules

- **Currency** — `award_value` is in the **local** currency (`currency` column;
  this slice spans AUD, BRL, CZK, EUR, HUF, PLN, USD). **Never sum across
  currencies.**
- **Real obligated value** — use `obligated_awards()`: `value_type = 'award'`,
  positive `award_value`, and IDIQ/framework **ceilings excluded** (rows flagged `value_is_ceiling`).
- **Schema** — this slice carries the full **`fin_awards`** identity + financial
  block: `financial_quality_score`, `is_framework`, `ticker_confidence`,
  `ticker_exchange`, `ticker_mic`, `ultimate_parent_lei`, `linked_tender_id`,
  `linked_prime_id`. `award_value` is numeric and `award_date` is a date.
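
The currency and obligated-value rules above might be combined as follows. This is a hedged sketch of the stated filter, not the loader's `obligated_awards()` implementation: the helper names are hypothetical, the rows are synthetic, `value_is_ceiling` is assumed to be boolean, and `'estimate'` is a made-up non-award `value_type` used only to show the filter.

```python
import pandas as pd


def obligated_awards_sketch(df: pd.DataFrame) -> pd.DataFrame:
    """Keep award rows with a positive value that are not IDIQ/framework ceilings."""
    mask = (
        df["value_type"].eq("award")
        & df["award_value"].gt(0)
        & ~df["value_is_ceiling"].eq(True)
    )
    return df.loc[mask]


def totals_by_currency(df: pd.DataFrame) -> pd.Series:
    """Sum award_value within each currency; never across currencies."""
    return df.groupby("currency")["award_value"].sum()


# Synthetic rows for illustration only.
awards = pd.DataFrame(
    {
        "value_type": ["award", "award", "award", "award", "estimate", "award"],
        "award_value": [100.0, 250.0, 5000.0, -40.0, 300.0, 80.0],
        "currency": ["USD", "USD", "USD", "EUR", "EUR", "EUR"],
        "value_is_ceiling": [False, False, True, False, False, False],
    }
)

totals = totals_by_currency(obligated_awards_sketch(awards))
print(totals)
```

Grouping by `currency` before summing keeps each total in a single unit. Converting to a common currency would need dated FX rates, which this slice is not stated to include.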

## Compliance (verified at export)

PII-clean (email PII removed, verified zero; business-contact phones retained as
published procurement data). **No PRC-sourced data** (`ccgp_*` sources and
`buyer_country = 'CN'` excluded). **RU/BY excluded** on/after 2022-02-24.
CUI/radioactive rows excluded from the standard feed. No sanctioned buyer or
supplier present in this slice.
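
Two of the exclusions above can be spot-checked on a loaded frame. The sketch below counts rows that would violate them. It assumes the RU/BY rule is keyed on `buyer_country` and `award_date`, which this README does not state explicitly; `compliance_violations` is a hypothetical helper and the rows are synthetic.

```python
import pandas as pd

# Cutoff date taken from the compliance statement above.
CUTOFF = pd.Timestamp("2022-02-24")


def compliance_violations(df: pd.DataFrame) -> dict:
    """Count rows that would break the PRC and RU/BY exclusions."""
    dates = pd.to_datetime(df["award_date"])
    prc_buyer = df["buyer_country"].eq("CN")
    # Assumption: the RU/BY exclusion applies to the buyer's country.
    ru_by_late = df["buyer_country"].isin(["RU", "BY"]) & (dates >= CUTOFF)
    return {
        "prc_buyer": int(prc_buyer.sum()),
        "ru_by_on_or_after_cutoff": int(ru_by_late.sum()),
    }


# Synthetic rows for illustration only; an RU row before the cutoff is allowed.
awards = pd.DataFrame(
    {
        "buyer_country": ["US", "RU", "PL"],
        "award_date": ["2023-05-01", "2021-12-31", "2022-02-24"],
    }
)

violations = compliance_violations(awards)
print(violations)
```

Both counts should be zero on the published slice. The `ccgp_*` source exclusion is not checked here because the name of the source column is not given in this README.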

---

*Valan Technologies Limited · Public evaluation sample. The full feed —
71.9M awards, 25M tenders, daily refresh — is available by institutional
arrangement. Questions: john@valan.io*
