> ## Documentation Index
> Fetch the complete documentation index at: https://www.datalegion.ai/docs/llms.txt
> Use this file to discover all available pages before exploring further.

# 1.1.0_2026-03-10

> Company dataset launch, CASS/NCOA address validation, classification accuracy improvements, and new person fields.

## New

### Company Dataset

We're launching a dedicated company dataset. Each company record includes firmographic data (industry, size, domain, LinkedIn profile) and is linked to person records via `company_legion_id`. This is our first release of company data — coverage and depth will expand in upcoming builds.

Person records now include company linkage fields:

* **`company_legion_id`** — Links to the corresponding company record
* **`company_linkedin_url`**, **`company_linkedin_id`** — Employer LinkedIn identifiers
* **`experience.organization.legion_id`** — Each job in a person's history links to its company record

Workforce analytics available on company records:

* **`legion_employee_count`** — headcount derived from person experience data
* **`legion_new_hire_count`** — new hires across 1-month, 3-month, 6-month, and 12-month windows
* **`legion_attrition_count`** — employees who left across the same time windows
* **`legion_employee_growth_rate`** — net growth rate across 1-month, 3-month, 6-month, and 12-month windows
* **`legion_turnover_rate`** — attrition as a percentage of starting headcount across the same time windows
* **`legion_seniority_distribution`** — current employee breakdown by seniority level
* **`legion_job_function_distribution`** — current employee breakdown by job function
* **`legion_expense_category_distribution`** — current employee breakdown by expense category
* **`legion_tenure_distribution`** — current employee breakdown by tenure bucket
* **`legion_education_distribution`** — current employee breakdown by highest degree level
* **`legion_seniority_growth_rate`** — growth rate broken out by seniority level
* **`legion_job_function_growth_rate`** — growth rate broken out by job function
* **`legion_expense_category_growth_rate`** — growth rate broken out by expense category
* **`legion_employee_count_by_month`** — monthly headcount time series with gross hires and departures (up to 60 months)
* **`legion_average_tenure`** — average tenure in months of current employees

### Address Validation (CASS/NCOA)

We've begun processing US mailing addresses through USPS CASS certification and NCOA (National Change of Address). This is an ongoing effort — coverage will increase over the next several builds as we work through the full dataset.

For addresses processed so far:

* Validated to the delivery point level with ZIP+4
* NCOA forwarding moves applied, keeping addresses current for up to 48 months after a move
* Addresses that fail delivery point validation are removed
* Location confidence scoring reflects validation status

### New Person Fields

* **`linkedin_id`** — Numeric LinkedIn member ID
* **`is_platform_worker`** — Identifies gig economy and platform-based workers (rideshare, delivery, freelance marketplace)
* **`postal_code_4`** — ZIP+4 extension for CASS-validated US addresses
* **`current_jobs_last_updated`** — Date when a person's current job information last changed
* **`current_jobs_last_confirmed`** — Date when a source last observed the current job
* **`current_location_last_updated`** / **`current_location_last_confirmed`** — Same pair for location data

***

## Bug Fixes

### Seniority Classification

* Individual contributor roles where "manager" is part of the role name (product manager, project manager, account manager, etc.) were incorrectly classified as `seniority_level = "manager"`. Filtering for managers now returns people with direct reports and budget authority. IC managers get seniority from explicit modifiers — "Senior Product Manager" → `senior`, "Associate Product Manager" → `junior`, "Product Manager" → `null`.
* HR business partners, talent partners, care partners, and similar non-equity roles were incorrectly classified as `seniority_level = "partner"`. Partner now reliably identifies law firm, consulting, and PE/VC partners.
* Sole proprietors now classified as `owner`. Chairman/chairwoman/chairperson now classified as `c_level`.

### Job Function Classification

* Company industry is now used as a disambiguation signal for ambiguous titles.
* Corrected classification for service advisors, call center roles, insurance producers, and other edge cases.

### Top-Level Job Selection

* When a person holds multiple current positions, volunteer and honorary roles (committee member, chapter president, trustee, deacon) were sometimes selected as the primary job. These are now deprioritized so `job_title` and `company_name` reflect the person's primary professional role.

### Address Formatting

* Invalid secondary address lines (e.g., fabricated apartment numbers on single-family homes) are stripped.
