Statistics on field coverage, confidence distribution, and dataset composition.

Dataset Snapshot

  • Total contacts: 184,170,424

Confidence Distribution (linkage confidence)

| Attribute | high | moderate | low | Total |
|---|---|---|---|---|
| Phones | 57,670,330 | 15,255,746 | 16,993,623 | 89,919,699 |
| Emails | 138,990,577 | 61,048,407 | 62,578,528 | 262,617,512 |
| Locations | 85,302,522 | 82,081,435 | 218,924,208 | 386,308,165 |
| Socials | 129,100,362 | 18,441,952 | 15,990,665 | 163,532,979 |
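
As a reading aid, the bucket counts above can be turned into shares per attribute. The following is a minimal sketch, not part of the dataset tooling: the names `CONFIDENCE_COUNTS` and `bucket_shares` are hypothetical, and the numbers are copied from the table.

```python
# Counts copied from the Confidence Distribution table.
# The dictionary layout and names here are illustrative assumptions.
CONFIDENCE_COUNTS = {
    "Phones": {"high": 57_670_330, "moderate": 15_255_746, "low": 16_993_623, "total": 89_919_699},
    "Emails": {"high": 138_990_577, "moderate": 61_048_407, "low": 62_578_528, "total": 262_617_512},
    "Locations": {"high": 85_302_522, "moderate": 82_081_435, "low": 218_924_208, "total": 386_308_165},
    "Socials": {"high": 129_100_362, "moderate": 18_441_952, "low": 15_990_665, "total": 163_532_979},
}

BUCKETS = ("high", "moderate", "low")


def bucket_shares(row):
    """Return each confidence bucket as a fraction of the row total."""
    total = row["total"]
    return {bucket: row[bucket] / total for bucket in BUCKETS}


if __name__ == "__main__":
    for attribute, row in CONFIDENCE_COUNTS.items():
        # Sanity check: the three buckets should add up to the Total column.
        assert sum(row[b] for b in BUCKETS) == row["total"]
        shares = bucket_shares(row)
        formatted = ", ".join(f"{b}={shares[b]:.1%}" for b in BUCKETS)
        print(f"{attribute}: {formatted}")
```

The totals here count individual phones, emails, locations, and socials rather than contacts, which is why they can exceed the total contact count.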

Coverage Summary

| Field | Type | Description | Nullable | Fill Rate | Count |
|---|---|---|---|---|---|
| legion_id | string | Stable contact identifier | No | 100% | 184,170,424 |
| full_name | string | Best-available full name | No | 100% | 184,170,422 |
| first_name | string | First name | No | 100% | 184,170,406 |
| middle_name | string | Middle name (non-initial) | Yes | 5% | 8,690,502 |
| middle_initial | string | Middle initial (provided or derived) | Yes | 16% | 28,557,132 |
| last_name | string | Last name | No | 100% | 184,169,913 |
| last_initial | string | Last name initial | No | 100% | 184,169,913 |
| suffix | string | Name suffix | Yes | 1% | 2,235,681 |
| prefix | string | Name prefix | Yes | 0% | 112,811 |
| sex | enum | Sex value | Yes | 90% | 165,924,124 |
| birth_date | string | Birth date (precision preserved) | Yes | 34% | 61,990,888 |
| birth_year | integer | Birth year | Yes | 34% | 61,990,888 |
| birth_month | integer | Birth month | Yes | 34% | 61,928,373 |
| birth_day | integer | Birth day | Yes | 1% | 1,914,306 |
| age | integer | Derived age | Yes | 34% | 61,990,888 |
| job_title | string | Current job title (derived from experience) | Yes | 62% | 113,350,812 |
| company_name | string | Current company name (derived from experience) | Yes | 60% | 110,315,718 |
| company_domain | string | Current company domain (derived from experience) | Yes | 39% | 71,582,892 |
| company_industry | string | Current company industry (from enriched company data) | Yes | 37% | 68,615,151 |
| company_size | enum | Current company size (see enums/company_size.csv) | Yes | 40% | 74,013,685 |
| seniority_level | enum | Seniority level (c_level/owner/partner/vp/director/manager/senior/junior/training/intern) | Yes | 27% | 49,764,605 |
| job_function | enum | Job function (engineering/sales/marketing/healthcare/education/etc.) | Yes | 62% | 113,333,674 |
| expense_category | enum | P&L expense category (general_and_administrative/research_and_development/sales_and_marketing/cost_of_services/not_applicable) | Yes | 62% | 113,333,674 |
| is_decision_maker | boolean | True if seniority is c_level/owner/partner/vp/director | Yes | 100% | 184,170,424 |
| years_of_experience | integer | Years since earliest job start date | Yes | 39% | 71,591,342 |
| avg_tenure_months | number | Average job tenure in months across all experience entries | Yes | 39% | 71,549,552 |
| highest_degree_level | enum | Highest education level (doctorate/masters/bachelors/associates/high_school) | Yes | 26% | 47,801,448 |
| work_email | string | Current work email (professional, current=true) | Yes | 28% | 51,263,585 |
| mobile_phone | string | Current mobile phone (mobile, current=true) | Yes | 24% | 44,191,513 |
| city | string | Current city (from current location) | Yes | 76% | 139,264,298 |
| state | string | Current state/province (from current location) | Yes | 76% | 139,471,780 |
| state_code | string | Current state ISO 3166-2 code (e.g., US-CA, GB-ENG) | Yes | 76% | 139,471,779 |
| country | string | Current country (from current location) | Yes | 76% | 139,516,185 |
| country_code | string | Current country ISO 3166-1 alpha-2 code (e.g., US, GB) | Yes | 76% | 139,516,185 |
| linkedin_url | string | Primary LinkedIn profile URL (current=true) | Yes | 70% | 128,497,988 |
| headline | object | Object containing headline text and raw snippets | No | 64% | 118,218,188 |
| summary | object | Object containing summary text and raw snippets | Yes | 20% | 36,193,834 |
| num_sources | integer | Count of contributing sources | No | 100% | 184,170,424 |
| last_seen | string | Record last-seen date | Yes | 96% | 176,551,578 |
| build_version | string | Build version identifier | No | 100% | 184,170,424 |
| skills[] | array | Array of skills objects | No | 25% | 45,140,683 |
| skills[].raw[] | array | Skills raw snippets | No | 25% | 45,140,683 |
| skills[].cleaned | string | Skills text | Yes | 25% | 45,140,683 |
| languages[] | array | Array of languages objects | No | 11% | 19,922,659 |
| languages[].raw[] | array | Languages raw snippets | No | 11% | 19,922,659 |
| languages[].cleaned | string | Languages text | Yes | 11% | 19,922,659 |
| languages[].proficiency | enum | Languages proficiency | Yes | 3% | 5,637,461 |
| headline.cleaned | string | Headline text | Yes | 64% | 118,218,188 |
| headline.raw[] | array | Headline raw snippets | No | 64% | 118,218,308 |
| summary.cleaned | string | Summary text | Yes | 20% | 36,193,834 |
| summary.raw[] | array | Summary raw snippets | No | 20% | 36,193,834 |
| phones[] | array | Contact has ≥1 phone | No | 38% | 70,287,175 |
| phones[].type | enum | Phone type | No | 38% | 70,287,175 |
| phones[].number | string | Phone number present | No | 38% | 70,287,175 |
| phones[].current | boolean | Phone flagged current | No | 38% | 70,287,175 |
| phones[].confidence | enum | Phone confidence bucket | No | 38% | 70,287,175 |
| phones[].num_sources | integer | Number of sources that contributed to this phone | No | 38% | 70,287,175 |
| phones[].last_seen | string | Phone last-seen date | Yes | 31% | 57,447,889 |
| emails[] | array | Contact has ≥1 email | No | 80% | 148,059,577 |
| emails[].address | string | Email address | No | 80% | 148,059,577 |
| emails[].type | enum | Email type | Yes | 80% | 148,059,577 |
| emails[].current | boolean/null | Current status (null for personal emails) | Yes | 80% | 148,059,577 |
| emails[].validated | boolean | Email validated | No | 80% | 148,059,577 |
| emails[].validation_status | enum | Validation status | Yes | 15% | 28,527,909 |
| emails[].confidence | enum | Email confidence bucket | No | 80% | 148,059,577 |
| emails[].num_sources | integer | Number of sources that contributed to this email | No | 80% | 148,059,577 |
| emails[].last_seen | string | Email last-seen date | Yes | 74% | 137,153,075 |
| emails[].hash_sha256 | string | SHA-256 hash of normalized email address | Yes | 80% | 148,059,577 |
| emails[].hash_sha1 | string | SHA-1 hash of normalized email address | Yes | 80% | 148,059,577 |
| emails[].hash_md5 | string | MD5 hash of normalized email address | Yes | 80% | 148,059,577 |
| locations[] | array | Contact has ≥1 normalized locations | No | 100% | 184,170,424 |
| locations[].street_address | string | Street line present | Yes | 82% | 151,099,048 |
| locations[].address_line_2 | string | Address line 2 present | Yes | 18% | 32,578,304 |
| locations[].city | string | City present | Yes | 100% | 183,798,490 |
| locations[].state | string | State/province name (lowercase) | Yes | 100% | 183,779,237 |
| locations[].state_code | string | ISO 3166-2 code (e.g., US-NY) | Yes | 100% | 183,775,998 |
| locations[].country | string | Country name (lowercase) | Yes | 100% | 184,121,378 |
| locations[].country_code | string | ISO 3166-1 alpha-2 (e.g., US) | Yes | 100% | 184,121,365 |
| locations[].postal_code | string | Postal code present | Yes | 86% | 158,357,428 |
| locations[].continent | string | Continent name | Yes | 100% | 184,121,364 |
| locations[].continent_code | string | Continent code | Yes | 100% | 184,121,364 |
| locations[].raw[] | array | Raw location from the source | No | 100% | 184,170,423 |
| locations[].current | boolean | Location flagged current | No | 100% | 184,170,424 |
| locations[].confidence | enum | Location confidence bucket | Yes | 100% | 184,170,424 |
| locations[].num_sources | integer | Number of sources that contributed to this location | No | 100% | 184,170,424 |
| locations[].last_seen | string | Location last-seen date | Yes | 84% | 154,918,282 |
| experience[] | array | Experience entries | No | 65% | 120,248,797 |
| experience[].title | object | Experience title object | Yes | 65% | 120,248,797 |
| experience[].title.cleaned | string | Experience title cleaned value | Yes | 65% | 120,248,797 |
| experience[].title.raw[] | array | Experience title raw snippets | Yes | 65% | 120,248,797 |
| experience[].organization | object | Experience organization object | Yes | 65% | 120,248,797 |
| experience[].organization.name | object | Experience organization name object | Yes | 65% | 120,248,797 |
| experience[].organization.name.cleaned | string | Experience organization name cleaned value | Yes | 64% | 118,041,271 |
| experience[].organization.name.raw[] | array | Experience organization name raw snippets | Yes | 64% | 118,240,842 |
| experience[].organization.website | string | Experience organization website URL | Yes | 47% | 87,271,423 |
| experience[].organization.linkedin_url | string | Experience organization LinkedIn URL | Yes | 49% | 90,742,614 |
| experience[].organization.industry | string | Experience organization industry (from enriched company data) | Yes | 46% | 85,169,414 |
| experience[].organization.size | enum | Experience organization size (see enums/company_size.csv) | Yes | 48% | 89,239,630 |
| experience[].start_date | string | Experience start date | Yes | 39% | 71,600,421 |
| experience[].end_date | string | Experience end date | Yes | 30% | 54,788,344 |
| experience[].current | boolean | Experience current flag | Yes | 39% | 71,598,646 |
| experience[].tenure_months | integer | Job tenure in months (calculated from start_date and end_date) | Yes | 39% | 71,591,342 |
| experience[].seniority_level | enum | Seniority level classification (see enums/seniority_level.csv) | Yes | 37% | 68,849,953 |
| experience[].job_function | enum | Job function classification (see enums/job_function.csv) | Yes | 65% | 120,233,042 |
| experience[].expense_category | enum | P&L expense category (see enums/expense_category.csv) | Yes | 65% | 120,233,042 |
| experience[].is_decision_maker | boolean | True if seniority is c_level/owner/partner/vp/director | Yes | 65% | 120,248,797 |
| experience[].description | object | Experience description object | Yes | 65% | 120,248,797 |
| experience[].description.cleaned | string | Experience description text | Yes | 23% | 43,109,956 |
| experience[].description.raw[] | array | Experience description raw snippets | Yes | 23% | 43,119,623 |
| education[] | array | Education entries | No | 37% | 68,475,827 |
| education[].organization | object | Education organization object | Yes | 37% | 68,475,827 |
| education[].organization.name | object | Education organization name object | Yes | 37% | 68,475,827 |
| education[].organization.name.cleaned | string | Education organization name cleaned value | Yes | 37% | 68,475,826 |
| education[].organization.name.raw[] | array | Education organization name raw snippets | Yes | 37% | 68,475,827 |
| education[].organization.linkedin_url | string | Education organization LinkedIn URL | Yes | 35% | 64,710,227 |
| education[].degree | object | Education degree object | Yes | 37% | 68,475,827 |
| education[].degree.cleaned | string | Education degree cleaned value | Yes | 29% | 53,673,110 |
| education[].degree.raw[] | array | Education degree raw snippets | Yes | 29% | 53,673,118 |
| education[].degree_level | enum | Degree level (see enums/degree_level.csv) | Yes | 26% | 47,803,303 |
| education[].field_of_study | object | Field of study object | Yes | 37% | 68,475,827 |
| education[].field_of_study.cleaned | string | Field of study cleaned value | Yes | 28% | 52,379,067 |
| education[].field_of_study.raw[] | array | Field of study raw snippets | Yes | 28% | 52,379,072 |
| education[].start_date | string | Education start date | Yes | 31% | 57,571,177 |
| education[].end_date | string | Education end date | Yes | 31% | 57,931,270 |
| education[].current | boolean | Whether currently enrolled (True=ongoing, False=completed, null=unknown) | Yes | 37% | 68,475,827 |
| socials[] | array | Social links | No | 73% | 133,536,198 |
| socials[].network | enum | Social network | No | 73% | 133,536,198 |
| socials[].url | string | Social URL | No | 73% | 133,536,198 |
| socials[].id | string | Social ID | Yes | 65% | 120,023,374 |
| socials[].username | string | Social username | Yes | 73% | 133,536,198 |
| socials[].current | boolean | Social is current | No | 73% | 133,536,198 |
| socials[].confidence | enum | Social confidence bucket | No | 73% | 133,536,198 |
| socials[].num_sources | integer | Number of sources that contributed to this social | No | 73% | 133,536,198 |
| socials[].last_seen | string | Social last-seen date | Yes | 58% | 106,869,089 |
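
The Fill Rate column appears to be the Count column divided by the total contact count (184,170,424), rounded to the nearest whole percent. That relationship is inferred from the published numbers, not stated by the source, so treat the sketch below as an assumption. The names `TOTAL_CONTACTS` and `fill_rate_percent` are hypothetical.

```python
# Total contacts, copied from the Dataset Snapshot.
TOTAL_CONTACTS = 184_170_424


def fill_rate_percent(count, total=TOTAL_CONTACTS):
    """Return the share of contacts with a populated field, as a whole percent.

    Assumption: fill rate = count / total contacts, rounded to the nearest
    integer. This reproduces the sampled rows below but is not documented.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100 * count / total)


if __name__ == "__main__":
    # (field, count) pairs copied from the Coverage Summary table.
    samples = [
        ("middle_name", 8_690_502),
        ("middle_initial", 28_557_132),
        ("prefix", 112_811),
        ("last_seen", 176_551_578),
    ]
    for field, count in samples:
        print(f"{field}: {fill_rate_percent(count)}%")
```

Under this reading, a 0% fill rate such as `prefix` means fewer than 0.5% of contacts, not zero records, and a 100% fill rate can still leave a handful of records unpopulated (for example `full_name`, which is two short of the total).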