Free Dataset - Companies
Following SavvyIQ's acquisition of BigPicture technology, our team is building the next generation of business intelligence APIs with significantly enhanced capabilities. While this free dataset remains available, our paid offerings now include:
- 265M+ government-verified entities (vs 17M in free dataset)
- Deep industry classification (NAICS codes, business models, and products & services)
- Legal entity information and corporate hierarchies
- Works for global businesses - with or without web presence
Overview
Current dataset: 2024 Q2
Updated: 05/06/2024
This collection of data includes over seventeen million global companies. The dataset has information such as a company's name, website domain, size, year founded, industry, city/state, country and the handle of their LinkedIn URL.
To download the data, create a free account.
Note: This is a raw, source dataset. This means that the website domains may not be reliable - a domain may not resolve (e.g. company no longer exists), the domain redirects (e.g. company was acquired), or the domain is a free email provider (e.g. gmail.com). If resolving the domain is important to your use case, our paid datasets handle these issues.
Fields
Field | Field Type | Description | Example |
---|---|---|---|
handle | string | This is the unique handle from the company's LinkedIn profile. To get the full URL simply prepend "https://linkedin.com/". | company/uber-com |
name | string | The company's name. | Uber |
domain | string | This company's website domain name. | uber.com |
website | string | This company's website. | https://www.uber.com |
industry | enum (string) | The self-reported industry - the enum is from LinkedIn's standard industries. | Software Development |
size | enum (string) | A range representing the number of people working at the company - the enum is a normalized size range. | 10K+ |
type | enum (string) | The type of business entity - the enum is from LinkedIn's standard business types. | Public Company |
founded | integer | The year the company was founded. | 2009 |
city | string | The city of the company's current headquarters. | San Francisco |
state | string | The state/region of the company's current headquarters. | California |
country_code | enum (string) | The ISO alpha-2 country code of the company's current headquarters. | US |
Stats
This section provides field-level summary statistics for the dataset.
Field | Count | Numeric | Fill Percentage |
---|---|---|---|
handle | 19,486,334 | 19486334 | 100.00% |
name | 18,436,171 | 18436171 | 94.61% |
domain | 14,644,979 | 14644979 | 75.16% |
website | 14,667,470 | 14667470 | 75.27% |
industry | 16,951,045 | 16951045 | 86.99% |
size | 15,310,672 | 15310672 | 78.57% |
type | 12,461,530 | 12461530 | 63.95% |
founded | 9,089,364 | 9089364 | 46.64% |
city | 14,560,842 | 14560842 | 74.72% |
state | 13,087,225 | 13087225 | 67.16% |
country_code | 15,068,831 | 15068831 | 77.33% |
Field Comparison to Paid Dataset
Below is how the free dataset compares to our paid datasets and API.
API Field | Free Dataset Field | Free | Paid |
---|---|---|---|
name | name | X | X |
legalName | X | ||
aliases | X | ||
url | X | ||
domain | website | X | X |
domainAliases | X | ||
logo | X | ||
tags | X | ||
tech | X | ||
phone | X | ||
ticker | X | ||
type | type | X | X |
foundedYear | founded | X | X |
description | X | ||
emailProvider | X | ||
metrics.raised | X | ||
metrics.employees | X | ||
metrics.employeesRange | size | X | X |
metrics.marketCap | X | ||
metrics.annualRevenue | X | ||
metrics.estimatedAnnualRevenue | X | ||
metrics.trancoRank | X | ||
metrics.alexaUsRank | X | ||
metrics.alexaGlobalRank | X | ||
category.sector | X | ||
category.industryGroup | X | ||
category.industry | X | ||
category.subIndustry | X | ||
category.naicsCode | X | ||
linkedin.handle | handle | X | X |
linkedin.industry | industry | X | X |
facebook.handle | X | ||
twitter.id | X | ||
twitter.bio | X | ||
twitter.site | X | ||
twitter.avatar | X | ||
twitter.handle | X | ||
twitter.location | X | ||
twitter.followers | X | ||
twitter.following | X | ||
crunchbase.handle | X | ||
location | X | ||
geo.streetNumber | X | ||
geo.streetName | X | ||
geo.subPremise | X | ||
geo.city | city | X | X |
geo.state | state | X | X |
geo.stateCode | X | ||
geo.postalCode | X | ||
geo.country | X | ||
geo.countryCode | country_code | X | X |
indexedAt | X |
License
Open Data Commons Attribution License (ODC-By)
Copyright (c) Big Picture Technologies, Inc.
This BigPicture Free Company Dataset is made available under the Open Data Commons Attribution License: http://opendatacommons.org/licenses/by/1.0/