Employment, salary and firm data from IBGE’s Cadastro Central de Empresas (CEMPRE). This comprehensive dataset provides information on companies and other organizations registered with Brazil’s tax authority (Receita Federal), including employment levels, wage information, and business establishment data across Brazilian municipalities and sectors.
The CEMPRE dataset is one of the most detailed sources of firm-level data in Brazil, covering virtually all formal enterprises and organizations operating in the country.
The CEMPRE dataset includes:
The data is available at three different aggregation levels: - Country Level: Aggregate statistics for all of Brazil - State Level: Data aggregated by state (27 units) - Municipality Level: Data disaggregated to municipality level (5,570+ municipalities)
Data can be retrieved with sector disaggregation or aggregate form: - Sectoral Disaggregation: Detailed breakdown by CNAE 2.0 (main divisions and subdivisions) - Aggregate: Total across all sectors
Options:
dataset: "cempre"
raw_data:
TRUE: Returns the data in its original format from
IBGEFALSE: Returns cleaned and standardized datageo_level:
"country": National aggregate"state": Aggregated by state"municipality": Disaggregated to municipality level
(detailed results)time_period: Specifies the years for which data
will be downloaded (e.g., 2010:2020 for 2010 through
2020)
language:
"pt": Portuguese language (variable names and
labels)"eng": English languagesectors:
TRUE: Data is returned separated and disaggregated by
economic sector (CNAE)FALSE: Data is aggregated across all sectors# download raw data at the country level from 2008 to 2010
data <- load_cempre(
raw_data = TRUE,
geo_level = "country",
time_period = 2008:2010,
language = "eng"
)
# download treated state-level data split by sector in portuguese
data <- load_cempre(
raw_data = FALSE,
geo_level = "state",
time_period = 2008:2010,
language = "pt",
sectors = TRUE
)